Nigeria No1. Music site And Complete Entertainment portal for Music Promotion WhatsApp:- +2349077287056
Saturday 27 April 2024
Show HN: Data Bonsai: a Python package to clean your data with LLMs https://bit.ly/3Qp8FfA
Show HN: Data Bonsai: a Python package to clean your data with LLMs I've been doing some data cleaning for my fine tuning projects using LLMs, and decided to just build a package for it as a side project. Check it out here: https://bit.ly/3xY2bOo Some features: - categorization (labelling), transformation and decomposition (text into structured format) - validates llm outputs - batch mode batches up the inputs/outputs so you don't send the prompt (schema, fewshot examples) for every row of data, saving a significant amount of tokens There are some similarities to the Instructor repo, but this is simpler and made for datasets. Would love any feedback/suggestions (and a star if you like it!) https://bit.ly/3xY2bOo April 27, 2024 at 11:59PM
Labels:
Hacker News
Subscribe to:
Post Comments (Atom)
No comments:
Post a Comment