The standard data-centric AI package for data quality and machine learning with messy, real-world data and labels.
-
Updated
May 26, 2024 - Python
The standard data-centric AI package for data quality and machine learning with messy, real-world data and labels.
Design, conduct and analyze results of AI-powered surveys and experiments. Simulate social science and market research with large numbers of AI agents and LLMs.
Label Studio is a multi-type data labeling and annotation tool with standardized output format
Superpipe - optimized LLM pipelines for structured data
NSF I-Corps Summer 2024 Survey i-star
Client interface for all things Cleanlab Studio
Code accompanying the TOP paper "Predicting the Demographics of Twitter Users with Programmatic Weak Supervision".
🚤 Label data at scale. Fun and precision included.
A curated list of awesome data labeling tools
Segments.ai Python SDK
A machine learning tool for automated prediction engineering. It allows you to easily structure prediction problems and generate labels for supervised learning.
A labeling interface for classifying old visualizations
A labeling interface for segmenting old visualizations
Toloka-Kit is a Python library for working with Toloka API.
The data scientist's open-source choice to scale, assess and maintain natural language data. Treat training data like a software artifact.
A Data Centric NER annotation tool for your Named Entity Recognition projects
A labeler-app for video labeling, for example for machine learning. For timewindows (|....|) and pointactivities (.|.)
Open source annotation tool for machine learning practitioners.
Make drawing and labeling bounding boxes easy as cake
An internationalized highly customizable annotation and evaluation tool for Natural Language Processing (NLP) tasks
Add a description, image, and links to the data-labeling topic page so that developers can more easily learn about it.
To associate your repository with the data-labeling topic, visit your repo's landing page and select "manage topics."