Synthetic data generators for structured and unstructured text
The open-source tool for building high-quality datasets
ThetaGang is an IBKR bot for collecting money
No-code AI workflow
Automatically find issues in image datasets
A tool for semi-automatic cell type classification, harmonization
A minimalist command line knowledge base manager
Easy integration with Athena, Glue, Redshift, Timestream, Neptune
The toolkit to test, validate, and evaluate your models and surface
An open source multi-tool for exploring and publishing data
Data integration platform for ELT pipelines from APIs, databases
Training data (data labeling, annotation, workflow) for all data types
Train machine learning models within Docker containers
Gen-AI Chat for Teams
Streamline your ML workflow
Always know what to expect from your data
Project structure for doing and sharing data science work
Yahoo! Finance market data downloader
Progress bars for threading and multiprocessing tasks on terminal
The common language for platforms, agents and businesses.
Self-hosted platform to unify wearable health data
Make your own running home page
Specification and documentation for the Universal Commerce Protocol
airda(Air Data Agent
The standard data-centric AI package for data quality and ML