An AI-powered data science team of agents
Clean Jupyter notebooks of outputs, metadata, and empty cells
Image polygonal annotation with Python
ExtractThinker is a Document Intelligence library for LLMs
Data and tools for generating and inspecting OLMo pre-training data
Links to everything you'd ever want to learn about data engineering
A natural language interface for computers
Big Model Application Development Practice 1
An end-to-end Data Scientist
Scalable data pre processing and curation toolkit for LLMs
PandasAI is a Python library that integrates generative AI
Refine and quantize messy AI pixel art into clean, perfect pixels
Simplify the maintenance and cleaning of Linux systems.
AI code-writing assistant that understands data content
A free and open-source program to free up disk space
AI agent that streamlines the entire process of data analysis
Clean up of torrent files using the RPC protocal
Data Preprocessing Automation: A GUI for easy data cleaning & visualiz
All-in-one text de-duplication
Mine parameterized URLs from web archives for security testing
Resources, corpora, and tools for Chinese natural language processing
Virtual Assistant Maintenance System
Experiment tracking and metric logging for Amazon SageMaker notebooks