Haystack is an open-source NLP framework to interact with your data
Scale your Pandas workflows by changing a single line of code
A framework for browser automation and testing with Selenium
Streamline your ML workflow
SQL builder for AWS Athena, inspired by sparkSQL
Bringing all of PostgreSQL's awesomeness to Django
A suite of tools to develop RAG, semantic search, and other AI apps
Build, run, and manage data pipelines for integrating data
The Logfire MCP Server is here
RL implementations
CodiumAI Cover-Agent: An AI-Powered Tool for Automated Test Generation
TikZ figures for concepts in physics/chemistry/ML
Trainable, memory-efficient, and GPU-friendly PyTorch reproduction
Train a 26M-parameter GPT from scratch in just 2h
An open-source RAG-based tool for chatting with your documents
Unofficial Python library for working with the Index API service
Jupyter Notebooks as Markdown Documents, Julia, Python or R scripts
A lightweight opinionated ETL framework, halfway between plain scripts
dude uncomplicated data extraction: A simple framework
Synthetic data generators for tabular and time-series data
Trainable models and NN optimization tools
Create HTML profiling reports from pandas DataFrame objects
airda (Air Data Agent)
Making DAG construction easier
Check links in web documents or full websites