Data processing for and with foundation models
Python Stream Processing
A full spaCy pipeline and models for scientific/biomedical documents
A Repo For Document AI
Han Language Processing
GUI for a Vocal Remover that uses Deep Neural Networks
Robust Speech Recognition via Large-Scale Weak Supervision
Image processing in Python
Real time face swap and one-click video deepfake
Open source libraries and APIs to build custom preprocessing pipelines
The Classical Language Toolkit
An LLM-powered knowledge curation system that researches topics
ExtractThinker is a Document Intelligence library for LLMs
Stable Diffusion web UI
Open Source Differentiable Computer Vision Library
Easy-to-use and high-performance NLP and LLM framework
Underthesea - Vietnamese NLP Toolkit
ReFT: Representation Finetuning for Language Models
Python bindings for llama.cpp
Hub of ready-to-use datasets for ML models
Pretrained model hub for Keras 3
Data and tools for generating and inspecting OLMo pre-training data
The most accurate natural language detection library for Python
The no-nonsense RAG chunking library
Official repository for LTX-Video