48 kHz stereo neural audio codec for general audio
Optax is a gradient processing and optimization library for JAX
JAX-based neural network library
kaldi-asr/kaldi is the official location of the Kaldi project
A simple native web interface that uses ChatTTS to synthesize text
Unified Multimodal Understanding and Generation Models
Python framework for AI workflows and pipelines with chain of thought
AI Toolkit for Healthcare Imaging
A nearly live implementation of OpenAI's Whisper
Text and image to video generation: CogVideoX and CogVideo
Self-healing browser harness that enables LLMs to complete any task
Open-source NLP guide with models, methods, and real use cases
Open-source abilities for OpenHome agents
Pluggable SOTA multi-object tracking modules for segmentation
A simple, easy-to-hack GraphRAG implementation
ZAPI by Adopt AI is an open-source Python library
Python SDK for Claude Agent
Marrying Grounding DINO with Segment Anything & Stable Diffusion
Multimodal embedding and reranking models built on Qwen3-VL
Lightweight framework for building Agents with memory, knowledge, etc.
Python Package for ML-Based Heterogeneous Treatment Effects Estimation
PaddlePaddle End-to-End Development Toolkit
Generate audiobooks from EPUBs, PDFs and text with captions
Implementation of TurboQuant (ICLR 2026)
Style-Bert-VITS2: Bert-VITS2 with more controllable voice styles