The agent that grows with you
Unofficial Python API and agentic skill for Google NotebookLM
Fast stable diffusion on CPU and AI PC
Faster Whisper transcription with CTranslate2
The highest-scoring AI memory system ever benchmarked
A sound cloning tool with a web interface, using your voice
Open-source autonomous AI software engineer
RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine
Uses Qwen3-ASR, local LLM, Whisper, TEN-VAD
A high-quality tool for convert PDF to Markdown and JSON
An AI personal assistant for your digital brain
A Python wrapper you can't refuse
A Family of Open Sourced Music Foundation Models
Ready-to-use OCR with 80+ supported languages
Foundation model for image generation
Tokenizer-Free TTS for Multilingual Speech Generation
Text and image to video generation: CogVideoX and CogVideo
World's first open-source, agentic video production system
Automatic Speech Recognition with Word-level Timestamps
Offline Text To Speech synthesis for python
State-of-the-art TTS model under 25MB
Code to accompany "A Method for Animating Children's Drawings"
Powerful tool that lets you create and run intelligent agents
A community-supported supercharged version of paperless
Label Studio is a multi-type data labeling and annotation tool