Faster Whisper transcription with CTranslate2
Official inference repo for FLUX.1 models
Offline Text To Speech synthesis for Python
A high-throughput and memory-efficient inference and serving engine
OBLITERATE THE CHAINS THAT BIND YOU
Python inference and LoRA trainer package for the LTX-2 audio–video
GLM-4.5: Open-source LLM for intelligent agents by Z.ai
Code for running inference and finetuning with SAM 3 model
Toolkit to help you get started with Spec-Driven Development
Automatic Speech Recognition with Word-level Timestamps
Unofficial Python API and agentic skill for Google NotebookLM
Code to accompany "A Method for Animating Children's Drawings"
Build resilient language agents as graphs
Machine learning in Python
Use Microsoft Edge's online text-to-speech service from Python
The highest-scoring AI memory system ever benchmarked
A high-quality tool for converting PDF to Markdown and JSON
Uses Qwen3-ASR, local LLM, Whisper, TEN-VAD
Generate audiobooks from e-books, voice cloning & 1107+ languages
A Python wrapper you can't refuse
Effortless data labeling with AI support from Segment Anything
InvokeAI is a leading creative engine for Stable Diffusion models
Open source AI VTuber platform with voice chat and Live2D avatars
Your Personal AI Assistant; easy to install, deploy locally or in the cloud
An Open Source implementation of Notebook LM with more flexibility