Open source no-code system for text annotation and building of text
Open source annotation tool for machine learning practitioners
Mozc - a Japanese Input Method Editor designed for multi-platform
The behavior guidance framework for customer-facing LLM agents
OCR software, free and offline
TTS for Context-Aware Speech Generation and True-to-Life Voice Cloning
SOTA Open Source TTS
Robust Speech Recognition via Large-Scale Weak Supervision
A minimalist command line knowledge base manager
A Family of Open Sourced Music Foundation Models
Contexts Optical Compression
Implementation of Imagen, Google's Text-to-Image Neural Network
Official inference repo for FLUX.2 models
Edit PDF files with Nano Banana
A Powerful Native Multimodal Model for Image Generation
FastAPI framework, high performance, easy to learn, fast to code
Text and image to video generation: CogVideoX and CogVideo
A generative speech model for daily dialogue
Label Studio is a multi-type data labeling and annotation tool
A simple tool for reading in poorly redacted documents
Qwen3-TTS is an open-source series of TTS models
A fast TTS architecture with conditional flow matching
Cut videos with a text editor
MTEB: Massive Text Embedding Benchmark
Tokenizer-Free TTS for Multilingual Speech Generation