Code for running inference and finetuning with SAM 3 model
Style-Bert-VITS2: Bert-VITS2 with more controllable voice styles
Crowdsourcing platform for full text transcription and tagging
A powerful text/code editor for Android
A pure-python PDF library capable of splitting, merging, cropping
State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX
A Python utility / library to sort imports
Cut videos with a text editor
PDF to Markdown with vision models
The python library for real-time communication
Robust Speech Recognition via Large-Scale Weak Supervision
A TTS that fits in your CPU (and pocket)
Converts text to speech in realtime
Open source no-code system for text annotation and building of text
Automatic Speech Recognition with Word-level Timestamps
An open-source toolkit for monitoring Language Learning Models (LLMs)
Qwen3-TTS is an open-source series of TTS models
Ready-to-use OCR with 80+ supported languages
High-Quality Voice Cloning TTS for 600+ Languages
A Python toolbox for gaining geometric insights
The behavior guidance framework for customer-facing LLM agents
A minimalist command line knowledge base manager
Official inference repo for FLUX.2 models
Speech recognition module for Python
A simple tool for reading in poorly redacted documents