A text editor in less than 1000 LOC with syntax highlight and search
A simple interface for working with TeX documents
Lightweight and flexible command-line JSON processor
Extra tools for OpenOffice under weak copyleft or other licenses
Code for running inference and finetuning with SAM 3 model
Contexts Optical Compression
OpenGL text using one vertex buffer, one texture and FreeType
A Family of Open Sourced Music Foundation Models
Code for openai.fm, a demo for the OpenAI Speech API
A Powerful Native Multimodal Model for Image Generation
Qwen3-TTS is an open-source series of TTS models
Official inference repo for FLUX.2 models
Robust Speech Recognition via Large-Scale Weak Supervision
A lightweight text-to-speech model with zero-shot voice cloning
Use Microsoft Edge's online text-to-speech service from Python
Python library and CLI tool to interface with Google Translate
Image generation model with single-stream diffusion transformer
A high-quality rapid TTS voice cloning model
Open source text-to-speech tool, supports extra-long text
Mixture-of-Experts Vision-Language Models for Advanced Multimodal
The home of the ICU project source code
Towards Human-Level Text-to-Speech through Style Diffusion
Audiocraft is a library for audio processing and generation
A robust, efficient, low-latency speech-to-text library
Speech-AI-Forge is a project developed around TTS generation model