Contexts Optical Compression
Use Microsoft Edge's online text-to-speech service from Python
Visual Causal Flow
Robust Speech Recognition via Large-Scale Weak Supervision
Audiocraft is a library for audio processing and generation
Node.js example app from the OpenAI API quickstart tutorial
Official repository for LTX-Video
Python library and CLI tool to interface with Google Translate
Fast backend for long-term AI user memory via structured profiles
A TTS that fits in your CPU (and pocket)
End-to-end speech processing toolkit
95% token savings. 155x faster queries. 16 languages
Run local LLMs like llama, deepseek, kokoro etc. inside your browser
TextWorld is a sandbox learning environment for the training
Chinese XLNet pre-trained model
Workflow and speech recognition app
Qwen3-ASR is an open-source series of ASR models
General-purpose image editing model that delivers high-fidelity
Lightning-fast, on-device TTS, running natively via ONNX
A suite of advanced multi-modal LLMs
Multi-Agent daTa geneRation Infra and eXperimentation framework
Open-source multi-speaker long-form text-to-speech model
Sora AI Video Generator by Sora.FM
The python library for real-time communication
Improve your resumes with Resume Matcher