The official Python library for the OpenAI API
Controllable & emotion-expressive zero-shot TTS
A single Gradio + React WebUI with extensions for ACE-Step
A React-based starter app for using the Live API over WebSockets
A Web UI for easy subtitle generation using the Whisper model
Component library and custom registry built on top of shadcn/ui
A general fine-tuning kit geared toward image/video/audio diffusion
Python library and CLI tool to interface with Google Translate
Python inference and LoRA trainer package for the LTX-2 audio–video
A high-quality rapid TTS voice cloning model
Build Vision Agents quickly with any model or video provider
Qwen3-ASR is an open-source series of ASR models
Code and models for the ICML 2024 paper NExT-GPT
A Python tool that uses GPT-4, FFmpeg, and OpenCV
The official Python library for the Groq API
Subtitle Creation Assistant
Spring AI Alibaba examples for building and testing AI apps
Cross-platform, customizable ML solutions
Dockerized FastAPI wrapper for the Kokoro-82M text-to-speech model
Industrial-level controllable zero-shot text-to-speech system
Qwen3-TTS is an open-source series of TTS models
The official Python SDK for the ElevenLabs API
Instantly generate AI-powered subtitles on your device
Privacy-first AI meeting assistant with 4x faster Parakeet/Whisper
A suite of advanced multi-modal LLMs