Component library and custom registry built on top of shadcn/ui
Open Source Speech Language Model
A general fine-tuning kit geared toward image/video/audio diffusion
A high-quality rapid TTS voice cloning model
Generate audiobooks from e-books
Web presentation editor replicating many PowerPoint features online
Dockerized FastAPI wrapper for Kokoro-82M text-to-speech model
Qwen3-ASR is an open-source series of ASR models
The official Python library for the OpenAI API
Code and models for ICML 2024 paper, NExT-GPT
Industrial-level controllable zero-shot text-to-speech system
Subtitle Creation Assistant
Build Vision Agents quickly with any model or video provider
Instantly generate AI-powered subtitles on your device
Python library and CLI tool to interface with Google Translate
Cross-platform, customizable ML solutions
State-of-the-art TTS model under 25MB
Speech Note Linux app. Note taking, reading and translating
Privacy first, AI meeting assistant with 4x faster Parakeet/Whisper
A python tool that uses GPT-4, FFmpeg, and OpenCV
The official Python Library for the Groq API
An Open Source text-to-speech system built by inverting Whisper
Official repository for LTX-Video
State-of-the-art diffusion models for image and audio generation
The official Node.js / Typescript library for the Groq API