Build Vision Agents quickly with any model or video provider
A Conversational Speech Generation Model
Towards Studio-Grade Character Animation via In-Context Learning of 3D
Official Python inference and LoRA trainer package
Pre-trained Deep Learning models and demos
A very simple framework for state-of-the-art NLP
Mice speech to text with MX Cinnamon OS ISO
Open source personal AI Assistant for Linux, Windows and Mac
Models for the spaCy Natural Language Processing (NLP) library
Synchronized Translation for Videos
Transforming Multimodal Content into Captivating Multilingual Audio
Stanford NLP Python library for many human languages
An opinionated CLI to transcribe Audio files w/ Whisper on-device
Low-latency AI inference engine optimized for mobile devices
AI framework for automated short video creation and editing tools
Unlimited, private and free Speech-To-Text program
A python tool that uses GPT-4, FFmpeg, and OpenCV
Aligns tokens in two versions of a text with differing tokenization.
SoundTranscriber can be used to generate automatic transcription / aut
Offline desktop app to convert EPUB to MP3 using Kokoro-82M neural TTS
High-quality multi-lingual text-to-speech library by MyShell.ai
mice stt tts
Run GGUF models easily with a UI or API. One File. Zero Install.
Towards Human-Level Text-to-Speech through Style Diffusion
Two Integrated Text To Speech Engines uses MMS & Silero