Trained models & code to predict toxic comments
Framework for building real-time voice and multimodal AI agents
LLM Large Model of Selling Anchor
A sound cloning tool with a web interface, using your voice
Open source AI VTuber platform with voice chat and Live2D avatars
Controllable and fast Text-to-Speech for over 7000 languages
Python Audio Analysis Library: Feature Extraction, Classification
Industrial-strength Natural Language Processing (NLP)
AI-powered video clipping and highlight generation
Large Audio Language Model built for natural interactions
An opinionated CLI to transcribe Audio files w/ Whisper on-device
Multi-modal large language model designed for audio understanding
Data manipulation and transformation for audio signal processing
Bring the notion of Model-as-a-Service to life
A very simple framework for state-of-the-art NLP
Low-latency AI inference engine optimized for mobile devices
Build Vision Agents quickly with any model or video provider
Get your documents ready for gen AI
Models for the spaCy Natural Language Processing (NLP) library
Stanford NLP Python library for many human languages
A Conversational Speech Generation Model
A lightning fast audio upsampler
Offline desktop app to convert EPUB to MP3 using Kokoro-82M neural TTS
Software that uses AI to perform real-time voice conversion
Automatically generate and overlay subtitles for any video