A sound cloning tool with a web interface, using your voice
A fast TTS architecture with conditional flow matching
Label Studio is a multi-type data labeling and annotation tool
Text- and image-to-video generation: CogVideoX (2024) and CogVideo
Implementation of "MobileCLIP" CVPR 2024
SoTA open-source TTS
Offline inference engine for art, real-time voice conversations
21 Lessons, Get Started Building with Generative AI
Open-source choice to scale, assess and maintain natural language data
⚡ Building applications with LLMs through composability ⚡
The best ChatGPT that $100 can buy
Text-to-Speech Utility
A graphical manager for Ollama that can manage your LLMs
A web UI for various audio-related neural networks
Python package for easily interfacing with chat apps
Local image generation using VQGAN-CLIP or CLIP-guided diffusion