ComfyUI integration for Microsoft's VibeVoice text-to-speech model
Lightning-fast, on-device TTS, running natively via ONNX
Speech-AI-Forge is a project developed around TTS generation model
TTS for Context-Aware Speech Generation and True-to-Life Voice Cloning
Chat & pretrained large vision language model
A Model Context Protocol (MCP) server
A community-supported supercharged version of paperless
Dockerized FastAPI wrapper for Kokoro-82M text-to-speech model
VITS2 backbone with multilingual-bert
A fast TTS architecture with conditional flow matching
21 Lessons, Get Started Building with Generative AI
tiktoken is a fast BPE tokeniser for use with OpenAI's models
text and image to video generation: CogVideoX (2024) and CogVideo
Open source personal AI Assistant for Linux, Windows and Mac
A very simple framework for state-of-the-art NLP
lightweight package to simplify LLM API calls
Easy-to-use and powerful NLP library with Awesome model zoo
StreamSpeech is a seamless model for offline speech recognition
Industrial-level controllable zero-shot text-to-speech system
Industrial-strength Natural Language Processing (NLP)
Obsei is a low code AI powered automation tool
A full spaCy pipeline and models for scientific/biomedical documents
An Open Source text-to-speech system built by inverting Whisper
Evaluate and monitor ML models from validation to production
Towards Human-Sounding Speech