LLM
Scalable data pre processing and curation toolkit for LLMs
Chat & pretrained large audio language model proposed by Alibaba Cloud
Generate blog articles from video or audio
Implementation of Imagen, Google's Text-to-Image Neural Network
Lightning-fast, on-device TTS, running natively via ONNX
TTS for Context-Aware Speech Generation and True-to-Life Voice Cloning
tiktoken is a fast BPE tokeniser for use with OpenAI's models
Dockerized FastAPI wrapper for Kokoro-82M text-to-speech model
VITS2 backbone with multilingual-bert
A fast TTS architecture with conditional flow matching
SoTA open-source TTS
Implementation of Video Diffusion Models
A very simple framework for state-of-the-art NLP
Industrial-strength Natural Language Processing (NLP)
Easy-to-use and powerful NLP library with Awesome model zoo
StreamSpeech is a seamless model for offline speech recognition
GLM-4-Voice | End-to-End Chinese-English Conversational Model
Obsei is a low code AI powered automation tool
A full spaCy pipeline and models for scientific/biomedical documents
Evaluate and monitor ML models from validation to production
An Open Source text-to-speech system built by inverting Whisper
Towards Human-Sounding Speech
Open source personal AI Assistant for Linux, Windows and Mac
Open Source Document Management System for Digital Archives