VITS2 backbone with multilingual-bert
A fast TTS architecture with conditional flow matching
tiktoken is a fast BPE tokeniser for use with OpenAI's models
Implementation of Video Diffusion Models
RPG Maker 2000/2003 and EasyRPG games interpreter
CLI tool to extract (meta)data from PDF and manipulate PDF files
Get free HTTPS certificates forever from Let's Encrypt
Scalable data preprocessing and curation toolkit for LLMs
D2 is a modern diagram scripting language that turns text to diagrams
Industrial-strength Natural Language Processing (NLP)
A very simple framework for state-of-the-art NLP
Tools for manipulating datasets
Easy-to-use and powerful NLP library with an awesome model zoo
StreamSpeech is a seamless model for offline speech recognition
Designed for text embedding and ranking tasks
Open source personal AI Assistant for Linux, Windows and Mac
Python 3.7 to JavaScript compiler
Obsei is a low-code, AI-powered automation tool
A full spaCy pipeline and models for scientific/biomedical documents
Transforming Multimodal Content into Captivating Multilingual Audio
An open source text-to-speech system built by inverting Whisper
Extract one-time password (OTP) secrets from QR codes
Evaluate and monitor ML models from validation to production
Towards Human-Sounding Speech
An easy-to-use backup tool for GNU/Linux that uses rsync as its back end