Sempare Template (scripting) Engine for Delphi
State-of-the-art open-source TTS
Spark-TTS Inference Code
Code for openai.fm, a demo for the OpenAI Speech API
PersonaPlex code
An Open Source text-to-speech system built by inverting Whisper
Long-form streaming TTS system for multi-speaker dialogue generation
A simple, high-quality voice conversion tool focused on ease of use
Offline text-to-speech synthesis for Python
Dockerized FastAPI wrapper for Kokoro-82M text-to-speech model
Gp.nvim (GPT prompt) Neovim AI plugin
Multi-lingual large voice generation model, providing inference
Offline inference engine for art, real-time voice conversations
NLP Cloud serves high performance pre-trained or custom models for NER
Official PyTorch Implementation
Repo of Qwen2-Audio chat & pretrained large audio language model
An opinionated CLI to transcribe audio files with Whisper on-device
A TTS model capable of generating ultra-realistic dialogue
Framework for building neural networks
Instant voice cloning by MIT and MyShell. Audio foundation model
LLM-based Reinforcement Learning audio edit model
Towards Studio-Grade Character Animation via In-Context Learning of 3D
Provides CTP stock options and Zhongtai Securities XTP
Pre-trained Deep Learning models and demos