Interface for OuteTTS models
Foundational Models for State-of-the-Art Speech and Text Translation
ComfyUI integration for Microsoft's VibeVoice text-to-speech model
Clone a voice in 5 seconds to generate arbitrary speech in real-time
Large language model for a livestream sales anchor
Inference code for CodeLlama models
A simple native web interface that uses ChatTTS to synthesize text
Large Audio Language Model built for natural interactions
Repo of Qwen2-Audio chat & pretrained large audio language model
Multi-modal large language model designed for audio understanding
In-app assistant SDK to build multimodal conversational UX for websites
From Images to High-Fidelity 3D Assets
Gp.nvim (GPT prompt) Neovim AI plugin
Framework for building neural networks
NLP Cloud serves high-performance pre-trained or custom models for NER
Wan2.1: Open and Advanced Large-Scale Video Generative Model
Offline inference engine for art, real-time voice conversations
Natural speech programming assistant for software developers
CodeGeeX: An Open Multilingual Code Generation Model (KDD 2023)
Build voice-based LLM agents. Modular + open source
High-quality multi-lingual text-to-speech library by MyShell.ai
LLM-based Reinforcement Learning audio edit model
Empowering Code Generation with OSS-Instruct
Code for the paper "Evaluating Large Language Models Trained on Code"