Open-source AI VTuber platform with voice chat and Live2D avatars
Geometric deep learning extension library for PyTorch
Text and image to video generation: CogVideoX and CogVideo
A set of Docker images for training and serving models in TensorFlow
Tensor Learning in Python
Multilingual Automatic Speech Recognition with word-level timestamps
Library for OCR-related tasks powered by deep learning
Style-Bert-VITS2: Bert-VITS2 with more controllable voice styles
Generate audiobooks from e-books
Clone a voice in 5 seconds to generate arbitrary speech in real-time
OpenAI-style API for open large language models
Sparsity-aware deep learning inference runtime for CPUs
Official inference framework for 1-bit LLMs
SGLang is a fast serving framework for large language models
Towards Human-Sounding Speech
2D and 3D face alignment library built using PyTorch
The largest collection of PyTorch image encoders / backbones
The Triton Inference Server provides an optimized cloud
A simple native web interface that uses ChatTTS to synthesize text
Standardized Serverless ML Inference Platform on Kubernetes
Simplifies the local serving of AI models from any source
ComfyUI integration for Microsoft's VibeVoice text-to-speech model
A lightweight text-to-speech model with zero-shot voice cloning
Gemma open-weight LLM library, from Google DeepMind
Fast State-of-the-Art Static Embeddings