ChatGLM2-6B: An Open Bilingual Chat LLM
3D reconstruction software
Supercharge Your LLM with the Fastest KV Cache Layer
Dockerized FastAPI wrapper for Kokoro-82M text-to-speech model
ChatGLM3 series: Open Bilingual Chat LLMs
A Customizable Image-to-Video Model based on HunyuanVideo
Gemma open-weight LLM library, from Google DeepMind
Text- and image-to-video generation: CogVideoX (2024) and CogVideo
ComfyUI integration for Microsoft's VibeVoice text-to-speech model
Running large language models on a single GPU
SGLang is a fast serving framework for large language models
A unified framework for scalable computing
Style-Bert-VITS2: Bert-VITS2 with more controllable voice styles
Multilingual Automatic Speech Recognition with word-level timestamps
Our first fully AI-generated deep learning system
Python package built to ease deep learning on graphs
Interface for OuteTTS models
Open platform for training, serving, and evaluating language models
AI Suite for upscaling, interpolating & restoring images/videos
High quality, fast, modular reference implementation of SSD in PyTorch
Offline desktop app to convert EPUB to MP3 using Kokoro-82M neural TTS
A computer vision framework to create and deploy apps in minutes
Fast Python collaborative filtering for implicit feedback datasets
Lightweight anchor-free object detection model
Facebook AI Research Sequence-to-Sequence Toolkit written in Python