A cross-platform, GPU-accelerated terminal emulator
Fast and memory-efficient exact attention
Voice Recognition to Text Tool
HunyuanVideo: A Systematic Framework For Large Video Generation Model
Enables the best performance on NVIDIA RTX Graphics Cards
RAPIDS Machine Learning Library
DeepEP: an efficient expert-parallel communication library
Meridian is an MMM framework
A high-quality rapid TTS voice cloning model
High-performance CPU, GPU, and memory profiler for Python
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
Fast, flexible and easy to use probabilistic modelling in Python
Sharp Monocular Metric Depth in Less Than a Second
A nearly-live implementation of OpenAI's Whisper
Fast inference engine for Transformer models
Making large AI models cheaper, faster and more accessible
CodeGeeX2: A More Powerful Multilingual Code Generation Model
GPU-accelerated GUI development for Node.js and the browser
ChatGLM2-6B: An Open Bilingual Chat LLM
Software that uses AI to perform real-time voice conversion
The GPU-powered AI application database
Effortless data labeling with AI support from Segment Anything
Unified web UI for training and running open models locally
High-performance neural network inference framework for mobile
A generic, simple and fast implementation of Deepmind's AlphaZero