Voice Recognition to Text Tool
3D reconstruction software
2D and 3D Face alignment library build using pytorch
The official repo of Qwen chat & pretrained large language model
Unified web UI for training and running open models locally
Large-Scale Agentic RL for High-Performance CUDA Kernel Generation
A nearly-live implementation of OpenAI's Whisper
Reinforcement Learning for Humanoid Robot with Zero-Shot Sim2Real
Generate audiobooks from e-books
High-Performance Face Recognition Library on PaddlePaddle & PyTorch
LightLLM is a Python-based LLM (Large Language Model) inference
An implementation of a deep learning recommendation model (DLRM)
Self-supervised visual learning using momentum contrast in PyTorch
Trainable, memory-efficient, and GPU-friendly PyTorch reproduction
Reference PyTorch implementation and models for DINOv3
Recipes to train reward model for RLHF
Hackable and optimized Transformers building blocks
Minimal Python framework for scalable AI inference servers fast
Multilingual Document Layout Parsing in a Single Vision-Language Model
A high-performance ML model serving framework, offers dynamic batching
RGBD video generation model conditioned on camera input
The official repository for ERNIE 4.5 and ERNIEKit
Capable of understanding text, audio, vision, video
High-Resolution Image Synthesis with Latent Diffusion Models
A simple, performant and scalable Jax LLM