An implementation of a deep learning recommendation model (DLRM)
Official DeiT repository
Tencent Hunyuan Multimodal diffusion transformer (MM-DiT) model
Reference PyTorch implementation and models for DINOv3
Volcano Engine Reinforcement Learning for LLMs
Open-source, code-first Python toolkit for building, evaluating, etc.
Python scraper based on AI
Flexible Photo Recrafting While Preserving Your Identity
A SOTA open-source image editing model
Reading book source
Scalable generative AI framework built for researchers and developers
Semantic search and workflows for medical/scientific papers
Real-time voice interactive digital human
The largest collection of PyTorch image encoders / backbones
Industrial-level controllable zero-shot text-to-speech system
Collection of reference environments, offline reinforcement learning
PPTAgent: Generating and Evaluating Presentations
SOTA Open Source TTS
Multi-lingual large voice generation model, providing inference
ContextGem: Effortless LLM extraction from documents
We write your reusable computer vision tools
Interpretable prompting and models for NLP
Fast and Universal 3D reconstruction model for versatile tasks
LLM training in simple, raw C/CUDA
Image processing in Python