Tooling for the Common Objects In 3D dataset
Block Diffusion for Ultra-Fast Speculative Decoding
Open-weight, large-scale hybrid-attention reasoning model
State-of-the-art Image & Video CLIP, Multimodal Large Language Models
Phi-3.5 for Mac: Locally-run Vision and Language Models
Repo for SeedVR2 & SeedVR
Large language model & vision-language model based on Linear Attention
MedicalGPT: Training Your Own Medical GPT Model with ChatGPT Training
CLIP: Predict the most relevant text snippet given an image
Real-time behaviour synthesis with MuJoCo, using Predictive Control
FAIR Sequence Modeling Toolkit 2
A Production-ready Reinforcement Learning AI Agent Library
Official DeiT repository
Diffusion Transformer with Fine-Grained Chinese Understanding
Open-source large language model family from Tencent Hunyuan
NVIDIA Isaac GR00T N1.5 is the world's first open foundation model
VGGSfM: Visual Geometry Grounded Deep Structure From Motion
Towards Ultimate Expert Specialization in Mixture-of-Experts Language Models
Example Discord bot written in Python that uses the completions API
Dataset of GPT-2 outputs for research in detection, biases, and more
A Conversational Speech Generation Model
Open Multilingual Multimodal Chat LMs
800,000 step-level correctness labels on LLM solutions to MATH problems
PyTorch implementation of VALL-E (Zero-Shot Text-To-Speech)
Learning to Act by Watching Unlabeled Online Videos