Powerful AI language model (MoE) optimized for efficiency/performance
Reference PyTorch implementation and models for DINOv3
PyTorch implementation of JiT
DeepSeek-Coder-V2: Breaking the Barrier of Closed-Source Models
PyTorch code and models for the DINOv2 self-supervised learning method
Open-source, high-performance AI model with advanced reasoning
Implementation of "MobileCLIP" (CVPR 2024)
From Images to High-Fidelity 3D Assets
Multimodal model achieving SOTA performance
4M: Massively Multimodal Masked Modeling
Memory-efficient and performant finetuning of Mistral's models
A Customizable Image-to-Video Model based on HunyuanVideo
NVIDIA Isaac GR00T N1.5 is the world's first open foundation model
Official Python inference and LoRA trainer package
An experimental version of DeepSeek model
Language modeling in a sentence representation space
From Vibe Coding to Agentic Engineering
High-resolution models for human tasks
A Systematic Framework for Interactive World Modeling
RGBD video generation model conditioned on camera input
Kimi K2 is the large language model series developed by Moonshot AI
OpenTinker is an RL-as-a-Service infrastructure for foundation models
Video understanding codebase from FAIR for reproducing video models
CLIP: predict the most relevant text snippet given an image
A Family of Open Sourced Music Foundation Models