Generate Any 3D Scene in Seconds
Build GenAI applications quickly and easily
Advanced techniques for RAG systems
Omnilingual ASR: Open-Source Multilingual Speech Recognition
Implementation of Vision Transformer, a simple way to achieve SOTA
The best ChatGPT that $100 can buy
A Model Context Protocol server for searching and analyzing arXiv
4M: Massively Multimodal Masked Modeling
This repository contains the official implementation of FastVLM
Refer and Ground Anything Anywhere at Any Granularity
Set of tools to assess and improve LLM security
ICLR 2024 Spotlight: curation/training code, metadata, distribution
PyTorch code and models for V-JEPA self-supervised learning from video
A PyTorch library for implementing flow matching algorithms
An implementation of a deep learning recommendation model (DLRM)
ImageBind: One Embedding Space to Bind Them All
Hackable and optimized Transformers building blocks
[CVPR 2025 Best Paper Award] VGGT
Code to accompany "A Method for Animating Children's Drawings"
Anthropic's educational courses
Memory-efficient and performant finetuning of Mistral's models
Official implementation of DreamCraft3D
Ring is a reasoning MoE LLM provided and open-sourced by InclusionAI
Research code artifacts for Code World Model (CWM)
Diffusion Transformer with Fine-Grained Chinese Understanding