Collection of Gemma 3 variants that are trained for performance
Implementation of "MobileCLIP" CVPR 2024
Official implementation of Watermark Anything with Localized Messages
High-resolution models for human tasks
Video understanding codebase from FAIR for reproducing video models
Tool for exploring and debugging transformer model behaviors
Personalize Any Characters with a Scalable Diffusion Transformer
Genome modeling and design across all domains of life
Achieving 3x+ generation speedup on reasoning tasks
Ultra-Efficient LLMs on End Devices
Ling-V2 is an MoE LLM provided and open-sourced by InclusionAI
Open-source deep-learning framework
Generate Any 3D Scene in Seconds
Fast and Universal 3D reconstruction model for versatile tasks
4M: Massively Multimodal Masked Modeling
This repository contains the official implementation of FastVLM
Foundation Models for Time Series
FAIR Sequence Modeling Toolkit 2
ICLR 2024 Spotlight: curation/training code, metadata, distribution
A Production-ready Reinforcement Learning AI Agent Library
Hackable and optimized Transformers building blocks
GLM-4-Voice | End-to-End Chinese-English Conversational Model
CogView4, CogView3-Plus and CogView3 (ECCV 2024)
Memory-efficient and performant finetuning of Mistral's models
Official implementation of DreamCraft3D