Chinese and English multimodal conversational language model
Open image model at the forefront of design
26M function-calling model that runs on incredibly small devices
Fast-stable-diffusion + DreamBooth
Multimodal embedding and reranking models built on Qwen3-VL
Official implementation of Watermark Anything with Localized Messages
High-resolution models for human tasks
Tool for exploring and debugging transformer model behaviors
CLIP: predict the most relevant text snippet given an image
GLM-4.6V/4.5V/4.1V-Thinking, towards versatile multimodal reasoning
Project Lyra: Open Generative 3D World Models
Pretrained time-series foundation model developed by Google Research
Easy Docker setup for Stable Diffusion with user-friendly UI
Inference script for Oasis 500M
Fast and Universal 3D reconstruction model for versatile tasks
4M: Massively Multimodal Masked Modeling
This repository contains the official implementation of FastVLM
Foundation Models for Time Series
FAIR Sequence Modeling Toolkit 2
A Production-ready Reinforcement Learning AI Agent Library
A PyTorch library for implementing flow matching algorithms
Hackable and optimized Transformers building blocks
Memory-efficient and performant finetuning of Mistral's models
Official implementation of DreamCraft3D
Ring is a reasoning MoE LLM provided and open-sourced by InclusionAI