Language modeling in a sentence representation space
Code for running inference and finetuning with SAM 3 model
Towards Real-World Vision-Language Understanding
Advanced language and coding AI model
Z80-μLM is a 2-bit quantized language model
Ring is a reasoning MoE LLM provided and open-sourced by InclusionAI
MedicalGPT: Training Your Own Medical GPT Model with ChatGPT Training
Chinese LLaMA-2 & Alpaca-2 Large Model Phase II Project
Wan2.1: Open and Advanced Large-Scale Video Generative Model
Official inference repo for FLUX.2 models
Large-language-model & vision-language-model based on Linear Attention
Unified Multimodal Understanding and Generation Models
LLM-based Reinforcement Learning audio edit model
Qwen3-omni is a natively end-to-end, omni-modal LLM
High-resolution models for human tasks
This repository contains the official implementation of FastVLM
A PyTorch library for implementing flow matching algorithms
Open-source framework for intelligent speech interaction
Video understanding codebase from FAIR for reproducing video models
Real-time behaviour synthesis with MuJoCo, using Predictive Control
4M: Massively Multimodal Masked Modeling
Renderer for the harmony response format to be used with gpt-oss
GLM-4.6V/4.5V/4.1V-Thinking, towards versatile multimodal reasoning
GLM-4.6V/4.5V/4.1V-Thinking, towards versatile multimodal reasoning
Chinese LLaMA & Alpaca large language model + local CPU/GPU training