The ChatGPT Retrieval Plugin lets you easily find personal documents
A series of math-specific large language models of our Qwen2 series
Inference framework for 1-bit LLMs
Qwen3-omni is a natively end-to-end, omni-modal LLM
The official PyTorch implementation of Google's Gemma models
Inference code for scalable emulation of protein equilibrium ensembles
Diversity-driven optimization and large-model reasoning ability
A state-of-the-art open visual language model
Official implementation of Watermark Anything with Localized Messages
High-resolution models for human tasks
Towards Real-World Vision-Language Understanding
CLIP, Predict the most relevant text snippet given an image
Ling is a MoE LLM provided and open-sourced by InclusionAI
Personalize Any Characters with a Scalable Diffusion Transformer
Stable Virtual Camera: Generative View Synthesis with Diffusion Models
GLM-4.6V/4.5V/4.1V-Thinking, towards versatile multimodal reasoning
MedicalGPT: Training Your Own Medical GPT Model with ChatGPT Training
Inference script for Oasis 500M
4M: Massively Multimodal Masked Modeling
FAIR Sequence Modeling Toolkit 2
ICLR2024 Spotlight: curation/training code, metadata, distribution
Official DeiT repository
GLM-4-Voice | End-to-End Chinese-English Conversational Model
Ring is a reasoning MoE LLM provided and open-sourced by InclusionAI
A Customizable Image-to-Video Model based on HunyuanVideo