A Powerful Native Multimodal Model for Image Generation
Hackable and optimized Transformers building blocks
tiktoken is a fast BPE tokeniser for use with OpenAI's models
A Customizable Image-to-Video Model based on HunyuanVideo
Open-source deep-learning framework
FAIR Sequence Modeling Toolkit 2
Programmatic access to the AlphaGenome model
Code for running inference with the SAM 3D Body Model 3DB
MapAnything: Universal Feed-Forward Metric 3D Reconstruction
Generating Immersive, Explorable, and Interactive 3D Worlds
The official repo of Qwen chat & pretrained large language model
MedicalGPT: Training Your Own Medical GPT Model with ChatGPT Training
Video Object and Interaction Deletion
Block Diffusion for Ultra-Fast Speculative Decoding
LTX-Video Support for ComfyUI
CodeGeeX2: A More Powerful Multilingual Code Generation Model
Towards Real-World Vision-Language Understanding
Foundation Models for Time Series
Controllable & emotion-expressive zero-shot TTS
An AI-powered security review GitHub Action using Claude
Tongyi Deep Research, the Leading Open-source Deep Research Agent
Qwen3-ASR is an open-source series of ASR models
ChatGLM-6B: An Open Bilingual Dialogue Language Model
HY-Motion model for 3D character animation generation
Sharp Monocular Metric Depth in Less Than a Second