Fast and Universal 3D reconstruction model for versatile tasks
Qwen3 is the large language model series developed by the Qwen team
YOLOv5 is the world's most loved vision AI
Industrial-level controllable zero-shot text-to-speech system
The Memory layer for AI Agents
Reference PyTorch implementation and models for DINOv3
CodeGeeX2: A More Powerful Multilingual Code Generation Model
ChatGLM3 series: Open Bilingual Chat LLMs
A comprehensive set of fairness metrics for datasets
Wan2.2: Open and Advanced Large-Scale Video Generative Model
GPT4V-level open-source multi-modal model based on Llama3-8B
Large Multimodal Models for Video Understanding and Editing
FAIR Sequence Modeling Toolkit 2
CodeGeeX4-ALL-9B, a versatile model for all AI software development
CogView4, CogView3-Plus and CogView3 (ECCV 2024)
Offline Text To Speech synthesis for Python
Code for running inference and finetuning with the SAM 3 model
Powerful AI language model (MoE) optimized for efficiency/performance
Open-source, high-performance AI model with advanced reasoning
Wan2.1: Open and Advanced Large-Scale Video Generative Model
Official inference repo for FLUX.2 models
MARS5 speech model (TTS) from CAMB.AI
PyTorch code and models for the DINOv2 self-supervised learning
Python inference and LoRA trainer package for the LTX-2 audio–video
Advanced language and coding AI model