Tencent Hunyuan Multimodal diffusion transformer (MM-DiT) model
CodeGeeX2: A More Powerful Multilingual Code Generation Model
Advanced language and coding AI model
High-Resolution 3D Assets Generation with Large Scale Diffusion Models
A Family of Open Sourced Music Foundation Models
New family of code large language models (LLMs)
MapAnything: Universal Feed-Forward Metric 3D Reconstruction
GPT4V-level open-source multi-modal model based on Llama3-8B
Renderer for the harmony response format to be used with gpt-oss
An experimental version of DeepSeek model
Accurate × Fast × Comprehensive
A state-of-the-art open visual language model
Chinese and English multimodal conversational language model
Tongyi Deep Research, the Leading Open-source Deep Research Agent
Lets make video diffusion practical
CodeGeeX: An Open Multilingual Code Generation Model (KDD 2023)
A trainable PyTorch reproduction of AlphaFold 3
Large Multimodal Models for Video Understanding and Editing
Ultra-Efficient LLMs on End Device
Repo for SeedVR2 & SeedVR
Multimodal Diffusion with Representation Alignment
Advancing Open-source World Models
Mixture-of-Experts Vision-Language Models for Advanced Multimodal
Block Diffusion for Ultra-Fast Speculative Decoding
Inference code for scalable emulation of protein equilibrium ensembles