A Powerful Native Multimodal Model for Image Generation
State-of-the-art TTS model under 25MB
Multimodal Diffusion with Representation Alignment
Tencent Hunyuan Multimodal diffusion transformer (MM-DiT) model
Powerful AI language model (MoE) optimized for efficiency/performance
Qwen3 is the large language model series developed by Qwen team
An experimental version of DeepSeek model
Stable Virtual Camera: Generative View Synthesis with Diffusion Models
A Unified Framework for Text-to-3D and Image-to-3D Generation
Reference PyTorch implementation and models for DINOv3
AlphaFold 3 inference pipeline
Foundation Models for Time Series
tiktoken is a fast BPE tokeniser for use with OpenAI's models
GLM-4-Voice | End-to-End Chinese-English Conversational Model
FAIR Sequence Modeling Toolkit 2
Inference code for scalable emulation of protein equilibrium ensembles
High-Resolution 3D Assets Generation with Large Scale Diffusion Models
Pushing the Limits of Mathematical Reasoning in Open Language Models
Unified Multimodal Understanding and Generation Models
VGGSfM: Visual Geometry Grounded Deep Structure From Motion
ChatGLM-6B: An Open Bilingual Dialogue Language Model
Implementation of the Surya Foundation Model for Heliophysics
GLM-4.5: Open-source LLM for intelligent agents by Z.ai
gpt-oss-120b and gpt-oss-20b are two open-weight language models
Lets make video diffusion practical