ChatGLM-6B: An Open Bilingual Dialogue Language Model
Hackable and optimized Transformers building blocks
Memory-efficient and performant finetuning of Mistral's models
Stable Diffusion WebUI Forge is a platform on top of Stable Diffusion
FlashMLA: Efficient Multi-head Latent Attention Kernels
CodeGeeX2: A More Powerful Multilingual Code Generation Model
Lets make video diffusion practical
The most powerful local music generation model
High-Resolution Image Synthesis with Latent Diffusion Models
C++ implementation of ChatGLM-6B & ChatGLM2-6B & ChatGLM3 & GLM4(V)
MapAnything: Universal Feed-Forward Metric 3D Reconstruction
Ring is a reasoning MoE LLM provided and open-sourced by InclusionAI
Open-source large language model family from Tencent Hunyuan
Tencent Hunyuan Multimodal diffusion transformer (MM-DiT) model
Flux 2 image generation model pure C inference
An experimental version of DeepSeek model
Advancing Open-source World Models
gpt-oss-120b and gpt-oss-20b are two open-weight language models
Text and image to video generation: CogVideoX and CogVideo
The official repo of Qwen chat & pretrained large language model
tiktoken is a fast BPE tokeniser for use with OpenAI's models
A Multi-Modal World Model for Reconstructing, Generating, Simulation
A trainable PyTorch reproduction of AlphaFold 3
Z80-μLM is a 2-bit quantized language model
PyTorch code and models for the DINOv2 self-supervised learning