ChatGLM-6B: An Open Bilingual Dialogue Language Model
Hackable and optimized Transformers building blocks
Memory-efficient and performant finetuning of Mistral's models
Stable Diffusion WebUI Forge is a platform on top of Stable Diffusion
CodeGeeX2: A More Powerful Multilingual Code Generation Model
Lets make video diffusion practical
The most powerful local music generation model
High-Resolution Image Synthesis with Latent Diffusion Models
MapAnything: Universal Feed-Forward Metric 3D Reconstruction
Ring is a reasoning MoE LLM provided and open-sourced by InclusionAI
Open-source large language model family from Tencent Hunyuan
Tencent Hunyuan Multimodal diffusion transformer (MM-DiT) model
An experimental version of DeepSeek model
Advancing Open-source World Models
gpt-oss-120b and gpt-oss-20b are two open-weight language models
Text and image to video generation: CogVideoX and CogVideo
The official repo of Qwen chat & pretrained large language model
tiktoken is a fast BPE tokeniser for use with OpenAI's models
A Multi-Modal World Model for Reconstructing, Generating, Simulation
A trainable PyTorch reproduction of AlphaFold 3
Z80-μLM is a 2-bit quantized language model
PyTorch code and models for the DINOv2 self-supervised learning
ChatGPT interface with better UI
GPT4V-level open-source multi-modal model based on Llama3-8B
Diversity-driven optimization and large-model reasoning ability