C++ implementation of ChatGLM-6B & ChatGLM2-6B & ChatGLM3 & GLM4(V)
ChatGLM-6B: An Open Bilingual Dialogue Language Model
High-Resolution Image Synthesis with Latent Diffusion Models
Wan2.1: Open and Advanced Large-Scale Video Generative Model
FlashMLA: Efficient Multi-head Latent Attention Kernels
Text and image to video generation: CogVideoX and CogVideo
gpt-oss-120b and gpt-oss-20b are two open-weight language models
A latent text-to-image diffusion model
Lightweight multimodal translation model for 55 languages
Compact 8B multimodal instruct model optimized for edge deployment
Small 3B-base multimodal model ideal for custom AI on edge hardware
Efficient 14B multimodal instruct model with edge deployment and FP8
OpenAI’s open-weight 120B model optimized for reasoning and tooling
Quantized 675B multimodal instruct model optimized for NVFP4