Port of Facebook's LLaMA model in C/C++
GLM-4.5: Open-source LLM for intelligent agents by Z.ai
Python bindings for llama.cpp
MiniMax M2.1, a SOTA model for real-world dev & agents
Image generation model with single-stream diffusion transformer
Qwen3-Coder is the code version of Qwen3
State-of-the-art LLM and coding model
Qwen3.6 is the large language model series developed by the Qwen team
Extension index for stable-diffusion-webui
FlashMLA: Efficient Multi-head Latent Attention Kernels
Scaling Reinforcement Learning with LLMs
Ling is a MoE LLM provided and open-sourced by InclusionAI
Production-tested AI infrastructure tools
Collection of Gemma 3 variants that are trained for performance
Open-source deep-learning framework
Qwen2.5-Coder is the code version of Qwen2.5, the large language model
Release for Improved Denoising Diffusion Probabilistic Models
Open-source, high-performance Mixture-of-Experts large language model
Runtime extension of Proximus enabling deployment on AMD Ryzen™ AI
DeepSeek LLM: Let there be answers
Dataset of GPT-2 outputs for research in detection, biases, and more
ChatGPT integration with Unity Editor
An implementation of model parallel GPT-2 and GPT-3-style models
Code for reproducing key results in the paper
Model that fuses instruct, reasoning and agentic skills