Low-code app builder for RAG and multi-agent AI applications
Research code artifacts for Code World Model (CWM)
Port of Facebook's LLaMA model in C/C++
Instant, controllable, local pre-trained AI models in Rust
Open-source, high-performance AI model with advanced reasoning
Kimi K2 is the large language model series developed by Moonshot AI
Qwen3-Coder is the code version of Qwen3
LLM inference in C/C++
From Vibe Coding to Agentic Engineering
ChatWiki WeChat official account's AI knowledge base workflow agent
Powerful AI language model (MoE) optimized for efficiency/performance
Fully automatic censorship removal for language models
Large-language-model & vision-language-model based on Linear Attention
CodeGeeX: An Open Multilingual Code Generation Model (KDD 2023)
A high-performance ML model serving framework that offers dynamic batching
Utilities intended for use with Llama models
MedicalGPT: Training Your Own Medical GPT Model with ChatGPT Training
Open source LLM engineering platform: LLM Observability, metrics, etc.
Parallax is a distributed model serving framework
Framework and no-code GUI for fine-tuning LLMs
Pre & Post-training & Dataset & Evaluation & Deploy & RAG
The official Meta Llama 3 GitHub site
Run models like Kimi-K2.5, GLM-5, DeepSeek, gpt-oss, Gemma, Qwen etc.
Uncertainty Quantification for Language Models, a Python package
WebAssembly binding for llama.cpp - Enabling on-browser LLM inference