Structured outputs for llms
Python bindings for llama.cpp
Build AI WhatsApp Bots with Pure Python
Run models like Kimi-K2.5, GLM-5, DeepSeek, gpt-oss, Gemma, Qwen etc.
Run Local LLMs on Any Device. Open-source
A Simple and Universal Swarm Intelligence Engine
Port of Facebook's LLaMA model in C/C++
Agentic, Reasoning, and Coding (ARC) foundation models
Interact with your documents using the power of GPT
Fully automatic censorship removal for language models
Low-code app builder for RAG and multi-agent AI applications
Powerful AI language model (MoE) optimized for efficiency/performance
Open-source, high-performance AI model with advanced reasoning
AirLLM 70B inference with single 4GB GPU
The official Meta Llama 3 GitHub site
Uses Qwen3-ASR, local LLM, Whisper, TEN-VAD
lightweight package to simplify LLM API calls
Universal LLM Deployment Engine with ML Compilation
C++ implementation of ChatGLM-6B & ChatGLM2-6B & ChatGLM3 & GLM4(V)
Build a large language model from 0 only with Python foundation
Chat with your documents using local AI
All-in-one WebUI for AI generative image and video creation
Easy-to-use LLM fine-tuning framework (LLaMA-2, BLOOM, Falcon
Implement a ChatGPT-like LLM in PyTorch from scratch, step by step
Advanced language and coding AI model