Uses Qwen3-ASR, local LLM, Whisper, TEN-VAD
E2B Desktop Sandbox for LLMs. E2B Sandbox
Implement a ChatGPT-like LLM in PyTorch from scratch, step by step
Access large language models from the command-line
Easy-to-use LLM fine-tuning framework (LLaMA-2, BLOOM, Falcon
Open-source observability for your LLM application
The official Meta Llama 3 GitHub site
A modular graph-based Retrieval-Augmented Generation (RAG) system
Multilingual sentence & image embeddings with BERT
A python module to repair invalid JSON from LLMs
AirLLM 70B inference with single 4GB GPU
Language-model investigation agent with a terminal UI
Open-Source Financial Large Language Models
Qwen3-Coder is the code version of Qwen3
A guidance language for controlling large language models
Inference code for CodeLlama models
State-of-the-art Parameter-Efficient Fine-Tuning
The Multi-Agent Framework
ChatGLM2-6B: An Open Bilingual Chat LLM
Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 600+ LLMs
Collection of awesome LLM apps with AI Agents and RAG using OpenAI
GLM-4-Voice | End-to-End Chinese-English Conversational Model
Operating LLMs in production
Official Repo for ICML 2024 paper
Build a large language model from 0 only with Python foundation