PyTorch library of curated Transformer models and their components
950 line, minimal, extensible LLM inference engine built from scratch
GLM-4.5: Open-source LLM for intelligent agents by Z.ai
ChatGLM-6B: An Open Bilingual Dialogue Language Model
Deploy your agentic worfklows to production
Collect, organize, use, and share, all in OmniBox
A modular Agentic RAG built with LangGraph
Replace OpenAI GPT with another LLM in your app
Framework to easily create LLM powered bots over any dataset
GLM-4 series: Open Multilingual Multimodal Chat LMs
Concatenate a directory full of files into a single prompt
A lightweight framework for building LLM-based agents
Utilities intended for use with Llama models
An agentless approach to automatically solve software development
A new open-source framework to build and deploy intelligent agents
How to optimize some algorithm in cuda
The SOTA Open-Source Browser Agent
Collection of awesome LLM apps with AI Agents and RAG using OpenAI
Designed for text embedding and ranking tasks
Your Personal Research Multi-Tool
Tutorial tailored for Chinese babies on rapid fine-tuning
LLM training in simple, raw C/CUDA
LLM powered fuzzing via OSS-Fuzz
Repo of Qwen2-Audio chat & pretrained large audio language model
Bringing BERT into modernity via both architecture changes and scaling