Tensor search for humans
Inference code for CodeLlama models
MiniMax M2.1, a SOTA model for real-world dev & agents.
The Multi-Agent Framework
CodeGeeX2: A More Powerful Multilingual Code Generation Model
Qwen2.5-VL is the multimodal large language model series
User toolkit for analyzing and interfacing with Large Language Models
Open-source end-to-end LLM Development Platform
Build AI-powered applications with React, Svelte, Vue, and Solid
MobileLLM Optimizing Sub-billion Parameter Language Models
OpenDAN is an open source Personal AI OS
ChatGLM2-6B: An Open Bilingual Chat LLM
INT4/INT5/INT8 and FP16 inference on CPU for RWKV language model
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
A RWKV management and startup tool, full automation, only 8MB
Seamlessly integrate LLMs into scikit-learn
Train a 26M-parameter GPT from scratch in just 2h
A modular graph-based Retrieval-Augmented Generation (RAG) system
A frontier, first-principles handbook
Qwen3-VL, the multimodal large language model series by Alibaba Cloud
Text generator is a handy plugin for Obsidian
GLM-4-Voice | End-to-End Chinese-English Conversational Model
Code for the paper "Evaluating Large Language Models Trained on Code"
Revolutionizing Database Interactions with Private LLM Technology
Dramatron uses large language models to generate coherent scripts