Structured outputs for LLMs
Python bindings for llama.cpp
User toolkit for analyzing and interfacing with Large Language Models
Run local LLMs on any device (open-source)
Toolkit for conversational AI
Chinese and English multimodal conversational language model
Agentic, Reasoning, and Coding (ARC) foundation models
Learn AI and LLMs from scratch using free resources
Port of Facebook's LLaMA model in C/C++
Low-code app builder for RAG and multi-agent AI applications
Code for the paper "Language models can explain neurons in language models"
GLM-4.5: Open-source LLM for intelligent agents by Z.ai
A high-throughput and memory-efficient inference and serving engine
Qwen3 is the large language model series developed by the Qwen team
Diversity-driven optimization and large-model reasoning ability
Interact with your documents using the power of GPT
Implement a ChatGPT-like LLM in PyTorch from scratch, step by step
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
Lightweight package to simplify LLM API calls
Open-source, high-performance AI model with advanced reasoning
Powerful AI language model (MoE) optimized for efficiency/performance
PandasAI is a Python library that integrates generative AI
The official repo of the Qwen chat and pretrained large language models
State-of-the-art Parameter-Efficient Fine-Tuning
Inference code for CodeLlama models