Structured outputs for llms
Python bindings for llama.cpp
Run Local LLMs on Any Device. Open-source
Agentic, Reasoning, and Coding (ARC) foundation models
A high-throughput and memory-efficient inference and serving engine
PandasAI is a Python library that integrates generative AI
GLM-4.5: Open-source LLM for intelligent agents by Z.ai
Qwen3 is the large language model series developed by Qwen team
Interact with your documents using the power of GPT
Diversity-driven optimization and large-model reasoning ability
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
The unofficial python package that returns response of Google Bard
Powerful AI language model (MoE) optimized for efficiency/performance
Open-source, high-performance AI model with advanced reasoning
lightweight package to simplify LLM API calls
Access large language models from the command-line
A guidance language for controlling large language models
Implement a ChatGPT-like LLM in PyTorch from scratch, step by step
Simple, Pythonic building blocks to evaluate LLM applications
The official repo of Qwen chat & pretrained large language model
Qwen3-Coder is the code version of Qwen3
State-of-the-art Parameter-Efficient Fine-Tuning
Inference code for CodeLlama models
Ongoing research training transformer models at scale
CodeGeeX: An Open Multilingual Code Generation Model (KDD 2023)