Structured outputs for LLMs
Python bindings for llama.cpp
Easy-to-use LLM fine-tuning framework (LLaMA-2, BLOOM, Falcon
Run Local LLMs on Any Device. Open-source
A state-of-the-art open visual language model
Agentic, Reasoning, and Coding (ARC) foundation models
A high-throughput and memory-efficient inference and serving engine
Repo of Qwen2-Audio chat & pretrained large audio language model
Qwen3 is the large language model series developed by the Qwen team
Access large language models from the command-line
GLM-4.5: Open-source LLM for intelligent agents by Z.ai
Qwen3-Omni is a natively end-to-end, omni-modal LLM
A guidance language for controlling large language models
Operating LLMs in production
PandasAI is a Python library that integrates generative AI
Lightweight package to simplify LLM API calls
Code for the paper "Language models can explain neurons in language models"
Inference code for CodeLlama models
Simple, Pythonic building blocks to evaluate LLM applications
The unofficial Python package that returns responses from Google Bard
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
A modular graph-based Retrieval-Augmented Generation (RAG) system
Interact with your documents using the power of GPT
Powerful AI language model (MoE) optimized for efficiency/performance
LLM abstractions that aren't obstructions