Structured outputs for llms
Python bindings for llama.cpp
Run Local LLMs on Any Device. Open-source
Agentic, Reasoning, and Coding (ARC) foundation models
A high-throughput and memory-efficient inference and serving engine
GLM-4.5: Open-source LLM for intelligent agents by Z.ai
Qwen3 is the large language model series developed by Qwen team
Access large language models from the command-line
Operating LLMs in production
lightweight package to simplify LLM API calls
Inference code for CodeLlama models
Powerful AI language model (MoE) optimized for efficiency/performance
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
Easy-to-use LLM fine-tuning framework (LLaMA-2, BLOOM, Falcon
Framework that is dedicated to making neural data processing
A guidance language for controlling large language models
PandasAI is a Python library that integrates generative AI
Open-source, high-performance AI model with advanced reasoning
Database system for building simpler and faster AI-powered application
The Multi-Agent Framework
Qwen3-Coder is the code version of Qwen3
A modular graph-based Retrieval-Augmented Generation (RAG) system
Interact with your documents using the power of GPT
Ongoing research training transformer models at scale
BertViz: Visualize Attention in NLP Models (BERT, GPT2, BART, etc.)