A Model Context Protocol (MCP) server
Sparsity-aware deep learning inference runtime for CPUs
Standalone, small, language-neutral
Diversity-driven optimization and large-model reasoning ability
Scalable data pre processing and curation toolkit for LLMs
User toolkit for analyzing and interfacing with Large Language Models
Operating LLMs in production
Phi-3.5 for Mac: Locally-run Vision and Language Models
Building applications with LLMs through composability
A Python Automated Machine Learning tool that optimizes ML
Documentation for Google's Gen AI site - including Gemini API & Gemma
Easy-to-use and high-performance NLP and LLM framework
The no-nonsense RAG chunking library
Mixture-of-Experts Vision-Language Models for Advanced Multimodal
Interact with your documents using the power of GPT
lightweight package to simplify LLM API calls
Open-source, high-performance AI model with advanced reasoning
Semantic search and workflows for medical/scientific papers
State-of-the-art Parameter-Efficient Fine-Tuning
GLM-4.5: Open-source LLM for intelligent agents by Z.ai
Agents write python code to call tools and orchestrate other agents
Chinese and English multimodal conversational language model
Train a 26M-parameter GPT from scratch in just 2h
INT4/INT5/INT8 and FP16 inference on CPU for RWKV language model
Offline speech recognition API for Android, iOS, Raspberry Pi