Sparsity-aware deep learning inference runtime for CPUs
Standalone, small, language-neutral
Diversity-driven optimization and large-model reasoning ability
Scalable data preprocessing and curation toolkit for LLMs
User toolkit for analyzing and interfacing with Large Language Models
Phi-3.5 for Mac: Locally-run Vision and Language Models
Operating LLMs in production
CodeGeeX4-ALL-9B, a versatile model for all AI software development
A natural language interface for computers
Easy-to-use and high-performance NLP and LLM framework
The no-nonsense RAG chunking library
DeepSeek Coder: Let the Code Write Itself
Lightweight package to simplify LLM API calls
ChatGLM-6B: An Open Bilingual Dialogue Language Model
Stanford NLP Python library for many human languages
Mixture-of-Experts Vision-Language Models for Advanced Multimodal
Interact with your documents using the power of GPT
State-of-the-art Parameter-Efficient Fine-Tuning
Open-source, high-performance AI model with advanced reasoning
Chinese and English multimodal conversational language model
SGLang is a fast serving framework for large language models
Data and tools for generating and inspecting OLMo pre-training data
Inference code for CodeLlama models
Model Context Protocol tool support for LangChain
Simple, Pythonic building blocks to evaluate LLM applications