C++ implementation of ChatGLM-6B & ChatGLM2-6B & ChatGLM3 & GLM4(V)
Python bindings for llama.cpp
Provides convenient access to the Anthropic REST API from any Python 3 application
Example Discord bot written in Python that uses the completions API
Python example app from the OpenAI API quickstart tutorial
Python SDK for Claude Agent
Port of Facebook's LLaMA model in C/C++
Revolutionizing Database Interactions with Private LLM Technology
Agentic, Reasoning, and Coding (ARC) foundation models
Contexts Optical Compression
CodeGeeX: An Open Multilingual Code Generation Model (KDD 2023)
Phi-3.5 for Mac: Locally-run Vision and Language Models
State-of-the-art TTS model under 25MB
INT4/INT5/INT8 and FP16 inference on CPU for RWKV language model
Qwen3 is the large language model series developed by the Qwen team
ChatGLM-6B: An Open Bilingual Dialogue Language Model
Powerful AI language model (MoE) optimized for efficiency/performance
Code for running inference and finetuning with SAM 3 model
Code for running inference with the SAM 3D Body Model (3DB)
From Images to High-Fidelity 3D Assets
Official inference repo for FLUX.2 models
Wan2.2: Open and Advanced Large-Scale Video Generative Model
RGBD video generation model conditioned on camera input
Wan2.1: Open and Advanced Large-Scale Video Generative Model
Tencent Hunyuan Multimodal diffusion transformer (MM-DiT) model