GLM-4.5: Open-source LLM for intelligent agents by Z.ai
Jupyter notebook tutorials for OpenVINO
Gaussian processes in TensorFlow
Low-latency AI inference engine optimized for mobile devices
State-of-the-art diffusion models for image and audio generation
Ling is a MoE LLM provided and open-sourced by InclusionAI
Powering Amazon custom machine learning chips
Operating LLMs in production
Accessible large language models via k-bit quantization for PyTorch
A lightweight vLLM implementation built from scratch
A Pythonic framework to simplify AI service building
Official inference repo for FLUX.1 models
Accelerate local LLM inference and finetuning
Integrate, train and manage any AI models and APIs with your database
Phi-3.5 for Mac: Locally-run Vision and Language Models
PyTorch library of curated Transformer models and their components
Run Local LLMs on Any Device. Open-source
Qwen3 is the large language model series developed by Qwen team
Official Python inference and LoRA trainer package
Simplifies the local serving of AI models from any source
HunyuanVideo: A Systematic Framework For Large Video Generation Model
Data manipulation and transformation for audio signal processing
Multi-lingual large voice generation model, providing inference
Multilingual Automatic Speech Recognition with word-level timestamps
Pruna is a model optimization framework built for developers