A Pythonic framework to simplify AI service building
PyTorch domain library for recommendation systems
LLMFlows - Simple, Explicit and Transparent LLM Apps
A high-performance ML model serving framework that offers dynamic batching
Visual Instruction Tuning: Large Language-and-Vision Assistant
MII makes low-latency and high-throughput inference possible
Probabilistic reasoning and statistical analysis in TensorFlow
A library for accelerating Transformer models on NVIDIA GPUs
Run 100B+ language models at home, BitTorrent-style
A lightweight vision library for performing large object detection
The unofficial Python package that returns responses from Google Bard
Sparsity-aware deep learning inference runtime for CPUs
An easy-to-use LLM quantization package with user-friendly APIs
Database system for building simpler and faster AI-powered applications
Lightweight Python library for adding real-time multi-object tracking
Phi-3.5 for Mac: Locally-run Vision and Language Models
INT4/INT5/INT8 and FP16 inference on CPU for the RWKV language model
OpenMMLab Model Deployment Framework
Gaussian processes in TensorFlow
A library to communicate with ChatGPT, Claude, Copilot, Gemini
Libraries for applying sparsification recipes to neural networks
Framework for Accelerating LLM Generation with Multiple Decoding Heads
Neural Network Compression Framework for enhanced OpenVINO
Efficient few-shot learning with Sentence Transformers
OpenAI-style API for open large language models