The AI-native (edge and LLM) proxy for agents
Lightweight, standalone C++ inference engine for Google's Gemma models
Multilingual Automatic Speech Recognition with word-level timestamps
Database system for building simpler and faster AI-powered applications
Lightweight Python library for adding real-time multi-object tracking
AIMET is a library that provides advanced quantization and compression
High quality, fast, modular reference implementation of SSD in PyTorch
Create HTML profiling reports from pandas DataFrame objects
Library for serving Transformers models on Amazon SageMaker
LLMFlows - Simple, Explicit and Transparent LLM Apps
Superduper: Integrate AI models and machine learning workflows
LLMs and Machine Learning done easily
Libraries for applying sparsification recipes to neural networks
An easy-to-use LLM quantization package with user-friendly APIs
Gaussian processes in TensorFlow
INT4/INT5/INT8 and FP16 inference on CPU for RWKV language model
Visual Instruction Tuning: Large Language-and-Vision Assistant
A library to communicate with ChatGPT, Claude, Copilot, Gemini
Framework which allows you to transform your Vector Database
OpenAI-style API for open large language models
A Unified Library for Parameter-Efficient Learning
Images to inference with no labeling
Adversarial Robustness Toolbox (ART) - Python Library for ML security
Scripts for fine-tuning Meta Llama3 with composable FSDP & PEFT methods
PyTorch library of curated Transformer models and their components