A Unified Library for Parameter-Efficient Learning
OpenMMLab Model Deployment Framework
Pure C++ implementation of several models for real-time chatting
Open-Source AI Camera. Empower any camera/CCTV
Tensor search for humans
Adversarial Robustness Toolbox (ART) - Python Library for ML security
The Triton Inference Server provides an optimized cloud
LLMs as Copilots for Theorem Proving in Lean
Multilingual Automatic Speech Recognition with word-level timestamps
A high-performance ML model serving framework, offers dynamic batching
Unified Model Serving Framework
Bring the notion of Model-as-a-Service to life
An innovative library for efficient LLM inference
The unofficial python package that returns response of Google Bard
Trainable models and NN optimization tools
Probabilistic reasoning and statistical analysis in TensorFlow
A GPU-accelerated library containing highly optimized building blocks
Scripts for fine-tuning Meta Llama3 with composable FSDP & PEFT method
Serve, optimize and scale PyTorch models in production
Framework which allows you transform your Vector Database
OpenAI swift async text to image for SwiftUI app using OpenAI
A set of Docker images for training and serving models in TensorFlow
The AI-native (edge and LLM) proxy for agents
lightweight, standalone C++ inference engine for Google's Gemma models
Bolt is a deep learning library with high performance