Run Local LLMs on Any Device. Open-source
Ready-to-use OCR with 80+ supported languages
A high-throughput and memory-efficient inference and serving engine
Everything you need to build state-of-the-art foundation models
Library for OCR-related tasks powered by Deep Learning
Data manipulation and transformation for audio signal processing
Bring the notion of Model-as-a-Service to life
A Pythonic framework to simplify AI service building
Deep learning optimization library: makes distributed training easy
FlashInfer: Kernel Library for LLM Serving
The Triton Inference Server provides an optimized cloud
The official Python client for the Huggingface Hub
Phi-3.5 for Mac: Locally-run Vision and Language Models
An MLOps framework to package, deploy, monitor and manage models
A library for accelerating Transformer models on NVIDIA GPUs
Easiest and laziest way for building multi-agent LLMs applications
Uncover insights, surface problems, monitor, and fine tune your LLM
Easy-to-use Speech Toolkit including Self-Supervised Learning model
Multilingual Automatic Speech Recognition with word-level timestamps
Trainable models and NN optimization tools
State-of-the-art diffusion models for image and audio generation
PyTorch extensions for fast R&D prototyping and Kaggle farming
Lightweight Python library for adding real-time multi-object tracking
A set of Docker images for training and serving models in TensorFlow
Open-source tool designed to enhance the efficiency of workloads