Port of Facebook's LLaMA model in C/C++
Run Local LLMs on Any Device. Open-source
A high-throughput and memory-efficient inference and serving engine
Library for OCR-related tasks powered by Deep Learning
State-of-the-art diffusion models for image and audio generation
Easy-to-use deep learning framework with 3 key features
Bring the notion of Model-as-a-Service to life
Open-Source AI Camera. Empower any camera/CCTV
Easiest and laziest way for building multi-agent LLMs applications
Deep learning optimization library: makes distributed training easy
Unified Model Serving Framework
An easy-to-use LLMs quantization package with user-friendly apis
Probabilistic reasoning and statistical analysis in TensorFlow
Framework that is dedicated to making neural data processing
Large Language Model Text Generation Inference
Data manipulation and transformation for audio signal processing
PyTorch library of curated Transformer models and their components
Pytorch domain library for recommendation systems
Easy-to-use Speech Toolkit including Self-Supervised Learning model
PyTorch extensions for fast R&D prototyping and Kaggle farming
Low-latency REST API for serving text-embeddings
A library for accelerating Transformer models on NVIDIA GPUs
Uncover insights, surface problems, monitor, and fine tune your LLM
A GPU-accelerated library containing highly optimized building blocks
Implementation of "Tree of Thoughts