Database system for building simpler and faster AI-powered applications
Lightweight Python library for adding real-time multi-object tracking
PyTorch domain library for recommendation systems
LLMFlows - Simple, Explicit and Transparent LLM Apps
Framework for Accelerating LLM Generation with Multiple Decoding Heads
Build your chatbot within minutes on your favorite device
Phi-3.5 for Mac: Locally-run Vision and Language Models
Neural Network Compression Framework for enhanced OpenVINO
OpenAI-style API for open large language models
A Unified Library for Parameter-Efficient Learning
Large Language Model Text Generation Inference
Images to inference with no labeling
Scripts for fine-tuning Meta Llama3 with composable FSDP & PEFT methods
GPU environment management and cluster orchestration
PyTorch library of curated Transformer models and their components
Run any Llama 2 model locally with a Gradio UI on GPU or CPU from anywhere
Open-source tool designed to enhance the efficiency of workloads
State-of-the-art Parameter-Efficient Fine-Tuning
Superduper: Integrate AI models and machine learning workflows
A high-performance ML model serving framework that offers dynamic batching
Framework that is dedicated to making neural data processing
MII makes low-latency and high-throughput inference possible
PyTorch extensions for fast R&D prototyping and Kaggle farming
Probabilistic reasoning and statistical analysis in TensorFlow
Low-latency REST API for serving text-embeddings