Probabilistic reasoning and statistical analysis in TensorFlow
Easiest and laziest way for building multi-agent LLMs applications
Easy-to-use Speech Toolkit including Self-Supervised Learning model
Visual Instruction Tuning: Large Language-and-Vision Assistant
Open-source tool designed to enhance the efficiency of workloads
Data manipulation and transformation for audio signal processing
Scripts for fine-tuning Meta Llama3 with composable FSDP & PEFT method
State-of-the-art Parameter-Efficient Fine-Tuning
Python Package for ML-Based Heterogeneous Treatment Effects Estimation
A toolkit to optimize ML models for deployment for Keras & TensorFlow
Optimizing inference proxy for LLMs
Neural Network Compression Framework for enhanced OpenVINO
Build your chatbot within minutes on your favorite device
GPU environment management and cluster orchestration
Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs
Phi-3.5 for Mac: Locally-run Vision and Language Models
Libraries for applying sparsification recipes to neural networks
An easy-to-use LLMs quantization package with user-friendly apis
Lightweight Python library for adding real-time multi-object tracking
MII makes low-latency and high-throughput inference possible
A unified framework for scalable computing
A set of Docker images for training and serving models in TensorFlow
A library to communicate with ChatGPT, Claude, Copilot, Gemini
Sparsity-aware deep learning inference runtime for CPUs
Large Language Model Text Generation Inference