AI agents autonomously run and improve ML experiments overnight
Implement CPU from scratch and play with large model deployments
Run a full local LLM stack with one command using Docker
Performance-optimized AI inference on your GPUs
Parallax is a distributed model serving framework
gpt-4o for windows, macos and linux
Run AI models end-to-end encrypted
A reactive notebook for Python
Test Suites for validating ML models & data
The agent that grows with you
Run all your local AI together in one package
The Simple Agent Development Kit
AIMET is a library that provides advanced quantization and compression
A course of learning LLM inference serving on Apple Silicon
Data science on data without acquiring a copy
A fast library for AutoML and tuning
DeepVariant is an analysis pipeline that uses a deep neural networks
Find the local LLM that actually runs and performs best
Android Application Identifier for Packers, Protectors and Obfuscators
Deepfakes Software For All
A modular graph-based Retrieval-Augmented Generation (RAG) system
Nexa SDK is a comprehensive toolkit for supporting ONNX and GGML
Faster Whisper transcription with CTranslate2
Jupyter notebook tutorials for OpenVINO
State-of-the-art diffusion models for image and audio generation