Universal LLM Deployment Engine with ML Compilation
Specify a github or local repo, github pull request
Implementation of "MobileCLIP" CVPR 2024
A lightweight approach to removing Google web service dependency
Operating LLMs in production
Build your own Cowork, AI Scientist and other SoTA Agents
Package and deploy machine learning models using Docker containers
A Tree Search Library with Flexible API for LLM Inference-Time Scaling
SGLang is a fast serving framework for large language models
Streamlines and simplifies prompt design for both developers
Self-learning data agent that grounds its answers in layers of content
Powering Amazon custom machine learning chips
Build, evaluate and train General Multi-Agent Assistance with ease
Offline Text To Speech synthesis for python
NumPy aware dynamic Python compiler using LLVM
Lemonade helps users run local LLMs with the highest performance
Sparsity-aware deep learning inference runtime for CPUs
Open source codebase for Scale Agentex
A TTS that fits in your CPU (and pocket)
Next generation AWS IoT Client SDK for Python
Official repository for LTX-Video
A fast TTS architecture with conditional flow matching
Data parsing and validation using Python type hints
Multi-Agent daTa geneRation Infra and eXperimentation framework
Modular AI runtime for robots