SGLang is a fast serving framework for large language models
Lemonade helps users run local LLMs with the highest performance
Offline Text To Speech synthesis for Python
A TTS that fits in your CPU (and pocket)
Package and deploy machine learning models using Docker containers
Multi-Agent daTa geneRation Infra and eXperimentation framework
Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 600+ LLMs
A fast TTS architecture with conditional flow matching
Sparsity-aware deep learning inference runtime for CPUs
Low-code framework for building custom LLMs, neural networks
OpenDAN is an open source Personal AI OS
Library for efficiently connecting and optimizing teams of AI agents
AutoAgent: Fully-Automated and Zero-Code LLM Agent Framework
TFX is an end-to-end platform for deploying production ML pipelines
MLOps simplified. From ML Pipeline ⇨ Data Product without the hassle
Train machine learning models within Docker containers
Convert TensorFlow, Keras, TensorFlow.js and TFLite models to ONNX
The official Python Library for the Groq API
Unified framework for building enterprise RAG pipelines
950-line, minimal, extensible LLM inference engine built from scratch
AI memory OS for LLM and Agent systems
Z80-μLM is a 2-bit quantized language model
An MLOps framework to package, deploy, monitor and manage models
Python tool for browser-based interactive data apps in one file
SQL-Driven RAG Engine