Low-latency REST API for serving text-embeddings
Open deep learning compiler stack for cpu, gpu, etc.
Towards Human-Sounding Speech
AIMET is a library that provides advanced quantization and compression
Tools for merging pretrained large language models
A cross-platform Python library for differentiable programming
Python package built to ease deep learning on graph
Open-weight, large-scale hybrid-attention reasoning model
Chinese Llama-3 LLMs) developed from Meta Llama 3
Interface for OuteTTS models
Tensor search for humans
The data structure for multimodal data
ComfyUI integration for Microsoft's VibeVoice text-to-speech model
Reference implementations of MLPerf™ training benchmarks
Easy-to-use deep learning framework with 3 key features
A Pioneering Open-Source Alternative to GPT-4o
Towards Real-World Vision-Language Understanding
A Powerful Desktop Full-Text Search Engine, Just Like Local Google.
Evals is a framework for evaluating LLMs and LLM systems
Run GGUF models easily with a UI or API. One File. Zero Install.
A tool for detecting the presence of leg dystonia from videos.
Local AI file organization with categorization and rename suggestions
Run LLMs locally on Cloud Workstations
Optimized Workforce Learning for General Multi-Agent Assistance
fast C++ library for GPU linear algebra & scientific computing