Private chat with local GPT with document, images, video, etc.
Blazing-fast vector DB with similarity search and metadata filtering
A library for deep learning end-to-end dialog systems and chatbots
Qwen3-omni is a natively end-to-end, omni-modal LLM
Bring the notion of Model-as-a-Service to life
Phi-3.5 for Mac: Locally-run Vision and Language Models
SGLang is a fast serving framework for large language models
a pluggable app that runs a full check on the deployment
Multilingual Automatic Speech Recognition with word-level timestamps
Deep learning optimization library making distributed training easy
AI-powered Quantitative Investment Research Platform
Fast and Universal 3D reconstruction model for versatile tasks
The Simple Agent Development Kit
Embed images and sentences into fixed-length vectors
Global weather forecasting model using graph neural networks and JAX
Tooling for the Common Objects In 3D dataset
NVIDIA Federated Learning Application Runtime Environment
Photorealistic Synthetic Dataset for Holistic Indoor Scene
Expose your FastAPI endpoints as Model Context Protocol (MCP) tools
Collection of common code shared among different research projects
Reusable workflow library for Django
GPT4V-level open-source multi-modal model based on Llama3-8B
Minimal scripts to run the emulator in a container for various systems
Collaborative forensic timeline analysis
MobileLLM Optimizing Sub-billion Parameter Language Models