GPT-powered chat for documentation search & assistance
A simple but complete full-attention transformer
Omnilingual ASR Open-Source Multilingual SpeechRecognition
This repository contains the official implementation of FastVLM
MobileLLM Optimizing Sub-billion Parameter Language Models
A Production-ready Reinforcement Learning AI Agent Library
Pushing the Limits of Mathematical Reasoning in Open Language Models
Official implementation of DreamCraft3D
Research code artifacts for Code World Model (CWM)
Flexible and powerful framework for managing multiple AI agents
A middleware to provide an openAI compatible endpoint
A Model Context Protocol (MCP) server
An official Qdrant Model Context Protocol (MCP) server implementation
Browse the web, directly from Cursor etc.
Optimizing inference proxy for LLMs
Witness the aha moment of VLM with less than $3
Evaluation suite designed to assess the performance of LLMs
TextWorld is a sandbox learning environment for the training
An API standard for multi-agent reinforcement learning environments
The Memory layer for AI Agents
World of apps for benchmarking interactive coding agent
The behavior guidance framework for customer-facing LLM agents
Neural Network Compression Framework for enhanced OpenVINO
Openai style api for open large language models
Sparsity-aware deep learning inference runtime for CPUs