Implementation of Vision Transformer, a simple way to achieve SOTA
Parse files for optimal RAG
Deterministic LLMs Outputs for AI Applications and AI Agents
Open source libraries and APIs to build custom preprocessing pipelines
An official Qdrant Model Context Protocol (MCP) server implementation
Neural Network Compression Framework for enhanced OpenVINO
Sparsity-aware deep learning inference runtime for CPUs
Large Language Model Text Generation Inference
Core ML tools contain supporting tools for Core ML model conversion
Deep and Machine Learning for Microscopy
ONNX-TensorRT: TensorRT backend for ONNX
Library to facilitate federated learning research
Massively parallel rigidbody physics simulation
Build voice-based LLM agents. Modular + open source
A simple, secure MCP-to-OpenAPI proxy server
ChatGLM-6B: An Open Bilingual Dialogue Language Model
Open Source Generative Process Automation
OpenMMLab Model Deployment Framework
Open-source autonomous AI software engineer
Standardized Serverless ML Inference Platform on Kubernetes
Neural Search
Central interface to connect your LLM's with external data
Toloka-Kit is a Python library for working with Toloka API
Generating Immersive, Explorable, and Interactive 3D Worlds
A library to communicate with ChatGPT, Claude, Copilot, Gemini