Research code artifacts for Code World Model (CWM)
Inference code for scalable emulation of protein equilibrium ensembles
Pruna is a model optimization framework built for developers
A framework to enable multimodal models to operate a computer
Optax is a gradient processing and optimization library for JAX
A library for accelerating Transformer models on NVIDIA GPUs
MTEB: Massive Text Embedding Benchmark
State-of-the-art Parameter-Efficient Fine-Tuning
Trainable, memory-efficient, and GPU-friendly PyTorch reproduction
GPT-powered chat for documentation search & assistance
A simple but complete full-attention transformer
A middleware to provide an OpenAI-compatible endpoint
A Model Context Protocol (MCP) server
An official Qdrant Model Context Protocol (MCP) server implementation
Browse the web, directly from Cursor etc.
Optimizing inference proxy for LLMs
Witness the aha moment of VLM with less than $3
Evaluation suite designed to assess the performance of LLMs
TextWorld is a sandbox learning environment for the training
An API standard for multi-agent reinforcement learning environments
World of apps for benchmarking interactive coding agents
The behavior guidance framework for customer-facing LLM agents
Neural Network Compression Framework for enhanced OpenVINO
OpenAI-style API for open large language models