Supercharge Your LLM with the Fastest KV Cache Layer
A Model Context Protocol (MCP) Gateway & Registry
Set of tools to assess and improve LLM security
FAIR Sequence Modeling Toolkit 2
Fast3R: Towards 3D Reconstruction of 1000+ Images in One Forward Pass
ICLR2024 Spotlight: curation/training code, metadata, distribution
A Production-ready Reinforcement Learning AI Agent Library
A PyTorch library for implementing flow matching algorithms
An implementation of a deep learning recommendation model (DLRM)
Hackable and optimized Transformers building blocks
GLM-4-Voice | End-to-End Chinese-English Conversational Model
[NeurIPS 2023] ImageReward: Learning and Evaluating Human Preferences
CogView4, CogView3-Plus and CogView3(ECCV 2024)
Memory-efficient and performant finetuning of Mistral's models
Official implementation of DreamCraft3D
tiktoken is a fast BPE tokeniser for use with OpenAI's models
Ring is a reasoning MoE LLM provided and open-sourced by InclusionAI
Research code artifacts for Code World Model (CWM)
Diffusion Transformer with Fine-Grained Chinese Understanding
The Simple Agent Development Kit
Structured outputs for llms
Low-latency REST API for serving text-embeddings
AI-powered tool for developers, simplifying coding tasks
BISHENG is an open LLM devops platform for next generation apps
Scalable and user friendly neural forecasting algorithms.