Memory-efficient and performant finetuning of Mistral's models
DeepEP: an efficient expert-parallel communication library
Diffusion Transformer with Fine-Grained Chinese Understanding
NVIDIA Isaac GR00T N1.5 is the world's first open foundation model
Stream Processing and Complex Event Processing Engine
A semantic diff utility and library for tree-like files such as JSON
An advanced paper search agent powered by large language models
Large language models and vision-language models based on linear attention
A collection of learning resources for curious software engineers
An Efficient, Scalable, Multi-Modality RL Training Framework
An SSH/Telnet/Serial client in your browser
Pokee Deep Research Model Open Source Repo
Stable-diffusion-webui-pixelization
Unified Multimodal Understanding and Generation Models
Volcano Engine Reinforcement Learning for LLMs
LLM-powered fuzzing via OSS-Fuzz
Beyond the Imitation Game collaborative benchmark for measuring
Expose your FastAPI endpoints as Model Context Protocol (MCP) tools
Collection of common code shared among different research projects
PyTorch code and models for VJEPA2 self-supervised learning from video
Language modeling in a sentence representation space
Dataset of GPT-2 outputs for research in detection, biases, and more
Code for the paper "Language models can explain neurons in language models"
Evals is a framework for evaluating LLMs and LLM systems
The ChatGPT Retrieval Plugin lets you easily find personal documents