Driving with Graph Visual Question Answering
Unleashing 10,000+ Word Generation from Long Context LLMs
LISA: Reasoning Segmentation via Large Language Model
Anomaly detection related books, papers, videos, and toolboxes
A Family of Open Foundation Models for Code Intelligence
ICLR2024 Spotlight: curation/training code, metadata, distribution
OpenCompass is an LLM evaluation platform
Graph Neural Network Library for PyTorch
MiniMax-M2, a model built for Max coding & agentic workflows
RGBD video generation model conditioned on camera input
New family of code large language models (LLMs)
Democratizing Reinforcement Learning for LLMs
Reproduction of Poetiq's record-breaking submission to the ARC-AGI-1
Open source codebase for Scale Agentex
DeepMind model for tracking arbitrary points across videos & robotics
A modular high-level library to train embodied AI agents
Spatiotemporal Signal Processing with Neural Machine Learning Models
A SOTA open-source image editing model
The Arcade Learning Environment (ALE) -- a platform for AI research
Tongyi Deep Research, the Leading Open-source Deep Research Agent
Agent S: an open agentic framework that uses computers like a human
OCR expert VLM powered by Hunyuan's native multimodal architecture
State of the art LLM and coding model
Open-weight, large-scale hybrid-attention reasoning model
Large-language-model & vision-language-model based on Linear Attention