Open-source evaluation toolkit for large multi-modality models (LMMs)
Open-source model for program synthesis
Training Large Language Models to Reason in a Continuous Latent Space
Qwen-Image is a powerful image generation foundation model
The official PyTorch implementation of Google's Gemma models
MemU is an open-source memory framework for AI companions
Mixture-of-Experts Vision-Language Models for Advanced Multimodal
MiniMax M2.1, a SOTA model for real-world dev & agents
An Open-Source Toolkit for General-OCR Research and Applications
Code for Cicero, an AI agent that plays the game of Diplomacy
Test-Time Reinforcement Learning
MiroThinker is an open-source deep research agent
A Gym environment for web task automation
Papers integrating knowledge graphs (KGs) and large language models
Block Diffusion for Ultra-Fast Speculative Decoding
Anthropic's original performance take-home, now open for you to try
A unified, comprehensive and efficient recommendation library
Knowledge Graph Generation from Any Text
End-to-end speech processing toolkit
Clean and efficient FP8 GEMM kernels with fine-grained scaling
Face recognition with deep neural networks
Decomposable Multiscale Mixing for Time Series Forecasting
Agent framework that enables tool-use agent tasks
AI-Driven Exploration in the Space of Code
Hypernetworks that adapt LLMs for specific benchmark tasks