Large language model and vision-language model based on linear attention
Qwen-Image is a powerful image generation foundation model
New family of code large language models (LLMs)
Democratizing Reinforcement Learning for LLMs
Reproduction of Poetiq's record-breaking submission to the ARC-AGI-1
Open source codebase for Scale Agentex
DeepMind model for tracking arbitrary points across videos & robotics
A modular high-level library to train embodied AI agents
Spatiotemporal Signal Processing with Neural Machine Learning Models
A SOTA open-source image editing model
Code for Cicero, an AI agent that plays the game of Diplomacy
Tongyi Deep Research, the Leading Open-source Deep Research Agent
Agent S: an open agentic framework that uses computers like a human
OCR expert VLM powered by Hunyuan's native multimodal architecture
RGBD video generation model conditioned on camera input
Open-weight, large-scale hybrid-attention reasoning model
Reference implementations of MLPerf™ training benchmarks
Linux performance monitoring on-screen or to a CSV file
A Pioneering Open-Source Alternative to GPT-4o
Towards Real-World Vision-Language Understanding
Evals is a framework for evaluating LLMs and LLM systems
Simple and lightweight system information viewer for Windows
Benchmark for 50,000,000 prime numbers in single-core and multicore modes
Free offline SEM software with HTMT, bootstrapping & exports
Optimized Workforce Learning for General Multi-Agent Assistance