Real-World Centric Foundation GUI Agents
Qwen-Image-Layered: Layered Decomposition for Inherent Editability
Context data platform for building observable, self-learning AI agents
Democratizing Reinforcement Learning for LLMs
Provider-agnostic, open-source evaluation infrastructure
When LLM Meets Domain Experts
Open-sourced unified customization model
Reproduction of Poetiq's record-breaking submission to the ARC-AGI-1
SOTA discrete acoustic codec models with 40/75 tokens per second
End-to-end speech processing toolkit
Pokee Deep Research Model Open Source Repo
Python examples of popular machine learning algorithms
Volcano Engine Reinforcement Learning for LLMs
An alignment auditing agent capable of exploring alignment hypotheses
Expose your FastAPI endpoints as Model Context Protocol (MCP) tools
FAIR Sequence Modeling Toolkit 2
VGGSfM: Visual Geometry Grounded Deep Structure From Motion
State-of-the-art Image & Video CLIP, Multimodal Large Language Models
MapAnything: Universal Feed-Forward Metric 3D Reconstruction
PyTorch code and models for VJEPA2 self-supervised learning from video
An AI-powered security review GitHub Action using Claude
GPT4V-level open-source multi-modal model based on Llama3-8B
Educational framework exploring multi-agent orchestration
Official Python implementation of UTCP. UTCP is an open standard
Python client for Telegram's tdlib