A Gym environment for web task automation
AI-Powered Data Processing: Use LOTUS to process all of your datasets
A personal context-agent that learns how you work
Gracefully face hCaptcha challenges with multimodal LLMs
Learn to build your Second Brain AI assistant with LLMs
A large language model for livestream sales anchors
Generative AI reference workflows
A course on LLM inference serving on Apple Silicon
The official repository for ERNIE 4.5 and ERNIEKit
The Modular Platform (includes MAX & Mojo)
This repository contains code released by Google Research
Outcome-driven agent development framework that evolves
An Efficient Agentic Model for Computer Use
Marrying Grounding DINO with Segment Anything & Stable Diffusion
Build your own Cowork, AI Scientist and other SoTA Agents
A tool to use the Ai2 Open Coding Agents Soft-Verified Agents
New family of code large language models (LLMs)
Multimodal embedding and reranking models built on Qwen3-VL
Anthropic's original performance take-home, now open for you to try
Real-World Centric Foundation GUI Agents
End-to-end speech processing toolkit
Collections of robotics environments
An alignment auditing agent capable of exploring alignment hypotheses
ChatGLM2-6B: An Open Bilingual Chat LLM
GLM-4.5V and GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning