Automate browser-based workflows with LLMs and Computer Vision
Run Claude Code, Gemini, Codex in a clean, isolated sandbox
Generate Any 3D Scene in Seconds
GLM-4-Voice | End-to-End Chinese-English Conversational Model
An open sourced end-to-end VLM-based GUI Agent
Pretty diff to html javascript library (diff2html)
Game Boy emulator written in Python
Medical imaging toolkit for deep learning
MiniMax-M2, a model built for Max coding & agentic workflows
A Family of Open Foundation Models for Code Intelligence
HunyuanVideo: A Systematic Framework For Large Video Generation Model
ContextGem: Effortless LLM extraction from documents
Deepfakes Software For All
Open-source MCP gateway and control plane for teams
Towards Human-Sounding Speech
Talk to Your AI Agents from Anywhere
Your best AI pair programmer in VS Code
Data manipulation and transformation for audio signal processing
Low-code app builder for RAG and multi-agent AI applications
Get started w/ building Fullstack Agents using Gemini 2.5 & LangGraph
Agent framework and applications built upon Qwen>=3.0
DeepMind's software stack for physics-based simulation
Foundation Models for Time Series
MCP Gateway is a reverse proxy and management layer for MCP servers
Hackable and optimized Transformers building blocks