A simple screen parsing tool towards pure vision based GUI agent
The open-source data curation platform for LLMs
Browser userscript that enhances ChatGPT reliability and usability
Powerful Android AI agent with tools, automation, and Linux shell
Open Source Document Management System for Digital Archives
SDK for building interactive UI components over MCP for AI tools
A Personalized LLM-powered Agent Frameworks
Faster and easier training and deployments
A system for agentic LLM-powered data processing and ETL
Concatenate a directory full of files into a single prompt
LLM
A Systematic Framework for Interactive World Modeling
Machine learning, conversational dialog engine for creating chat bots
Using AI models to automatically provide commentary and edit videos
Bailing is a voice dialogue robot similar to GPT-4o
Weaving the Digital Agent Galaxy
Quick illustration of how one can easily read books together with LLMs
Multi-user UI for managing and running Stable Diffusion workflows tool
All-in-one WebUI for AI generative image and video creation
A Frontier Mathematical Coding Agent
AI tool for automating desktop tasks via natural language input
A research prototype of a human-centered web agent
Autonomous LLM agent for end-to-end data science workflows
Structured data extraction and instruction calling with ML, LLM
Tensor search for humans