Multimodal Agents as Smartphone Users, an LLM-based multimodal agent
AI-powered tool for efficient abstract and PDF screening
A modular Agentic RAG built with LangGraph
AudioMuse-AI is an Open Source Dockerized environment
A dataset consists of 15,140 ChatGPT prompts from Reddit
An AI-powered file management tool that ensures privacy
CNCF Sandbox Project
An open-source, modern-design AI training tracking and visualization
Enhances Tesseract OCR output using LLMs (local or API)
Visual intelligence for your home.
Open-source AI hackers to find and fix your app’s vulnerabilities
Request recommended movies, TV shows and anime to Jellyseer/Overseer
The open source post-building layer for agents
Schema-Guided Reasoning (SGR) has agentic system design
Chat with your documents using local AI
A.S.E (AICGSecEval) is a repository-level AI-generated code security
Chat with any codebase in under two minutes | Fully local
Benchmark LLMs by fighting in Street Fighter 3
A tension reasoning engine over 131 S-class problems
An LLM Compiler for Parallel Function Calling
AI Powered Knowledge Graph Generator
The SOTA Open-Source Browser Agent
InternLM-XComposer2.5-OmniLive: A Comprehensive Multimodal System
Skywork-R1V is an advanced multimodal AI model series
I Agent designed to interact with ROS1- and ROS2-based robotics system