I Agent designed to interact with ROS1- and ROS2-based robotics system
Multimodal Agents as Smartphone Users, an LLM-based multimodal agent
Automate browser-based workflows with LLMs and Computer Vision
Vibe-Trading: Your Personal Trading Agent
LISA: Reasoning Segmentation via Large Language Model
AI-Researcher: Autonomous Scientific Innovation
Tokenizer-Free TTS for Multilingual Speech Generation
AI-powered document analysis and tagging for Paperless-ngx
Robust Speech Recognition via Large-Scale Weak Supervision
PandasAI is a Python library that integrates generative AI
AI-Powered Data Processing: Use LOTUS to process all of your datasets
Structured data extraction and instruction calling with ML, LLM
AI assistant for ComfyUI workflow generation, debugging, and tuning
The SOTA Open-Source Browser Agent
AutoAgent: Fully-Automated and Zero-Code LLM Agent Framework
Open source AI pair programmer for coding, debugging, automation
Enhances Tesseract OCR output using LLMs (local or API)
The fastest way to bring multi-agent workflows to production
AI bridge enabling assistants to control and automate Unity Editor
A high-quality PDF to Markdown tool based on large language model
Unifying 3D Mesh Generation with Language Models
Collection of Kaggle Solutions and Ideas
The AI toolkit for the AI developer
Open source libraries and APIs to build custom preprocessing pipelines
Request recommended movies, TV shows and anime to Jellyseer/Overseer