AI-driven multi-agent research assistant automating hypothesis
AI agent designed to interact with ROS1- and ROS2-based robotics systems
Multimodal Agents as Smartphone Users, an LLM-based multimodal agent
LISA: Reasoning Segmentation via Large Language Model
Automate browser-based workflows with LLMs and Computer Vision
Vibe-Trading: Your Personal Trading Agent
AI-Researcher: Autonomous Scientific Innovation
Tokenizer-Free TTS for Multilingual Speech Generation
AI-powered document analysis and tagging for Paperless-ngx
Robust Speech Recognition via Large-Scale Weak Supervision
PandasAI is a Python library that integrates generative AI
AI-Powered Data Processing: Use LOTUS to process all of your datasets
AI assistant for ComfyUI workflow generation, debugging, and tuning
The SOTA Open-Source Browser Agent
Structured data extraction and instruction calling with ML, LLM
AutoAgent: Fully-Automated and Zero-Code LLM Agent Framework
Open source AI pair programmer for coding, debugging, automation
Python library for scraping and analyzing online news articles easily
Unifying 3D Mesh Generation with Language Models
Enhances Tesseract OCR output using LLMs (local or API)
AI bridge enabling assistants to control and automate Unity Editor
Lighter web automation with Python
A high-quality PDF to Markdown tool based on large language model
The fastest way to bring multi-agent workflows to production
Translate a video from one language to another and embed dubbing