A python tool that uses GPT-4, FFmpeg, and OpenCV
Make websites accessible for AI agents
AI video agents framework for next-gen video interactions
A Systematic Framework for Interactive World Modeling
LLM based autonomous agent that does online comprehensive research
Generate high-definition story short videos with one click using AI
Tools to build web AI agents that can authenticate
Wan2.1: Open and Advanced Large-Scale Video Generative Model
AI-powered video clipping and highlight generation
Official repository for LTX-Video
Generate blog articles from video or audio
Opensource browser using agents
An open phone agent model & framework
Multimodal Diffusion with Representation Alignment
Open-Sora: Democratizing Efficient Video Production for All
Automate browser-based workflows with LLMs and Computer Vision
A Powerful web scraper powered by LLM | OpenAI, Gemini & Ollama
Pokee Deep Research Model Open Source Repo
Open-source, high-performance AI model with advanced reasoning
PDF scientific paper translation with preserved formats
RGBD video generation model conditioned on camera input
Eva is an A.I. assistant that helps users multi-task.
Large Multimodal Models for Video Understanding and Editing
AI-driven public opinion trend monitor with multi-platform aggregation
An AI personal assistant for your digital brain