Programs to process GoPro MP4 & Generic GPX/FIT files
Benchmarking Multimodal Agents for Open-Ended Tasks
AI-friendly PPT builder skill: 17 hand-polished Chinese PPTX templates
Misc; latest version of waifu2x; 2D video to stereo 3D video
About 24 Lessons, 12 Weeks, Get Started as a Web Developer
Entity Relation Diagrams generation tool
Azure command-line interface
Let agents classify your bank transactions
Stable Virtual Camera: Generative View Synthesis with Diffusion Models
Open source feature flagging and remote config service
Phi-3.5 for Mac: Locally-run Vision and Language Models
Qwen3-omni is a natively end-to-end, omni-modal LLM
Browse the web, directly from Cursor etc.
[NeurIPS 2023] ImageReward: Learning and Evaluating Human Preferences
Main repository for Vispy
Elyra extends JupyterLab with an AI centric approach
Videomass is a free, open source and cross-platform GUI for FFmpeg
RAG-Anything: All-in-One RAG Framework
Foundation model for image generation
Open-source and free to self-host
Automate native Android apps with AI using accessibility APIs
Data manipulation and transformation for audio signal processing
Python package for AutoML on Tabular Data with Feature Engineering
Open multimodal web agent built by Ai2
Zero-code platform for building AI agents from natural language input