Image generation model with single-stream diffusion transformer
MapAnything: Universal Feed-Forward Metric 3D Reconstruction
Automate browser-based workflows with LLMs and Computer Vision
One-click local MCP server installation in desktop apps
Subtitle Creation Assistant
Open source large language model by Alibaba
A tool that automates complex file operations.
Easy AI Softwares for Blind, Deaf, Handicapped, Disabled People
Open-source, high-performance Mixture-of-Experts large language model
A graphical manager for ollama that can manage your LLMs
Free & Easy AI Voice Accounting Software For Blind & Speechless People
Award-winning modern data processing SDK in C++20
Easy Tools of PDF, Image, File, Network, Data, and Medias
Locally run an Instruction-Tuned Chat-Style LLM
Common Resource Grep
Generates a sound given: volume, frequency, duration
Userge, Durable as a Serge
Twitch YouTube bot. Automatically make video compilations
Facebook AI research's automatic speech recognition toolkit
Simple Windows application to OCR images
CIntruder - OCR Bruteforcing Toolkit