Robust Speech Recognition via Large-Scale Weak Supervision
3D reconstruction software
Run Local LLMs on Any Device. Open-source
Wan2.2: Open and Advanced Large-Scale Video Generative Model
Focus on prompting and generating
YOLOv5 is the world's most loved vision AI
A GUI tool for extracting hard-coded subtitle (hardsub) from videos
Your agent in your terminal, equipped with local tools
OCRmyPDF adds an OCR text layer to scanned PDF files
Image/video AI upscaler app (BSRGAN)
Agentic, Reasoning, and Coding (ARC) foundation models
RGBD video generation model conditioned on camera input
An Async Bot/API wrapper for Twitch made in Python
The most powerful and modular diffusion model GUI, api and backend
Open source machine learning framework to automate text conversations
Python-based neural networks API
A Telegram RSS bot that cares about your reading experience
Open source personal AI Assistant for Linux, Windows and Mac
Powerful AI language model (MoE) optimized for efficiency/performance
Free, open source crypto trading bot
Comprehensive Gradio WebUI for audio processing
Stable Diffusion web UI
NVR with realtime local object detection for IP cameras
Powerful tool that lets you create and run intelligent agents
An open and fair framework for everyone to build AI agents