Robust Speech Recognition via Large-Scale Weak Supervision
The most powerful local music generation model
Image inpainting tool powered by SOTA AI Model
Agentic, Reasoning, and Coding (ARC) foundation models
OBLITERATE THE CHAINS THAT BIND YOU
Awesome multilingual OCR toolkits based on PaddlePaddle
Automatic Speech Recognition with Word-level Timestamps
A GUI tool for extracting hard-coded subtitle (hardsub) from videos
OCR software, free and offline
Fast and memory-efficient exact attention
NVR with realtime local object detection for IP cameras
Wan2.1: Open and Advanced Large-Scale Video Generative Model
The largest open-source medical AI skills library for OpenClaw
1 min voice data can also be used to train a good TTS model
The agent that grows with you
An Open Source implementation of Notebook LM with more flexibility
GLM-4.5: Open-source LLM for intelligent agents by Z.ai
A community-supported supercharged version of paperless
Official inference repo for FLUX.1 models
Lets make video diffusion practical
A lightweight audio-to-MIDI converter with pitch bend detection
Native and Compact Structured Latents for 3D Generation
AI Fully Automated Short Video Engine
AI video generator optimized for low VRAM and older GPUs use
Unified web UI for training and running open models locally