3D reconstruction software
Public repository for Agent Skills
Faster Whisper transcription with CTranslate2
Automatic Speech Recognition with Word-level Timestamps
OBLITERATE THE CHAINS THAT BIND YOU
Instant voice cloning by MIT and MyShell. Audio foundation model
Lets make video diffusion practical
Comprehensive Gradio WebUI for audio processing
Wan2.1: Open and Advanced Large-Scale Video Generative Model
The most powerful local music generation model
1 min voice data can also be used to train a good TTS model
Official inference repo for FLUX.1 models
Powerful AI language model (MoE) optimized for efficiency/performance
AI video generator optimized for low VRAM and older GPUs use
OCR software, free and offline
Advanced language and coding AI model
NVR with realtime local object detection for IP cameras
Improve your Baduk skills by training with KataGo
Oobabooga - The definitive Web UI for local AI, with powerful features
Deepfakes Software For All
AI-powered video clipping and highlight generation
Native and Compact Structured Latents for 3D Generation
Generate audiobooks from e-books, voice cloning & 1107+ languages
Advanced LLM-powered brute-force tool combining AI intelligence
A theoretical reconstruction of the Claude Mythos architecture