GUI for a Vocal Remover that uses Deep Neural Networks
Stable Diffusion web UI
OCR software, free and offline
SkyPilot: Run AI and batch jobs on any infra
Generate audiobooks from EPUBs, PDFs and text with captions
Visual Causal Flow
Usable Implementation of "Bootstrap Your Own Latent" self-supervised
One minute of voice data is enough to train a good TTS model
The easiest way to use deep metric learning in your application
Use Microsoft Edge's online text-to-speech service from Python
AI-data warehouse to enrich, transform and analyze unstructured data
Fast backend for long-term AI user memory via structured profiles
95% token savings. 155x faster queries. 16 languages
A GUI tool for extracting hard-coded subtitles (hardsubs) from videos
Comprehensive Gradio WebUI for audio processing
Build your own Cowork, AI Scientist and other SoTA Agents
Unlimited, private and free Speech-To-Text program
Geometric deep learning extension library for PyTorch
Qwen3-ASR is an open-source series of ASR models
Contexts Optical Compression
LLM-based agent for general purpose software engineering tasks
Let's make video diffusion practical
Machine Learning automation and tracking
A TTS that fits in your CPU (and pocket)
General-purpose image editing model that delivers high-fidelity