GUI for a Vocal Remover that uses Deep Neural Networks
Web interface for generating images using Stable Diffusion models
OCR software, free and offline
Stable Diffusion web UI
Power Your World with AI
SkyPilot: Run AI and batch jobs on any infra
Usable Implementation of "Bootstrap Your Own Latent" self-supervised
Generate audiobooks from EPUBs, PDFs and text with captions
The easiest way to use deep metric learning in your application
Use Microsoft Edge's online text-to-speech service from Python
Node.js example app from the OpenAI API quickstart tutorial
AI-data warehouse to enrich, transform and analyze unstructured data
1 min voice data can also be used to train a good TTS model
Fast backend for long-term AI user memory via structured profiles
Free, local, open-source Cowork for Gemini CLI, Claude Code, Codex
Fast inference engine for Transformer models
A GUI tool for extracting hard-coded subtitle (hardsub) from videos
Comprehensive Gradio WebUI for audio processing
95% token savings. 155x faster queries. 16 languages
Adds powerful web scraping and search to Cursor and Claude
Lets make video diffusion practical
Open source platform for the machine learning lifecycle
A python tool that uses GPT-4, FFmpeg, and OpenCV
LLM-based agent for general purpose software engineering tasks
Contexts Optical Compression