Web interface for generating images using Stable Diffusion models
GUI for a Vocal Remover that uses Deep Neural Networks
Stable Diffusion web UI
OCR software, free and offline
SkyPilot: Run AI and batch jobs on any infra
Generate audiobooks from EPUBs, PDFs and text with captions
AI-data warehouse to enrich, transform and analyze unstructured data
Use Microsoft Edge's online text-to-speech service from Python
Usable Implementation of "Bootstrap Your Own Latent" self-supervised
Fast backend for long-term AI user memory via structured profiles
The easiest way to use deep metric learning in your application
1 min voice data can also be used to train a good TTS model
Comprehensive Gradio WebUI for audio processing
Node.js example app from the OpenAI API quickstart tutorial
Fast inference engine for Transformer models
Lets make video diffusion practical
Adds powerful web scraping and search to Cursor and Claude
A Repo For Document AI
LLM-based agent for general purpose software engineering tasks
Machine Learning automation and tracking
A python tool that uses GPT-4, FFmpeg, and OpenCV
Unified Model Serving Framework
Official repository for LTX-Video
A high-performance image compression microservice based on MCP
Contexts Optical Compression