GUI for a Vocal Remover that uses Deep Neural Networks
Web interface for generating images using Stable Diffusion models
Stable Diffusion web UI
OCR software, free and offline
SkyPilot: Run AI and batch jobs on any infra
Use Microsoft Edge's online text-to-speech service from Python
Generate audiobooks from EPUBs, PDFs and text with captions
Usable Implementation of "Bootstrap Your Own Latent" self-supervised
The easiest way to use deep metric learning in your application
Fast backend for long-term AI user memory via structured profiles
One minute of voice data is enough to train a good TTS model
AI-data warehouse to enrich, transform and analyze unstructured data
Comprehensive Gradio WebUI for audio processing
A GUI tool for extracting hard-coded subtitles (hardsubs) from videos
Let's make video diffusion practical
LLM-based agent for general purpose software engineering tasks
Unlimited, private and free Speech-To-Text program
A Python tool that uses GPT-4, FFmpeg, and OpenCV
A Repo For Document AI
Unified Model Serving Framework
Contexts Optical Compression
Official repository for LTX-Video
Machine Learning automation and tracking
Elyra extends JupyterLab with an AI-centric approach
Python library and CLI tool to interface with Google Translate