Implementation of Make-A-Video, new SOTA text to video generator
Implementation of Video Diffusion Models
A GUI tool for extracting hard-coded subtitle (hardsub) from videos
Text generator is a handy plugin for Obsidian
Application that simplifies the installation of AI-related projects
A gradio web UI for running Large Language Models like LLaMA
Build AI-powered applications with React, Svelte, Vue, and Solid
Recognition and resolution of numbers, units, date/time, etc.
One API for plugins and datasets, one interface for prompt engineering
🔮 ChatGPT Desktop Application (Mac, Windows and Linux)
Dev tools to reliably understand text and automate conversations
Multimodal AI Story Teller, built with Stable Diffusion, GPT, etc.
InvokeAI is a leading creative engine for Stable Diffusion models
A deep learning toolkit for Text-to-Speech, battle-tested in research
LLM Frontend for Power Users
Integrate with the latest language models, image generation and speech
An open-source, modern-design AI chat framework
NVR with realtime local object detection for IP cameras
PyTorch implementation of VALL-E (Zero-Shot Text-To-Speech)
Subtitle Creation Assistant
Transcribe any audio to text, translate and edit subtitles 100% locall
Dealing with all unstructured data, such as reverse image search
Open-source SQL AI Agent for Text-to-SQL. Make Text2SQL Easy
Label Studio is a multi-type data labeling and annotation tool
Implementation of Phenaki Video, which uses Mask GIT