A GUI tool for extracting hard-coded subtitles (hardsubs) from videos
Implementation of Video Diffusion Models
Implementation of Make-A-Video, a new SOTA text-to-video generator
InvokeAI is a leading creative engine for Stable Diffusion models
Multimodal AI Story Teller, built with Stable Diffusion, GPT, etc.
Label Studio is a multi-type data labeling and annotation tool
Implementation of Phenaki Video, which uses MaskGIT
A walk along memory lane
Implementation of NÜWA, an attention network for text-to-video synthesis
CLIP + FFT/DWT/RGB = text to image/video
Powerful open source team chat application
The data structure for multimodal data
Build AI-powered semantic search applications
Build cross-modal and multimodal applications on the cloud
Generate images from text. In Russian
A version of the AI art creation software based on Disco Diffusion
myplayer Free Karaoke & Media Player Software (Myanmar)
A data augmentations library for audio, image, text, and video
SoundTranscriber can be used to generate automatic transcription / aut
Automatic video transcription and translated subtitle generator
Recursively search all files, text inside files, and bookmarks
Automated file organization in a user-friendly GUI
A web-based MariaDB client
Software tool that converts text to video for a more engaging experience
Utility to make home movies from your digital camera files