Label Studio is a multi-type data labeling and annotation tool
InvokeAI is a leading creative engine for Stable Diffusion models
Multimodal AI Story Teller, built with Stable Diffusion, GPT, etc.
Easy-to-use Speech Toolkit including Self-Supervised Learning model
Implementation of NÜWA, attention network for text to video synthesis
A walk along memory lane
A Telegram bot that integrates with OpenAI's official ChatGPT APIs
Multi-Modal Neural Networks for Semantic Search, based on Mid-Fusion
The data structure for multimodal data
Adversarial Robustness Toolbox (ART) - Python Library for ML security
Build AI-powered semantic search applications
Data Lake for Deep Learning. Build, manage, and query datasets
The Triton Inference Server provides an optimized cloud
Build cross-modal and multimodal applications on the cloud
Based on the Disco Diffusion, version of the AI art creation software
SoundTranscriber can be used to generate automatic transcription / aut
Implementation of NWT, audio-to-video generation, in Pytorch
Easy-OCR solution and Tesseract trainer for GNU/Linux
IPTV/NVR/CCTV/Video cloud https://fastocloud.com
PyTorch implementation of convolutional neural networks
Cross Audio-Visual Recognition using 3D Architectures
Just Another Speech Recognition and Text to Speech software.