Audio generation using diffusion models, in PyTorch
A gradio web UI for running Large Language Models like LLaMA
An open source RDP server
Robust Speech Recognition via Large-Scale Weak Supervision
Speech recognition module for Python
A deep learning toolkit for Text-to-Speech, battle-tested in research
PyTorch implementation of VALL-E (Zero-Shot Text-To-Speech)
LilyPond sheet music text editor
Speech-to-text, text-to-speech, and speaker recognition
A safe home for all your data
The deep learning toolkit for speech-to-text
Venom is the most complete javascript library for Whatsapp
Multimodal AI Story Teller, built with Stable Diffusion, GPT, etc.
Implementation of AudioLM audio generation model in Pytorch
Web component framework for building ads, emails, websites and more
A web application that allows users to interact with OpenAI's models
Integrate with the latest language models, image generation and speech
Remote desktop and file transfer tool
A real-time collaborative document editor for the web
Git extension for versioning large files
Implementation of MusicLM music generation model in Pytorch
Label Studio is a multi-type data labeling and annotation tool
Easy-to-use Speech Toolkit including Self-Supervised Learning model
API samples for the Universal Windows Platform.
JUCE is an open-source cross-platform C++ application framework