Audio generation using diffusion models, in PyTorch
A gradio web UI for running Large Language Models like LLaMA
An open source RDP server
Robust Speech Recognition via Large-Scale Weak Supervision
Speech recognition module for Python
PyTorch implementation of VALL-E (Zero-Shot Text-To-Speech)
A deep learning toolkit for Text-to-Speech, battle-tested in research
Transcribe any audio to text, translate and edit subtitles 100% locally
Speech-to-text, text-to-speech, and speaker recognition
LilyPond sheet music text editor
A safe home for all your data
Multimodal AI Story Teller, built with Stable Diffusion, GPT, etc.
Integrate with the latest language models, image generation and speech
Label Studio is a multi-type data labeling and annotation tool
JUCE is an open-source cross-platform C++ application framework
Git extension for versioning large files
Web component framework for building ads, emails, websites and more
API samples for the Universal Windows Platform
Remote desktop and file transfer tool
Implementation of AudioLM audio generation model in PyTorch
Lightweight, efficient Tags input component in Vanilla JS
RPG Maker 2000/2003 and EasyRPG games interpreter
A web application that allows users to interact with OpenAI's models
A walk along memory lane
The deep learning toolkit for speech-to-text