Audio generation using diffusion models, in PyTorch
A gradio web UI for running Large Language Models like LLaMA
Robust Speech Recognition via Large-Scale Weak Supervision
An open source RDP server
Transcribe any audio to text, translate and edit subtitles 100% locall
LilyPond sheet music text editor
A deep learning toolkit for Text-to-Speech, battle-tested in research
Speech recognition module for Python
PyTorch implementation of VALL-E (Zero-Shot Text-To-Speech)
Label Studio is a multi-type data labeling and annotation tool
A safe home for all your data
Remote desktop and file transfer tool
Integrate with the latest language models, image generation and speech
Multimodal AI Story Teller, built with Stable Diffusion, GPT, etc.
Implementation of MusicLM music generation model in Pytorch
Git extension for versioning large files
Transforming Multimodal Content into Captivating Multilingual Audio
Implementation of AudioLM audio generation model in Pytorch
A web application that allows users to interact with OpenAI's models
Web component framework for building ads, emails, websites and more
Speech-to-text, text-to-speech, and speaker recognition
State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX
JUCE is an open-source cross-platform C++ application framework
The most powerful screen recorder & annotation tool for Chrome
API samples for the Universal Windows Platform.