PyTorch implementation of VALL-E (Zero-Shot Text-To-Speech)
Transforming Multimodal Content into Captivating Multilingual Audio
Open source personal AI Assistant for Linux, Windows and Mac
Implementation of DALL-E 2, OpenAI's updated text-to-image synthesis
An open-source, modern-design AI chat framework
Arduino library to play MOD, WAV, FLAC, MIDI, RTTTL, MP3
In-App assistant SDK to build a multimodal conversational UX for iOS
In-App assistant SDK to build a multimodal conversational UX websites
elevenlabs-api is an open source Java wrapper around the ElevenLabs
Convert AI papers to GUI
Implementation of Imagen, Google's Text-to-Image Neural Network
Assistant SDK to build a multimodal conversational UX for Android
Implementation of NÜWA, attention network for text to video synthesis
Implementation of Video Diffusion Models
Omilo is a simple text to speech application
An open-source, multilingual text-to-speech synthesis system
Free open source speech synthesizer for Russian and other languages
Create synth presets from words
A GNU/Linux operating system accessible for visually impaired.
Nyquist is a language for sound synthesis and music composition.
SDK to build a multimodal conversational UX for Flutter apps
MARS5 is a fully open-source, hyper-realistic text-to-speech (TTS).