A neural network that transforms a design mock-up into a static website
Audiocraft is a library for audio processing and generation
Industrial-level controllable zero-shot text-to-speech system
Framework for building neural networks
TTS with Kokoro and ONNX Runtime
Unofficial Parallel WaveGAN
Advanced evolutionary computation library built on top of PyTorch
PyTorch3D is FAIR's library of reusable components for deep learning
CLIP: predict the most relevant text snippet given an image
Framework that is dedicated to making neural data processing
End-to-end speech processing toolkit
A TTS model capable of generating ultra-realistic dialogue
A fast TTS architecture with conditional flow matching
SOTA discrete acoustic codec models with 40/75 tokens per second
Code for the paper "Language models can explain neurons in language models"
Fundamentals of Machine Learning and Deep Learning
Generate 3D objects conditioned on text or images
Adaptive Intelligence, also known as "Artificial General Intelligence"
Let us control diffusion models
A web UI for different audio-related neural networks
Training and serving large-scale neural networks
Code release for ConvNeXt model
WaveRNN Vocoder + TTS
Classical piano MIDI dataset