Audiocraft is a library for audio processing and generation
Native and Compact Structured Latents for 3D Generation
Restoring old and blurry face photos with AI
Hardware-accelerated video transcoding using Android MediaCodec APIs
A speech-text foundation model for real time dialogue
Streaming Real-time Audio-Driven Avatar Generation
A lightning fast audio upsampler
Python implementation of global optimization with gaussian processes
Software that uses AI to perform real-time voice conversion
OpenMMLab Model Deployment Framework
Video Frame Interpolation & Super Resolution using NVIDIA's TensorRT
A Strong and Easy-to-use Single View 3D Hand+Body Pose Estimator
State-of-the-art deep learning based audio codec
The deep learning toolkit for speech-to-text
Learning to Act by Watching Unlabeled Online Videos
Extreme Attention Guided Salient Object Tracing Network
Open source embedded speech-to-text engine
Deep learning for text to speech
We estimate dense, flicker-free, geometrically consistent depth
Starter code for working with the YouTube-8M dataset
Basic Utilities for PyTorch Natural Language Processing (NLP)
Neural network 3D visualization framework
Leadsheet notation with auto-generated playback, improvisation advice