Learning agent trained in a diffusion world model
Fast3R: Towards 3D Reconstruction of 1000+ Images in One Forward Pass
A fast TTS architecture with conditional flow matching
It's possible for machines to become self-aware.
Meta-Datenbank-Anwendung für die Audio- und TV-Sendungen des CC2.TV
PyTorch Lightning + Hydra. A very user-friendly template
Simple and powerful voice changer for Linux, written with Python & GTK
Facebook AI Research Sequence-to-Sequence Toolkit written in Python
Pytorch framework for doing deep learning on point clouds
Real Time Speech Enhancement in the Waveform Domain (Interspeech 2020)
Easy-OCR solution and Tesseract trainer for GNU/Linux
A multi-modeling and simulation environment to study complex systems
Speech recognition for Ubuntu
...a work in progress.