Machine Learning Systems: Design and Implementation
Tencent Hunyuan Multimodal diffusion transformer (MM-DiT) model
Code for Language models can explain neurons in language models paper
Multimodal-Driven Architecture for Customized Video Generation
Plug-n-play module turning text-to-image models into animation
Learning to Act by Watching Unlabeled Online Videos
A python module for hyperspectral image processing
Deep Hough Voting for 3D Object Detection in Point Clouds
Reinforced Recommendation toolkit built around pytorch 1.7