Tencent Hunyuan Multimodal Diffusion Transformer (MM-DiT) model
Multimodal-Driven Architecture for Customized Video Generation
Machine Learning Systems: Design and Implementation
Learn AI and LLMs from scratch using free resources
Code for the paper "Language models can explain neurons in language models"
Plug-n-play module turning text-to-image models into animation
Learning to Act by Watching Unlabeled Online Videos
A Python module for hyperspectral image processing
Deep Hough Voting for 3D Object Detection in Point Clouds
Reinforced Recommendation toolkit built around PyTorch 1.7
The intelligent predictive text entry platform
Machine Learning Python