[NeurIPS 2023] ImageReward: Learning and Evaluating Human Preferences
text and image to video generation: CogVideoX (2024) and CogVideo
21 Lessons, Get Started Building with Generative AI
Chinese and English multimodal conversational language model
A state-of-the-art open visual language model
code for Mesh R-CNN, ICCV 2019
Text-to-Image generation. The repo for NeurIPS 2021 paper
We estimate dense, flicker-free, geometrically consistent depth
A low code unified framework for computer vision and deep learning