Janus-Series: Unified Multimodal Understanding and Generation Models
Ainee - AI Notetaking and Learning Companion
Point cloud diffusion for 3D model synthesis
Chinese-language edition of Dive into Deep Learning
Code for "Image Generation from Scene Graphs", Johnson et al, CVPR 201
Tool for parsing command-line arguments and configuration files.
MSTParser is a non-projective dependency parser that searches for maxi
err is a plugin-based chatbot designed to be easily extensible
ICLR 2024 Spotlight: curation/training code, metadata, distribution
Dia-1.6B generates lifelike English dialogue and vocal expressions
CTC-based forced aligner for audio-text in 158 languages
Vision-language-action model for robot control via images and text