My personal Claude Code configuration
Multimodal-Driven Architecture for Customized Video Generation
Multimodal Diffusion with Representation Alignment
From Images to High-Fidelity 3D Assets
ICLR2024 Spotlight: curation/training code, metadata, distribution
Official implementation of DreamCraft3D
LTX-Video Support for ComfyUI
An experimental version of DeepSeek model
Video Object and Interaction Deletion
A Systematic Framework for Interactive World Modeling
Models for object and human mesh reconstruction
Access to Anthropic's safety-first language model APIs
Open-Source Financial Large Language Models
The official PyTorch implementation of Google's Gemma models
Diffusion Transformer with Fine-Grained Chinese Understanding
Pokee Deep Research Model Open Source Repo
FlashMLA: Efficient Multi-head Latent Attention Kernels
Easy Docker setup for Stable Diffusion with user-friendly UI
Bidirectional token-classification model for identifiable info
Pretrained time-series foundation model developed by Google Research
Official code base for LeWorldModel: Stable End-to-End Joint-Embedding
OpenTinker is an RL-as-a-Service infrastructure for foundation models
Project Lyra: Open Generative 3D World Models
A PyTorch library for implementing flow matching algorithms
Research code artifacts for Code World Model (CWM)