My personal Claude Code configuration
Multimodal-Driven Architecture for Customized Video Generation
Multimodal Diffusion with Representation Alignment
From Images to High-Fidelity 3D Assets
ICLR 2024 Spotlight: curation/training code, metadata, distribution
Official implementation of DreamCraft3D
An experimental version of DeepSeek model
LTX-Video Support for ComfyUI
An easy 1-click way to create beautiful artwork on your PC using AI
A Systematic Framework for Interactive World Modeling
Video Object and Interaction Deletion
Code for Mesh R-CNN, ICCV 2019
Open-Source Financial Large Language Models
Models for object and human mesh reconstruction
Z80-μLM is a 2-bit quantized language model
Access to Anthropic's safety-first language model APIs
An Efficient Agentic Model for Computer Use
The official PyTorch implementation of Google's Gemma models
Revolutionizing Database Interactions with Private LLM Technology
Diffusion Transformer with Fine-Grained Chinese Understanding
Pokee Deep Research Model Open Source Repo
FlashMLA: Efficient Multi-head Latent Attention Kernels
Easy Docker setup for Stable Diffusion with user-friendly UI
C++ implementation of ChatGLM-6B & ChatGLM2-6B & ChatGLM3 & GLM4(V)
C#/.NET binding of llama.cpp, including LLaMA/GPT model inference