My personal Claude Code configuration
Multimodal-Driven Architecture for Customized Video Generation
Multimodal Diffusion with Representation Alignment
From Images to High-Fidelity 3D Assets
ICLR2024 Spotlight: curation/training code, metadata, distribution
Hackable and optimized Transformers building blocks
Official implementation of DreamCraft3D
LTX-Video Support for ComfyUI
An easy 1-click way to create beautiful artwork on your PC using AI
An experimental version of DeepSeek model
Video Object and Interaction Deletion
A Systematic Framework for Interactive World Modeling
Models for object and human mesh reconstruction
Access to Anthropic's safety-first language model APIs
An Efficient Agentic Model for Computer Use
Open-Source Financial Large Language Models
The official PyTorch implementation of Google's Gemma models
Revolutionizing Database Interactions with Private LLM Technology
Diffusion Transformer with Fine-Grained Chinese Understanding
Pokee Deep Research Model Open Source Repo
FlashMLA: Efficient Multi-head Latent Attention Kernels
Easy Docker setup for Stable Diffusion with user-friendly UI
C++ implementation of ChatGLM-6B & ChatGLM2-6B & ChatGLM3 & GLM4(V)
Advancing Open-source World Models
C#/.NET binding of llama.cpp, including LLaMa/GPT model inference