Please do not feed the models
Sharp Monocular Metric Depth in Less Than a Second
Diffusion Transformer with Fine-Grained Chinese Understanding
Transformers4Rec is a flexible and efficient library
A Customizable Image-to-Video Model based on HunyuanVideo
fast C++ library for GPU linear algebra & scientific computing
A Chinese information extraction tool
A fast GPU accelerated feature extraction software for speech analysis
CTC-based forced aligner for audio-text in 158 languages
Compact 8B multimodal instruct model optimized for edge deployment
Small 3B-base multimodal model ideal for custom AI on edge hardware
Efficient 14B multimodal instruct model with edge deployment and FP8