Diffusion Transformer with Fine-Grained Chinese Understanding
Library for efficient similarity search and clustering dense vectors
Tencent Hunyuan Multimodal diffusion transformer (MM-DiT) model
Open-Sora: Democratizing Efficient Video Production for All
A Unified Framework for Image Customization
Tensor search for humans
A language for fast, portable data-parallel computation
A high performance anime upscaler
Implementation of 'lightweight' GAN, proposed in ICLR 2021
A set of Docker images for training and serving models in TensorFlow
"Big Model" trains a visual multimodal VLM with 26M parameters
Simplifies the local serving of AI models from any source
Implementation of Denoising Diffusion Probabilistic Model in Pytorch
Flux 2 image generation model pure C inference
MII makes low-latency and high-throughput inference possible
Lightning fast C++/CUDA neural network framework
Capable of understanding text, audio, vision, video
Ready-to-run Docker images containing Jupyter applications
A Universal Customization Method for Single and Multi Conditioning
The data structure for multimodal data
Advancing Open-source World Models
Easy Docker setup for Stable Diffusion with user-friendly UI
Modern C++ Terminal Emulator
Geometric deep learning extension library for PyTorch
RGBD video generation model conditioned on camera input