Real-time image and video processing library similar to GPUImage
Effortless data labeling with AI support from Segment Anything
Diffusion Transformer with Fine-Grained Chinese Understanding
Library for efficient similarity search and clustering of dense vectors
High-performance neural network inference framework for mobile
Tencent Hunyuan Multimodal diffusion transformer (MM-DiT) model
Open-Sora: Democratizing Efficient Video Production for All
A Unified Framework for Image Customization
Chinese and English multimodal conversational language model
Tensor search for humans
MNN is a blazing fast, lightweight deep learning framework
A language for fast, portable data-parallel computation
A high performance anime upscaler
Implementation of 'lightweight' GAN, proposed in ICLR 2021
A set of Docker images for training and serving models in TensorFlow
"Big Model" trains a vision-language model (VLM) with 26M parameters
Simplifies the local serving of AI models from any source
Pure C inference for the Flux 2 image generation model
Implementation of Denoising Diffusion Probabilistic Model in PyTorch
MII makes low-latency and high-throughput inference possible
Capable of understanding text, audio, vision, and video
Lightning fast C++/CUDA neural network framework
A Universal Customization Method for Single and Multi Conditioning
The data structure for multimodal data
Advancing Open-source World Models