High-Resolution Image Synthesis with Latent Diffusion Models
Nexa SDK is a comprehensive toolkit for supporting ONNX and GGML
Deep Learning API and Server in C++14 support for Caffe, PyTorch
Sharp Monocular Metric Depth in Less Than a Second
Effortless data labeling with AI support from Segment Anything
Diffusion Transformer with Fine-Grained Chinese Understanding
High-performance neural network inference framework for mobile
Tencent Hunyuan Multimodal diffusion transformer (MM-DiT) model
Open-Sora: Democratizing Efficient Video Production for All
A Unified Framework for Image Customization
Chinese and English multimodal conversational language model
Tensor search for humans
MNN is a blazing fast, lightweight deep learning framework
Implementation of 'lightweight' GAN, proposed in ICLR 2021
A set of Docker images for training and serving models in TensorFlow
"Big Model" trains a visual multimodal VLM with 26M parameters
Simplifies the local serving of AI models from any source
Flux 2 image generation model pure C inference
Implementation of Denoising Diffusion Probabilistic Model in Pytorch
Capable of understanding text, audio, vision, video
MII makes low-latency and high-throughput inference possible
Lightning fast C++/CUDA neural network framework
A Universal Customization Method for Single and Multi Conditioning
The data structure for multimodal data
Advancing Open-source World Models