Wan2.2: Open and Advanced Large-Scale Video Generative Model
Deep learning optimization library: makes distributed training easy
Let's make video diffusion practical
SOTA discrete acoustic codec models with 40/75 tokens per second
Awesome multilingual OCR toolkits based on PaddlePaddle
Running large language models on a single GPU
48 kHz stereo neural audio codec for general audio
The official repository for ERNIE 4.5 and ERNIEKit
An implementation of a deep learning recommendation model (DLRM)
Python SDK for the Computer Use model Lux, developed by OpenAGI
Embed images and sentences into fixed-length vectors
PyTorch implementation of MAE
Auto-diff neural network library for high-dimensional sparse tensors