Large Multimodal Models for Video Understanding and Editing
CogView4, CogView3-Plus and CogView3(ECCV 2024)
Open-source image generative foundation model
An experimental version of DeepSeek model
Ling is a MoE LLM provided and open-sourced by InclusionAI
State-of-the-art (SoTA) text-to-video pre-trained model
Release for Improved Denoising Diffusion Probabilistic Models
Code release for ConvNeXt V2 model
Code release for "Masked-attention Mask Transformer
Code for the paper "Improved Techniques for Training GANs"
Tencent’s 36-language state-of-the-art translation model