VMZ: Model Zoo for Video Modeling
Ling is a MoE LLM provided and open-sourced by InclusionAI
A series of math-specific large language models of our Qwen2 series
GLM-4.6V/4.5V/4.1V-Thinking, towards versatile multimodal reasoning
CogView4, CogView3-Plus and CogView3(ECCV 2024)
FAIR Sequence Modeling Toolkit 2
Large-language-model & vision-language-model based on Linear Attention
Pretrained time-series foundation model developed by Google Research
A Customizable Image-to-Video Model based on HunyuanVideo
MapAnything: Universal Feed-Forward Metric 3D Reconstruction
A SOTA open-source image editing model
Open-weight, large-scale hybrid-attention reasoning model
ChatGPT interface with better UI
A state-of-the-art open visual language model
Towards Real-World Vision-Language Understanding
Official DeiT repository
Chat & pretrained large vision language model
Real-time behaviour synthesis with MuJoCo, using Predictive Control
Let us control diffusion models
Chinese LLaMA & Alpaca large language model + local CPU/GPU training
Code release for ConvNeXt V2 model
A minimal PyTorch re-implementation of the OpenAI GPT
Reference implementation of the Transformer architecture optimized
Code release for "Masked-attention Mask Transformer
A mix of GAN implementations including progressive growing