Awesome multilingual OCR toolkits based on PaddlePaddle
AlphaFold 3 inference pipeline
The most powerful local music generation model
CodeGeeX: An Open Multilingual Code Generation Model (KDD 2023)
Text and image to video generation: CogVideoX and CogVideo
DeepMind model for tracking arbitrary points across videos & robotics
Open-source large language model family from Tencent Hunyuan
GLM-4 series: Open Multilingual Multimodal Chat LMs
VMZ: Model Zoo for Video Modeling
Ling is an MoE LLM provided and open-sourced by InclusionAI
CogView4, CogView3-Plus and CogView3 (ECCV 2024)
FAIR Sequence Modeling Toolkit 2
Math-specific large language models in our Qwen2 series
Pretrained time-series foundation model developed by Google Research
A Customizable Image-to-Video Model based on HunyuanVideo
MapAnything: Universal Feed-Forward Metric 3D Reconstruction
A SOTA open-source image editing model
Open-weight, large-scale hybrid-attention reasoning model
A state-of-the-art open visual language model
Chat & pretrained large vision language model
Real-time behaviour synthesis with MuJoCo, using Predictive Control
Let us control diffusion models
Chinese LLaMA & Alpaca large language model + local CPU/GPU training
Code release for "Masked-attention Mask Transformer
Learning Continuous Signed Distance Functions for Shape Representation