Awesome multilingual OCR toolkits based on PaddlePaddle
GLM-4.5: Open-source LLM for intelligent agents by Z.ai
Clean and efficient FP8 GEMM kernels with fine-grained scaling
Advancing Open-source World Models
GLM-4 series: Open Multilingual Multimodal Chat LMs
gpt-oss-120b and gpt-oss-20b are two open-weight language models
Official inference repo for FLUX.2 models
Moonshot's most powerful AI model
LTX-Video Support for ComfyUI
Image generation model with single-stream diffusion transformer
PyTorch code and models for the DINOv2 self-supervised learning
Python inference and LoRA trainer package for the LTX-2 audio–video
Official code base for LeWorldModel: Stable End-to-End Joint-Embedding
ChatGLM-6B: An Open Bilingual Dialogue Language Model
Access to Anthropic's safety-first language model APIs
Inference code for scalable emulation of protein equilibrium ensembles
Programmatic access to the AlphaGenome model
Global weather forecasting model using graph neural networks and JAX
Ling is a MoE LLM provided and open-sourced by InclusionAI
Large Multimodal Models for Video Understanding and Editing
CLIP, Predict the most relevant text snippet given an image
Qwen3-VL, the multimodal large language model series by Alibaba Cloud
Repo for SeedVR2 & SeedVR
Proxy that exposes Antigravity provided claude / gemini models
Recovering the Visual Space from Any Views