Contexts Optical Compression
Video understanding codebase from FAIR for reproducing video models
GLM-4.6V/4.5V/4.1V-Thinking, towards versatile multimodal reasoning
Language modeling in a sentence representation space
GLM-4.6V/4.5V/4.1V-Thinking, towards versatile multimodal reasoning
Code release for ConvNeXt V2 model
The official pytorch implementation of our paper