AI tool that converts GitHub repositories into interactive diagrams
Contexts Optical Compression
Driving with Graph Visual Question Answering
Autoregressive Model Beats Diffusion
This repository contains the official implementation of FastVLM
Refer and Ground Anything Anywhere at Any Granularity
A modern library for 3D data processing
PS2 Covers Collection
GLM-4.6V/4.5V/4.1V-Thinking, towards versatile multimodal reasoning
Windrecorder is a memory search app that records everything
Weaving the Digital Agent Galaxy
The most powerful Android RPA agent framework
Multimodal Diffusion with Representation Alignment
Generating Immersive, Explorable, and Interactive 3D Worlds
Let's make video diffusion practical
Reference PyTorch implementation and models for DINOv3
Wan2.1: Open and Advanced Large-Scale Video Generative Model
The library to build & auto-optimize LLM applications
Videomass is a free, open-source, cross-platform GUI for FFmpeg
Foundation model for image generation
Create beautiful slides on the web using Claude's frontend skills
CogView4, CogView3-Plus and CogView3 (ECCV 2024)
All-in-one AI productivity platform with agents, workflows, and IM
A Python toolbox for gaining geometric insights
Entity Relation Diagram generation tool