Self-supervised visual learning using momentum contrast in PyTorch
GLM-4.6V/4.5V/4.1V-Thinking, towards versatile multimodal reasoning
Jupyter magics and kernels for working with remote Spark clusters
Turns Data and AI algorithms into production-ready web applications
Transform your favorite cities into beautiful, minimalist designs
Python inference and LoRA trainer package for the LTX-2 audio–video
Lets make video diffusion practical
Wan2.1: Open and Advanced Large-Scale Video Generative Model
Towards Studio-Grade Character Animation via In-Context Learning of 3D
Official Repo For "Sa2VA: Marrying SAM2 with LLaVA
Taming Stable Diffusion for Lip Sync
"Big Model" trains a visual multimodal VLM with 26M parameters
Open source feature flagging and remote config service
Parallel computing with task scheduling
Elyra extends JupyterLab with an AI centric approach
Is a portable web server suite for windows 64Bit, for Web Development.
OCR expert VLM powered by Hunyuan's native multimodal architecture
Progressbar 2 - A progress bar for Python 2 and Python 3
The book "Performance Analysis and Tuning on Modern CPU"
Inference script for Oasis 500M
Let agents classify your bank transactions
Fast, powerful, git-native ticket tracking in a single bash script
ICLR2024 Spotlight: curation/training code, metadata, distribution
Flexible Photo Recrafting While Preserving Your Identity
A command-line utility for taking automated screenshots of websites