Sample code and notebooks for Generative AI on Google Cloud
Simplest working implementation of StyleGAN2
Gemma open-weight LLM library, from Google DeepMind
Framework for building neural networks
A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming
TensorBoard for PyTorch (and Chainer, MXNet, NumPy, etc.)
The standard data-centric AI package for data quality and ML
GLM-4.6V/4.5V/4.1V-Thinking, towards versatile multimodal reasoning
HunyuanVideo: A Systematic Framework For Large Video Generation Model
Open source demo platform where you can easily showcase your AI models
ktrain is a Python library that makes deep learning AI more accessible
MII makes low-latency and high-throughput inference possible
Check links in web documents or full websites
LISA: Reasoning Segmentation via Large Language Model
Skywork-R1V is an advanced multimodal AI model series
Refer and Ground Anything Anywhere at Any Granularity
Language modeling in a sentence representation space
High-Resolution Image Synthesis with Latent Diffusion Models
Build cross-modal and multimodal applications on the cloud
Organize files/images from a CSV or XLSX file
A powerful, free and open-source tool for TextureAtlases/Spritesheets
Plug-n-play module turning text-to-image models into animation
Guiding Instruction-based Image Editing via Multimodal Large Language Models
Run GGUF models easily with a UI or API. One File. Zero Install.