Sample code and notebooks for Generative AI on Google Cloud
Gemma open-weight LLM library, from Google DeepMind
Tooling for the Common Objects In 3D dataset
code for Mesh R-CNN, ICCV 2019
Language modeling in a sentence representation space
The standard data-centric AI package for data quality and ML
ktrain is a Python library that makes deep learning AI more accessible
Create HTML profiling reports from pandas DataFrame objects
Implements weak-to-strong learning for training stronger ML models
MII makes low-latency and high-throughput inference possible
A python library for self-supervised learning on images
Jittor is a high-performance deep learning framework
Build cross-modal and multimodal applications on the cloud
GLM-4.6V/4.5V/4.1V-Thinking, towards versatile multimodal reasoning
Official Repo For "Sa2VA: Marrying SAM2 with LLaVA
GLM-4.6V/4.5V/4.1V-Thinking, towards versatile multimodal reasoning
Free, high-quality text-to-speech API endpoint to replace OpenAI
OpenMMLab Model Deployment Framework
Multi-Voice and Prompt-Controlled TTS Engine
This repository contains the complete code and data for studying primo
High-Resolution Image Synthesis with Latent Diffusion Models
Real-Time High-Resolution Background Matting
Official code for Style Aligned Image Generation via Shared Attention
Embed images and sentences into fixed-length vectors
Plug-n-play module turning text-to-image models into animation