An MCP server for interacting with Google Colab
A lossless video/GIF/image upscaler achieved with waifu2x, Anime4K
Fast-stable-diffusion + DreamBooth
Create videos with Stable Diffusion
Synchronized Translation for Videos
One-click deployment (including offline integration package)
text and image to video generation: CogVideoX (2024) and CogVideo
Unified open dataset enabling cross-embodiment learning for robotics
Solve puzzles. Learn CUDA
DeepMind model for tracking arbitrary points across videos & robotics
Speech-AI-Forge is a project developed around TTS generation model
BertViz: Visualize Attention in NLP Models (BERT, GPT2, BART, etc.)
Global weather forecasting model using graph neural networks and JAX
A simple, high-quality voice conversion tool focused on ease of use
An open source library for GPU-accelerated robot learning
Generate audiobooks from e-books
Build animated charts in Jupyter Notebook and similar environments
Towards Human-Sounding Speech
Gemma open-weight LLM library, from Google DeepMind
AI discovers 520000 stable inorganic crystal structures for research
Style-Bert-VITS2: Bert-VITS2 with more controllable voice styles
A Python package for interactive mapping and geospatial analysis
Implementation of Dreambooth
Di♪♪Rhythm: Blazingly Fast & Simple End-to-End Song Generation
Run GGUF models easily with a UI or API. One File. Zero Install.