An MCP server for interacting with Google Colab
Fast-stable-diffusion + DreamBooth
Create videos with Stable Diffusion
Synchronized Translation for Videos
One-click deployment (including offline integration package)
text and image to video generation: CogVideoX (2024) and CogVideo
Solve puzzles. Learn CUDA
Speech-AI-Forge is a project developed around TTS generation model
BertViz: Visualize Attention in NLP Models (BERT, GPT2, BART, etc.)
DeepMind model for tracking arbitrary points across videos & robotics
Global weather forecasting model using graph neural networks and JAX
A simple, high-quality voice conversion tool focused on ease of use
Generate audiobooks from e-books
AI discovers 520000 stable inorganic crystal structures for research
Towards Human-Sounding Speech
Gemma open-weight LLM library, from Google DeepMind
Style-Bert-VITS2: Bert-VITS2 with more controllable voice styles
Implementation of Dreambooth
Di♪♪Rhythm: Blazingly Fast & Simple End-to-End Song Generation
Run GGUF models easily with a UI or API. One File. Zero Install.
Run Mixtral-8x7B models in Colab or consumer desktops
Unofficial Parallel WaveGAN
Open source implementation of Microsoft's VALL-E X zero-shot TTS model
Run any Llama 2 locally with gradio UI on GPU or CPU from anywhere
An Open Toolkit for Knowledge Graph Extraction and Construction