A sound cloning tool with a web interface, using your voice
Implementation of Make-A-Video, new SOTA text to video generator
Synchronized Translation for Videos
Foundational model for human-like, expressive TTS
Pretrained model hub for Keras 3
Create videos with Stable Diffusion
MARS5 speech model (TTS) from CAMB.AI
Real-time voice interactive digital human
A simple, high-quality voice conversion tool focused on ease of use
Multi-lingual large voice generation model, providing inference
Instant voice cloning by MIT and MyShell. Audio foundation model
Python framework for adversarial attacks, and data augmentation
LLM abstractions that aren't obstructions
Sample code and notebooks for Generative AI on Google Cloud
Scalable generative AI framework built for researchers and developers
High-Resolution Image Synthesis with Latent Diffusion Models
Towards Real-World Vision-Language Understanding
Algorithms for outlier, adversarial and drift detection
AutoGluon: AutoML for Image, Text, and Tabular Data
A modular graph-based Retrieval-Augmented Generation (RAG) system
Stable Diffusion web UI
Provides CTP stock options and Zhongtai Securities XTP
Adding guardrails to large language models
Large-language-model & vision-language-model based on Linear Attention
Guiding Instruction-based Image Editing via Multimodal Large Language