Qwen3 is the large language model series developed by Qwen team
Documentation for Google's Gen AI site - including Gemini API & Gemma
Generate audiobooks from e-books, voice cloning & 1107+ languages
Foundational model for human-like, expressive TTS
Create videos with Stable Diffusion
Pretrained model hub for Keras 3
Automatically translates the text of a video based on a subtitle file
Implementation of AudioLM audio generation model in Pytorch
Real-time voice interactive digital human
Python framework for adversarial attacks, and data augmentation
Generate audiobooks from e-books
A sound cloning tool with a web interface, using your voice
MARS5 speech model (TTS) from CAMB.AI
Synchronized Translation for Videos
Sample code and notebooks for Generative AI on Google Cloud
LLM abstractions that aren't obstructions
Simple, Pythonic building blocks to evaluate LLM applications
Scalable generative AI framework built for researchers and developers
Designed for text embedding and ranking tasks
A simple, high-quality voice conversion tool focused on ease of use
Algorithms for outlier, adversarial and drift detection
Towards Real-World Vision-Language Understanding
Adding guardrails to large language models
Instant voice cloning by MIT and MyShell. Audio foundation model
AutoGluon: AutoML for Image, Text, and Tabular Data