A TTS model capable of generating ultra-realistic dialogue
AutoGluon: AutoML for Image, Text, and Tabular Data
Virtual AI anchor that combines state-of-the-art technology
Machine-learning conversational dialog engine for creating chatbots
An open source implementation of CLIP
Real-World Centric Foundation GUI Agents
Sample code and notebooks for Generative AI on Google Cloud
Visual Causal Flow
Flexible Photo Recrafting While Preserving Your Identity
Bailing is a voice dialogue robot similar to GPT-4o
MARS5 speech model (TTS) from CAMB.AI
Tensor search for humans
Stanford NLP Python library for many human languages
95% token savings. 155x faster queries. 16 languages
Chinese XLNet pre-trained model
Toolkit for audio, music, and speech generation
Generate Any 3D Scene in Seconds
[NeurIPS 2023] ImageReward: Learning and Evaluating Human Preferences
CogView4, CogView3-Plus, and CogView3 (ECCV 2024)
21 Lessons, Get Started Building with Generative AI
A Repo For Document AI
Real-time voice interactive digital human
OCR expert VLM powered by Hunyuan's native multimodal architecture
SoTA open-source TTS
Unified Multimodal Understanding and Generation Models