FugattoNVIDIA
|
Starchild-1Odyssey
|
|||||
Related Products
|
||||||
About
Using text and audio as inputs, a new generative AI model from NVIDIA can create any combination of music, voices, and sounds. A team of generative AI researchers created a Swiss Army knife for sound, one that allows users to control the audio output simply using text. While some AI models can compose a song or modify a voice, none have the dexterity of the new offering. Called Fugatto, it generates or transforms any mix of music, voices, and sounds described with prompts using any combination of text and audio files. For example, it can create a music snippet based on a text prompt, remove or add instruments from an existing song, change the accent or emotion in a voice, and even let people produce sounds never heard before. Supporting numerous audio generation and transformation tasks, Fugatto is the first foundational generative AI model that showcases emergent properties.
|
About
Starchild-1 is the first real-time multimodal world model, built to simulate both the visuals and sounds of the world in real time. Unlike language models, which learn from text, world models learn directly from the world itself through pixels, motion, and actions encoded in large-scale video, becoming capable of understanding and simulating an approximation of the world as it evolves. Starchild-1 goes beyond traditional world models, which have mostly focused on visual generation alone, by autoregressively generating synchronized audio and video while continuously responding to streaming user input. Instead of producing a fixed offline clip, it predicts the next audio and video state of a world based on past observations and live inputs, enabling environments, conversations, ambient sound, and world dynamics to change interactively. Users can stream text, speech, and action inputs into the model during rollout, dynamically altering what is seen and heard in real time.
|
|||||
Platforms Supported
Windows
Mac
Linux
Cloud
On-Premises
iPhone
iPad
Android
Chromebook
|
Platforms Supported
Windows
Mac
Linux
Cloud
On-Premises
iPhone
iPad
Android
Chromebook
|
|||||
Audience
Individuals and any user in search of a solution to generate music, voices, and sounds with AI
|
Audience
Interactive AI researchers who need a real-time multimodal world model for synchronized audio-video simulation and responsive virtual environments
|
|||||
Support
Phone Support
24/7 Live Support
Online
|
Support
Phone Support
24/7 Live Support
Online
|
|||||
API
Offers API
|
API
Offers API
|
|||||
Screenshots and Videos |
Screenshots and Videos |
|||||
Pricing
No information available.
Free Version
Free Trial
|
Pricing
No information available.
Free Version
Free Trial
|
|||||
Reviews/
|
Reviews/
|
|||||
Training
Documentation
Webinars
Live Online
In Person
|
Training
Documentation
Webinars
Live Online
In Person
|
|||||
Company InformationNVIDIA
Founded: 1993
United States
blogs.nvidia.com/blog/fugatto-gen-ai-sound-model/
|
Company InformationOdyssey
Founded: 2023
United States
odyssey.ml/introducing-starchild-1
|
|||||
Alternatives |
Alternatives |
|||||
|
|
||||||
|
|
||||||
|
|
||||||
Categories |
Categories |
|||||
Integrations
NVIDIA DGX Cloud
|
||||||
|
|
|