Nvidia has developed an AI model called Fugatto to create realistic sounds
27.11.24
Nvidia has unveiled a new experimental generative AI called Foundational Generative Audio Transformer Opus 1 (Fugatto), which it describes as a universal tool for working with sound. This model is capable of both creating new audio files based on text prompts and modifying existing music, voice, and sound recordings.
Fugatto was developed by an international team of researchers, which made it possible to make the model more adaptable to different accents and languages. Rafael Valle, manager of applied audio research at Nvidia, emphasized that the main goal was to create a model that understands and generates sound the way humans do.
Applications of Fugatto:
- Music industry: Prototype songs with the ability to edit style, instruments, or voices.
- Language learning: Generate educational materials with voice customization to the user’s preference.
- Video games: Create dynamic sound effects that adapt to player choices and actions.
- Complex compositions: Combine commands to generate unique effects, such as an angry accented voice or birds singing against a thunderstorm.
Fugatto can perform tasks that it was not directly trained to do through customization. For example, create sounds that change over time, such as the increasing noise of rain.
Despite the innovation, Nvidia has not yet revealed whether Fugatto will be available to a wider audience. This highlights the competitive struggle in the field of generative audio: similar technologies have already been presented by Meta (their tool generates sounds from text descriptions) and Google with the MusicLM model, which converts text into music.
Don't miss interesting news
Subscribe to our channels and read announcements of high-tech news, tes
Samsung Galaxy Tab S10 Ultra (SM-X926B) tablet: many
The new Samsung Galaxy Tab S10 Ultra tablet has a large 14.6” screen, a top-of-the-line Mediatek Dimensity 9300 processor, and an S Pen stylus. Let’s try to figure out what this device is for.
Nvidia has developed an AI model called Fugatto to create realistic sounds artificial intelligence development Nvidia
Fugatto може виконувати завдання, яким не навчалася безпосередньо завдяки налаштуванню. Наприклад, створювати звуки, що змінюються з часом, такі як наростаючий шум дощу.
Nubia Watch GT smartwatch with AMOLED display has up to 15 days of battery life smart watches
The Nubia Watch GT supports over 100 sports modes and is equipped with dual-frequency GPS navigation for more accurate route tracking.