A team of engineers from Google has presented a new music generation AI system called MusicLM. The model creates high-quality music from textual descriptions such as "a calming violin melody backed by a distorted guitar riff." It works in a similar way to DALL-E, which generates images from text.
MusicLM uses AudioLM's multi-stage autoregressive modeling as its generative component, extending it to text conditioning. To address the main challenge, the scarcity of paired music-text data, the researchers used MuLan: a joint music-text model trained to project music and its corresponding text description to representations that lie close to each other in a shared embedding space.
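The idea of a joint embedding space can be sketched with a toy example. The snippet below is a minimal illustration, not the actual MuLan model: it assumes stand-in random vectors in place of the learned audio and text encoders, and shows how cosine similarity in a shared space lets a caption be matched to a clip without explicit paired labels at generation time.

```python
import numpy as np

rng = np.random.default_rng(0)

def l2_normalize(x):
    # Project vectors onto the unit sphere so a dot product equals cosine similarity.
    return x / np.linalg.norm(x, axis=-1, keepdims=True)

# Stand-ins for the outputs of MuLan's audio and text towers
# (the real model uses learned neural encoders; the dimension is illustrative).
audio_embeddings = l2_normalize(rng.normal(size=(4, 128)))  # 4 music clips
text_embedding = l2_normalize(rng.normal(size=(128,)))      # one caption

# Cosine similarity between the caption and each clip. During contrastive
# training, the matching pair is pushed toward high similarity and
# non-matching pairs toward low similarity.
similarities = audio_embeddings @ text_embedding
best_match = int(np.argmax(similarities))
print(best_match, similarities.round(3))
```

With trained encoders, the highest-scoring clip would be the one whose content matches the caption; here the scores are random and serve only to show the mechanics.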
Trained on a large dataset of unlabeled music, the model treats conditional music generation as a hierarchical sequence modeling task and generates music at 24 kHz that remains consistent over several minutes. To address the lack of evaluation data, the developers released MusicCaps, a new high-quality music caption dataset with 5,500 music-text pairs prepared by expert musicians.
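Hierarchical sequence modeling here means generating coarse tokens first and refining them in later stages, as in AudioLM. The sketch below is a schematic under stated assumptions: the stage functions, vocabulary sizes, and token counts are illustrative placeholders (random draws stand in for the autoregressive Transformers), meant only to show how the stages chain together.

```python
import random

random.seed(0)

# Illustrative vocabulary sizes, not MusicLM's actual codebooks.
SEMANTIC_VOCAB, ACOUSTIC_VOCAB = 1024, 4096

def semantic_stage(conditioning_tokens, length):
    # Stage 1: coarse "semantic" tokens capture long-term structure,
    # conditioned on MuLan music-text tokens. Random draws stand in
    # for an autoregressive model here.
    return [random.randrange(SEMANTIC_VOCAB) for _ in range(length)]

def acoustic_stage(semantic_tokens, frames_per_token=2):
    # Stage 2: fine "acoustic" tokens add the detail needed for waveform
    # synthesis; several acoustic frames are emitted per semantic token.
    return [random.randrange(ACOUSTIC_VOCAB)
            for _ in semantic_tokens for _ in range(frames_per_token)]

mulan_tokens = [7, 42, 99]  # hypothetical conditioning tokens
semantic = semantic_stage(mulan_tokens, length=8)
acoustic = acoustic_stage(semantic)
print(len(semantic), len(acoustic))  # 8 16
```

The hierarchy is what lets the model stay coherent for minutes: long-range structure lives in the short semantic sequence, while acoustic detail is filled in locally.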
Experiments demonstrate that MusicLM outperforms previous systems in both audio quality and adherence to the text description. In addition, MusicLM can be conditioned on both text and melody: it can generate music in the style described in the text while transforming a given melody, even one that was whistled or hummed.
See the model demo on the project website.
The AI system was taught to create music by training it on a dataset of 5 million audio clips, representing 280,000 hours of songs. MusicLM can create pieces of various lengths: it can generate a quick riff or an entire song, and it can even go further, producing songs with alternating sections, as is often the case in symphonies, to create a sense of narrative. The system can also handle specific requests, such as for certain instruments or a certain genre, and can even generate a semblance of vocals.
The creation of MusicLM is part of a broader wave of deep-learning AI applications designed to reproduce human mental abilities, such as conversing, writing papers, drawing, taking tests, or proving mathematical theorems.
For now, the developers have announced that Google will not release the system for public use. Testing has shown that roughly 1% of the music generated by the model is copied directly from an existing performance, so they are wary of content misappropriation and lawsuits.