Where is the Dalle2 for music🎶? (8 reasons why it is not that simple)

by joshdance

I have loved learning about generative AI. I use Dalle2, Stable Diffusion, and Midjourney a lot.

It got me thinking: where is the Dalle2 for music? Put a prompt in a text box and get music back?

I did a lot of research, including gathering a lot of links from Indiehackers, and wrote this up.

https://mythicalai.substack.com/p/where-is-dalle2-for-creating-music

Short version: music is harder than images for 8 reasons.

Lack of data
Lawyers
Music is more like video than a still image
Music takes longer to consume
Digital instruments are worse than real world instruments
We are more strict in our evaluation of music
Music is more subjective
Lyrics are an additional difficulty layer

While there are issues, I have no doubt that we will see a text-to-music generation model in the next few years.

There are also fun music generation tools you can play with right now. Full list in the article.

But what do you think? Will we get a music generation AI, or is music too hard for computers?

joshdance

on November 12, 2022

If the output is a WAV or MP3 file, i.e., a fully mixed and produced track, that must be terribly difficult and nuanced. But if it's just, say, sheet music, it should be a lot easier for an AI to generate. From there, a synth on the local machine could render it out with MIDI.

Has anyone tried that or similar?
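
To make the idea concrete, here is a minimal sketch of the "symbolic output first, render locally" approach, using only the Python standard library. It assumes the model's output is a list of (MIDI note number, length in beats) pairs; the `melody` list, the function names, and the sine-wave "synth" are all hypothetical stand-ins, not how any particular project works.

```python
import math
import struct
import wave

SAMPLE_RATE = 22050  # samples per second (an assumed value; any common rate works)


def midi_to_hz(note: int) -> float:
    """Convert a MIDI note number to a frequency (A4 = note 69 = 440 Hz)."""
    return 440.0 * 2 ** ((note - 69) / 12)


def render(notes, bpm=120, amplitude=0.3):
    """Turn (midi_note, beats) pairs into a list of 16-bit sample values."""
    samples = []
    seconds_per_beat = 60.0 / bpm
    for note, beats in notes:
        n = int(SAMPLE_RATE * beats * seconds_per_beat)
        freq = midi_to_hz(note)
        for i in range(n):
            # Linear fade-out on each note so note boundaries do not click
            envelope = 1.0 - i / n
            value = amplitude * envelope * math.sin(2 * math.pi * freq * i / SAMPLE_RATE)
            samples.append(int(value * 32767))
    return samples


def write_wav(path, samples):
    """Write mono 16-bit PCM samples to a WAV file."""
    with wave.open(path, "wb") as f:
        f.setnchannels(1)
        f.setsampwidth(2)
        f.setframerate(SAMPLE_RATE)
        f.writeframes(struct.pack(f"<{len(samples)}h", *samples))


# Hypothetical "model output": a C major arpeggio (C4, E4, G4, C5)
melody = [(60, 1), (64, 1), (67, 1), (72, 2)]

if __name__ == "__main__":
    write_wav("melody.wav", render(melody))
```

The point is that the hard part (choosing the notes) and the easy part (turning notes into sound) are separable. A real setup would swap the sine wave for a proper software synth or sampled instrument.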

dv · 3 years ago
1. 2
  
  Yes. There are a few projects whose output was MIDI. I will post a full list of all the projects I found tomorrow.
  
  joshdance · 3 years ago
  1. 1
    
    https://everynoise.com/
    
    https://www.ampermusic.com/
    
    https://evokemusic.ai/
    
    https://mubert.com/
    
    https://soundful.com/
    
    https://www.audiocipher.com/post/ai-music-app
    
    https://magenta.tensorflow.org/music-transformer
    
    https://www.audiocipher.com/
    
    https://soundraw.io/
    
    joshdance · 3 years ago
What about for food? I love thinking about the cool use cases of AI outside of just image generation, which seems to be the popular trend.

Last week, I decided to launch my own AI tool in the food space.
Use GPT-3 to find a recipe for exactly what you want to cook - sentientplatter.com

TatumStatham · 3 years ago
1. 1
  
  Nice! Tried making a recipe, just errored out. :(
  
  joshdance · 3 years ago
  1. 1
    
    Oh dang... I'm using Vercel to generate pages once a new prompt is typed in. It only gives 10 seconds, which is sometimes not long enough for a GPT-3 response. Working on splitting out some of the function calls, but if you retry after the error, you may see the result!
    
    Thank you for the feedback and for trying!
    
    TatumStatham · 3 years ago