I assume this “AI-generated” music is created the same way an LLM generates text: use samples from a corpus strung together into a new [derivative] output.
But it seems plausible that algorithmic generation can be used at any stage of the process. How much disclosure do we (listeners) require? At what point is it unacceptable “AI-generated” music?
The answers are going to be subjective. And human. And dealing with this, I think, is going to take a direction like the “typewriters in college” headline from a few days ago - human involvement, low automation … things that don’t scale.
> use samples from a corpus strung together into a new [derivative] output.
That’s kind of how the music industry produces music these days. There are a few song writers that write for most artists, music producers who sample other music to string together songs for most artists etc. That’s why most music sounds the same and why AI generated music can be indistinguishable from mainstream music.
They do use diffusion models, but I don't think they would make a detour via images. They can just generate audio directly with audio diffusion rather than image diffusion.
There technically was one experiment early on to trick Stable Diffusion into generating spectrograms that could be converted into audio. And, it worked surprisingly well.
I assume this “AI-generated” music is created the same way an LLM generates text: use samples from a corpus strung together into a new [derivative] output.
But it seems plausible that algorithmic generation can be used at any stage of the process. How much disclosure do we (listeners) require? At what point is it unacceptable “AI-generated” music?
The answers are going to be subjective. And human. And dealing with this, I think, is going to take a direction like the “typewriters in college” headline from a few days ago - human involvement, low automation … things that don’t scale.