Tangent: I assume this “AI-generated” music is created the same way an LLM gener...

darth_avocado · 2026-04-20T16:56:45 1776704205

> use samples from a corpus strung together into a new [derivative] output.

That’s kind of how the music industry produces music these days. There are a few song writers that write for most artists, music producers who sample other music to string together songs for most artists etc. That’s why most music sounds the same and why AI generated music can be indistinguishable from mainstream music.

taneq · 2026-04-20T16:59:23 1776704363

I mean, it was how Beethoven did it with dice, too. This is just much quicker and more comprehensive.

Kye · 2026-04-20T16:48:26 1776703706

My understanding is music generation is more like stable diffusion. It generates a waveform as an image, then turns it into an audio file.

cubefox · 2026-04-20T16:55:55 1776704155

They do use diffusion models, but I don't think they would make a detour via images. They can just generate audio directly with audio diffusion rather than image diffusion.

corysama · 2026-04-20T17:45:10 1776707110

There technically was one experiment early on to trick Stable Diffusion into generating spectrograms that could be converted into audio. And, it worked surprisingly well.

https://web.archive.org/web/20230314190913/https://www.riffu...

https://huggingface.co/riffusion/riffusion-model-v1

But, I'd expect everything in the past 3 years to diffuse the audio waveform directly.

Kye · 2026-04-20T18:42:16 1776710536

That's probably what I was thinking of. I haven't kept up as much on non-text generative AI.