Blog

Dec 16
2022

Riffusion tweaks Stable Diffusion to make AI text to image spectrograms play audio

Tweaks to the system have fine-tuned images of spectrograms.

Stable Diffusion has been tweaked to include an update to its AI routines to include a fine-tuning of the images of spectrograms that are paired to text. Now they are able to generate more precise sounds. The team calls their version of the stable diffusion model, Riffusion.

All the Stable Diffusion features remain.

Merovingian/iStock.

There is audio processing, also but that happens later in the cycle or downstream of the model.

/* */