the way the hf diffusers code defines "sigmas", the prediction is: prediction = latent * sqrt(sigma^2 + 1) - sigma * noise
"prediction" in this case means "what the final image should look like based on this step alone", and "noise" means the neural net's prediction of which part of the image is noise
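a rough sketch to sanity-check that relation, assuming sigma = sqrt((1 - alpha_bar) / alpha_bar) and that "latent" is the scaled latent the net actually sees, i.e. sqrt(alpha_bar) * clean + sqrt(1 - alpha_bar) * noise. under those assumptions the noise term gets subtracted. the function names are made up for illustration and are not diffusers api:

```python
import math


def sigma_from_alpha_bar(alpha_bar):
    # Assumption: sigma is defined from the cumulative alpha product like this.
    return math.sqrt((1.0 - alpha_bar) / alpha_bar)


def predict_final(latent, noise, sigma):
    # prediction = latent * sqrt(sigma^2 + 1) - sigma * noise, elementwise
    scale = math.sqrt(sigma ** 2 + 1.0)
    return [x * scale - sigma * n for x, n in zip(latent, noise)]


# Round trip on toy values: noise a "clean" vector, then recover it
# by feeding the exact noise back in as if the net predicted it perfectly.
alpha_bar = 0.5
sigma = sigma_from_alpha_bar(alpha_bar)
clean = [0.2, -1.0, 0.7]
true_noise = [0.5, 0.1, -0.3]
latent = [
    math.sqrt(alpha_bar) * c + math.sqrt(1.0 - alpha_bar) * n
    for c, n in zip(clean, true_noise)
]
recovered = predict_final(latent, true_noise, sigma)
```

with a perfect noise estimate, `recovered` matches `clean` up to floating point error. with a real net the noise estimate is imperfect, so the result is only this step's guess at the final image.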