Files
2025-06-13 16:29:32 +00:00

1.8 KiB

license, tags
license tags
openrail++
stable-diffusion
text-to-image

SD v2.1-base with Zero Terminal SNR (LAION Aesthetic 6+)

This model is used in Diffusion Model with Perceptual Loss paper as the MSE baseline.

This model is trained using zero terminal SNR schedule following Common Diffusion Noise Schedules and Sample Steps are Flawed paper on LAION aesthetic 6+ data.

This model is finetuned from stabilityai/stable-diffusion-2-1-base.

This model is meant for research demonstration, not for production use.

Usage

from diffusers import StableDiffusionPipeline
prompt = "A young girl smiling"
pipe = StableDiffusionPipeline.from_pretrained("ByteDance/sd2.1-base-zsnr-laionaes6").to("cuda")
pipe(prompt, guidance_scale=7.5, guidance_rescale=0.7).images[0].save("out.jpg")

Cite as

@misc{lin2024diffusion,
      title={Diffusion Model with Perceptual Loss}, 
      author={Shanchuan Lin and Xiao Yang},
      year={2024},
      eprint={2401.00110},
      archivePrefix={arXiv},
      primaryClass={cs.CV}
}

@misc{lin2023common,
      title={Common Diffusion Noise Schedules and Sample Steps are Flawed}, 
      author={Shanchuan Lin and Bingchen Liu and Jiashi Li and Xiao Yang},
      year={2023},
      eprint={2305.08891},
      archivePrefix={arXiv},
      primaryClass={cs.CV}
}