Diffusers

Latest version: v0.31.0

Safety actively analyzes 682487 Python packages for vulnerabilities to keep your Python projects secure.

Scan your dependencies

Page 4 of 16

0.29.0

This release emphasizes Stable Diffusion 3, Stability AI’s latest iteration of the Stable Diffusion family of models. It was introduced in [Scaling Rectified Flow Transformers for High-Resolution Image Synthesis](https://arxiv.org/abs/2403.03206) by Patrick Esser, Sumith Kulal, Andreas Blattmann, Rahim Entezari, Jonas Müller, Harry Saini, Yam Levi, Dominik Lorenz, Axel Sauer, Frederic Boesel, Dustin Podell, Tim Dockhorn, Zion English, Kyle Lacey, Alex Goodwin, Yannik Marek, and Robin Rombach.

As the model is gated, before using it with `diffusers`, you first need to go to the [Stable Diffusion 3 Medium Hugging Face page](https://huggingface.co/stabilityai/stable-diffusion-3-medium-diffusers), fill in the form and accept the gate. Once you are in, you need to log in so that your system knows you’ve accepted the gate.

bash
huggingface-cli login


The code below shows how to perform text-to-image generation with SD3:

python
import torch
from diffusers import StableDiffusion3Pipeline

pipe = StableDiffusion3Pipeline.from_pretrained("stabilityai/stable-diffusion-3-medium-diffusers", torch_dtype=torch.float16)
pipe = pipe.to("cuda")

image = pipe(
"A cat holding a sign that says hello world",
negative_prompt="",
num_inference_steps=28,
guidance_scale=7.0,
).images[0]
image


![image](https://github.com/huggingface/diffusers/assets/22957388/30917935-6649-447e-8bf2-c4c9378562de)

Refer to [our documentation](https://huggingface.co/docs/diffusers/main/en/api/pipelines/stable_diffusion/stable_diffusion_3) for learning all the optimizations you can apply to SD3 as well as the image-to-image pipeline.

Additionally, we support DreamBooth + LoRA fine-tuning of Stable Diffusion 3 through rectified flow. Check out [this directory](https://github.com/huggingface/diffusers/blob/main/examples/dreambooth/README_sd3.md) for more details.

0.28.2

* Change checkpoint key used to identify CLIP models in single file checkpoints by DN6 in 8319

0.28.1

Significant community contributions

The following contributors have made significant changes to the library over the last release:

* gnobitab
* Tencent Hunyuan Team: add HunyuanDiT related updates (8240)
* Tencent Hunyuan Team - Updated Doc for HunyuanDiT (8383)

0.28.0

Significant community contributions

The following contributors have made significant changes to the library over the last release:

* standardAI
* Fix typos (7411)
* [`IP-Adapter`] Fix IP-Adapter Support and Refactor Callback for `StableDiffusionPanoramaPipeline` (7262)
* [`Docs`] Fix typos (7451)
* Fix Tiling in `ConsistencyDecoderVAE` (7290)
* Fix CPU offload in docstring (7827)
* Fix image upcasting (7858)
* Remove dead code and fix f-string issue (7720)
* Fix several imports (7712)
* Expansion proposal of `diffusers-cli env` (7403)
* Fix a grammatical error in the `raise` messages (8272)
* Fix CPU Offloading Usage & Typos (8230)
* a-r-r-o-w
* [refactor] Fix FreeInit behaviour (7410)
* [Pipeline] AnimateDiff SDXL (6721)
* UmerHA
* Fixed minor error in `test_lora_layers_peft.py` (7394)
* Skip `test_lora_fuse_nan` on mps (7481)
* Implements Blockwise lora (7352)
* Quick-Fix for 7352 block-lora (7523)
* Skip `test_freeu_enabled ` on MPS (7570)
* Fixing implementation of ControlNet-XS (6772)
* bghira
* diffusers7426 fix stable diffusion xl inference on MPS when dtypes shift unexpectedly due to pytorch bugs (7446)
* apple mps: training support for SDXL (ControlNet, LoRA, Dreambooth, T2I) (7447)
* 7529 do not disable autocast for cuda devices (7530)
* 7879 - adjust documentation to use naruto dataset, since pokemon is now gated (7880)
* HyoungwonCho
* Perturbed-Attention Guidance (7512)
* Modification on the PAG community pipeline (re) (7876)
* haikmanukyan
* add HD-Painter pipeline (7520)
* fabiorigano
* Multi-image masking for single IP Adapter (7499)
* Move IP Adapter Face ID to core (7186)
* Restore AttnProcessor2_0 in unload_ip_adapter (7727)
* [Docs] Update image masking and face id example (7780)
* fix AnimateDiff creation with a unet loaded with IP Adapter (7791)
* kabachuha
* Add (Scheduled) Pseudo-Huber Loss training scripts to research projects (7527)
* lawrence-cj
* PixArt-Sigma Implementation (7654)
* [docs] add doc for PixArtSigmaPipeline (7857)
* vanakema
* 7535 Update FloatTensor type hints to Tensor (7883)
* zjysteven
* [Pipeline] Adding BoxDiff to community examples (7947)
* isamu-isozaki
* Adding VQGAN Training script (5483)
* SingleZombie
* [Community Pipeline] FRESCO: Spatial-Temporal Correspondence for Zero-Shot Video Translation (8239)
* toshas
* [Pipeline] Marigold depth and normals estimation (7847)

0.27.2

All commits

* [scheduler] fix a bug in add_noise by yiyixuxu in https://github.com/huggingface/diffusers/pull/7386
* [LoRA] fix cross_attention_kwargs problems and tighten tests by sayakpaul in https://github.com/huggingface/diffusers/pull/7388
* Fix issue with prompt embeds and latents in SD Cascade Decoder with multiple image embeddings for a single prompt. by DN6 in https://github.com/huggingface/diffusers/pull/7381

0.27.1

All commits

Page 4 of 16

© 2024 Safety CLI Cybersecurity Inc. All Rights Reserved.