Buckets:

hf-doc-build
/

doc-dev

Files

xet

hf-doc-build/doc-dev / diffusers /pr_12652 /en /modular_diffusers /modular_pipeline.md

rtrm

17 days ago

preview code

download

raw

21.5 kB

	# ModularPipeline

	[ModularPipeline](/docs/diffusers/pr_12652/en/api/modular_diffusers/pipeline#diffusers.ModularPipeline) converts [ModularPipelineBlocks](/docs/diffusers/pr_12652/en/api/modular_diffusers/pipeline_blocks#diffusers.ModularPipelineBlocks) into an executable pipeline that loads models and performs the computation steps defined in the blocks. It is the main interface for running a pipeline and the API is very similar to [DiffusionPipeline](/docs/diffusers/pr_12652/en/api/pipelines/overview#diffusers.DiffusionPipeline) but with a few key differences.

	- Loading is lazy. With [DiffusionPipeline](/docs/diffusers/pr_12652/en/api/pipelines/overview#diffusers.DiffusionPipeline), [from_pretrained()](/docs/diffusers/pr_12652/en/api/pipelines/overview#diffusers.DiffusionPipeline.from_pretrained) creates the pipeline and loads all models at the same time. With [ModularPipeline](/docs/diffusers/pr_12652/en/api/modular_diffusers/pipeline#diffusers.ModularPipeline), creating and loading are two separate steps: [from_pretrained()](/docs/diffusers/pr_12652/en/api/modular_diffusers/pipeline#diffusers.ModularPipeline.from_pretrained) reads the configuration and knows where to load each component from, but doesn't actually load the model weights. You load the models later with [load_components()](/docs/diffusers/pr_12652/en/api/modular_diffusers/pipeline#diffusers.ModularPipeline.load_components), which is where you pass loading arguments like `torch_dtype` and `quantization_config`.

	- Two ways to create a pipeline. You can use [from_pretrained()](/docs/diffusers/pr_12652/en/api/modular_diffusers/pipeline#diffusers.ModularPipeline.from_pretrained) with an existing diffusers model repository — it automatically maps to the default pipeline blocks and then converts to a [ModularPipeline](/docs/diffusers/pr_12652/en/api/modular_diffusers/pipeline#diffusers.ModularPipeline) with no extra setup. You can check the [modular_pipelines_directory](https://github.com/huggingface/diffusers/tree/main/src/diffusers/modular_pipelines) to see which models are currently supported. You can also assemble your own pipeline from [ModularPipelineBlocks](/docs/diffusers/pr_12652/en/api/modular_diffusers/pipeline_blocks#diffusers.ModularPipelineBlocks) and convert it with the [init_pipeline()](/docs/diffusers/pr_12652/en/api/modular_diffusers/pipeline_blocks#diffusers.ModularPipelineBlocks.init_pipeline) method (see [Creating a pipeline](#creating-a-pipeline) for more details).

	- Running the pipeline is the same. Once loaded, you call the pipeline with the same arguments you're used to. A single [ModularPipeline](/docs/diffusers/pr_12652/en/api/modular_diffusers/pipeline#diffusers.ModularPipeline) can support multiple workflows (text-to-image, image-to-image, inpainting, etc.) when the pipeline blocks use [`AutoPipelineBlocks`](./auto_pipeline_blocks) to automatically select the workflow based on your inputs.

	Below are complete examples for text-to-image, image-to-image, and inpainting with SDXL.

	```py
	import torch
	from diffusers import ModularPipeline

	pipeline = ModularPipeline.from_pretrained("stabilityai/stable-diffusion-xl-base-1.0")
	pipeline.load_components(torch_dtype=torch.float16)
	pipeline.to("cuda")

	image = pipeline(prompt="Astronaut in a jungle, cold color palette, muted colors, detailed, 8k").images[0]
	image.save("modular_t2i_out.png")
	```

	```py
	import torch
	from diffusers import ModularPipeline
	from diffusers.utils import load_image

	pipeline = ModularPipeline.from_pretrained("stabilityai/stable-diffusion-xl-base-1.0")
	pipeline.load_components(torch_dtype=torch.float16)
	pipeline.to("cuda")

	url = "https://huggingface.co/datasets/huggingface/documentation-images/resolve/main/diffusers/sdxl-text2img.png"
	init_image = load_image(url)
	prompt = "a dog catching a frisbee in the jungle"
	image = pipeline(prompt=prompt, image=init_image, strength=0.8).images[0]
	image.save("modular_i2i_out.png")
	```

	```py
	import torch
	from diffusers import ModularPipeline
	from diffusers.utils import load_image

	pipeline = ModularPipeline.from_pretrained("stabilityai/stable-diffusion-xl-base-1.0")
	pipeline.load_components(torch_dtype=torch.float16)
	pipeline.to("cuda")

	img_url = "https://huggingface.co/datasets/huggingface/documentation-images/resolve/main/diffusers/sdxl-text2img.png"
	mask_url = "https://huggingface.co/datasets/huggingface/documentation-images/resolve/main/diffusers/sdxl-inpaint-mask.png"

	init_image = load_image(img_url)
	mask_image = load_image(mask_url)

	prompt = "A deep sea diver floating"
	image = pipeline(prompt=prompt, image=init_image, mask_image=mask_image, strength=0.85).images[0]
	image.save("modular_inpaint_out.png")
	```

	This guide will show you how to create a [ModularPipeline](/docs/diffusers/pr_12652/en/api/modular_diffusers/pipeline#diffusers.ModularPipeline), manage its components, and run the pipeline.

	## Creating a pipeline

	There are two ways to create a [ModularPipeline](/docs/diffusers/pr_12652/en/api/modular_diffusers/pipeline#diffusers.ModularPipeline). Assemble and create a pipeline from [ModularPipelineBlocks](/docs/diffusers/pr_12652/en/api/modular_diffusers/pipeline_blocks#diffusers.ModularPipelineBlocks) with [init_pipeline()](/docs/diffusers/pr_12652/en/api/modular_diffusers/pipeline_blocks#diffusers.ModularPipelineBlocks.init_pipeline), or load an existing pipeline with [from_pretrained()](/docs/diffusers/pr_12652/en/api/modular_diffusers/pipeline#diffusers.ModularPipeline.from_pretrained).

	You can also initialize a [`ComponentsManager`](./components_manager) to handle device placement and memory management. If you don't need automatic offloading, you can skip this and move the pipeline to your device manually with `pipeline.to("cuda")`.

	> [!TIP]
	> Refer to the [ComponentsManager](./components_manager) doc for more details about how it can help manage components across different workflows.

	### init_pipeline

	[init_pipeline()](/docs/diffusers/pr_12652/en/api/modular_diffusers/pipeline_blocks#diffusers.ModularPipelineBlocks.init_pipeline) converts any [ModularPipelineBlocks](/docs/diffusers/pr_12652/en/api/modular_diffusers/pipeline_blocks#diffusers.ModularPipelineBlocks) into a [ModularPipeline](/docs/diffusers/pr_12652/en/api/modular_diffusers/pipeline#diffusers.ModularPipeline).

	Let's define a minimal block to see how it works:

	```py
	from transformers import CLIPTextModel
	from diffusers.modular_pipelines import (
	ComponentSpec,
	ModularPipelineBlocks,
	PipelineState,
	)

	class MyBlock(ModularPipelineBlocks):
	@property
	def expected_components(self):
	return [
	ComponentSpec(
	name="text_encoder",
	type_hint=CLIPTextModel,
	pretrained_model_name_or_path="openai/clip-vit-large-patch14",
	),
	]

	def __call__(self, components, state: PipelineState) -> PipelineState:
	return components, state
	```

	Call [init_pipeline()](/docs/diffusers/pr_12652/en/api/modular_diffusers/pipeline_blocks#diffusers.ModularPipelineBlocks.init_pipeline) to convert it into a pipeline. The `blocks` attribute on the pipeline is the blocks it was created from — it determines the expected inputs, outputs, and computation logic.

	```py
	block = MyBlock()
	pipe = block.init_pipeline()
	pipe.blocks
	```

	```
	MyBlock {
	"_class_name": "MyBlock",
	"_diffusers_version": "0.37.0.dev0"
	}
	```

	> [!WARNING]
	> Blocks are mutable — you can freely add, remove, or swap blocks before creating a pipeline. However, once a pipeline is created, modifying `pipeline.blocks` won't affect the pipeline because it returns a copy. If you want a different block structure, create a new pipeline after modifying the blocks.

	When you call [init_pipeline()](/docs/diffusers/pr_12652/en/api/modular_diffusers/pipeline_blocks#diffusers.ModularPipelineBlocks.init_pipeline) without a repository, it uses the `pretrained_model_name_or_path` defined in the block's [ComponentSpec](/docs/diffusers/pr_12652/en/api/modular_diffusers/pipeline_components#diffusers.ComponentSpec) to determine where to load each component from. Printing the pipeline shows the component loading configuration.

	```py
	pipe
	ModularPipeline {
	"_blocks_class_name": "MyBlock",
	"_class_name": "ModularPipeline",
	"_diffusers_version": "0.37.0.dev0",
	"text_encoder": [
	null,
	null,
	{
	"pretrained_model_name_or_path": "openai/clip-vit-large-patch14",
	"revision": null,
	"subfolder": "",
	"type_hint": [
	"transformers",
	"CLIPTextModel"
	],
	"variant": null
	}
	]
	}
	```

	If you pass a repository to [init_pipeline()](/docs/diffusers/pr_12652/en/api/modular_diffusers/pipeline_blocks#diffusers.ModularPipelineBlocks.init_pipeline), it overrides the loading path by matching your block's components against the pipeline config in that repository (`model_index.json` or `modular_model_index.json`).

	In the example below, the `pretrained_model_name_or_path` will be updated to `"stabilityai/stable-diffusion-xl-base-1.0"`.

	```py
	pipe = block.init_pipeline("stabilityai/stable-diffusion-xl-base-1.0")
	pipe
	ModularPipeline {
	"_blocks_class_name": "MyBlock",
	"_class_name": "ModularPipeline",
	"_diffusers_version": "0.37.0.dev0",
	"text_encoder": [
	null,
	null,
	{
	"pretrained_model_name_or_path": "stabilityai/stable-diffusion-xl-base-1.0",
	"revision": null,
	"subfolder": "text_encoder",
	"type_hint": [
	"transformers",
	"CLIPTextModel"
	],
	"variant": null
	}
	]
	}
	```

	If a component in your block doesn't exist in the repository, it remains `null` and is skipped during [load_components()](/docs/diffusers/pr_12652/en/api/modular_diffusers/pipeline#diffusers.ModularPipeline.load_components).

	### from_pretrained

	[from_pretrained()](/docs/diffusers/pr_12652/en/api/modular_diffusers/pipeline#diffusers.ModularPipeline.from_pretrained) is a convenient way to create a [ModularPipeline](/docs/diffusers/pr_12652/en/api/modular_diffusers/pipeline#diffusers.ModularPipeline) without defining blocks yourself.

	It works with three types of repositories.

	A regular diffusers repository. Pass any supported model repository and it automatically maps to the default pipeline blocks. Currently supported models include SDXL, Wan, Qwen, Z-Image, Flux, and Flux2.

	```py
	from diffusers import ModularPipeline, ComponentsManager

	components = ComponentsManager()
	pipeline = ModularPipeline.from_pretrained(
	"stabilityai/stable-diffusion-xl-base-1.0", components_manager=components
	)
	```

	A modular repository. These repositories contain a `modular_model_index.json` that specifies where to load each component from — the components can come from different repositories and the modular repository itself may not contain any model weights. For example, [diffusers/flux2-bnb-4bit-modular](https://huggingface.co/diffusers/flux2-bnb-4bit-modular) loads a quantized transformer from one repository and the remaining components from another. See [Modular repository](#modular-repository) for more details on the format.

	```py
	from diffusers import ModularPipeline, ComponentsManager

	components = ComponentsManager()
	pipeline = ModularPipeline.from_pretrained(
	"diffusers/flux2-bnb-4bit-modular", components_manager=components
	)
	```

	A modular repository with custom code. Some repositories include custom pipeline blocks alongside the loading configuration. Add `trust_remote_code=True` to load them. See [Custom blocks](./custom_blocks) for how to create your own.

	```py
	from diffusers import ModularPipeline, ComponentsManager

	components = ComponentsManager()
	pipeline = ModularPipeline.from_pretrained(
	"diffusers/Florence2-image-Annotator", trust_remote_code=True, components_manager=components
	)
	```

	## Loading components

	A [ModularPipeline](/docs/diffusers/pr_12652/en/api/modular_diffusers/pipeline#diffusers.ModularPipeline) doesn't automatically instantiate with components. It only loads the configuration and component specifications. You can load components with [load_components()](/docs/diffusers/pr_12652/en/api/modular_diffusers/pipeline#diffusers.ModularPipeline.load_components).

	This will load all the components that have a valid loading spec.

	```py
	import torch

	pipeline.load_components(torch_dtype=torch.float16)
	```

	You can also load specific components by name. The example below only loads the `text_encoder`.

	```py
	pipeline.load_components(names=["text_encoder"], torch_dtype=torch.float16)
	```

	After loading, printing the pipeline shows which components are loaded — the first two fields change from `null` to the component's library and class.

	```py
	pipeline
	```

	```
	# text_encoder is loaded - shows library and class
	"text_encoder": [
	"transformers",
	"CLIPTextModel",
	{ ... }
	]

	# unet is not loaded yet - still null
	"unet": [
	null,
	null,
	{ ... }
	]
	```

	Loading keyword arguments like `torch_dtype`, `variant`, `revision`, and `quantization_config` are passed through to `from_pretrained()` for each component. You can pass a single value to apply to all components, or a dict to set per-component values.

	```py
	# apply bfloat16 to all components
	pipeline.load_components(torch_dtype=torch.bfloat16)

	# different dtypes per component
	pipeline.load_components(torch_dtype={"transformer": torch.bfloat16, "default": torch.float32})
	```

	[load_components()](/docs/diffusers/pr_12652/en/api/modular_diffusers/pipeline#diffusers.ModularPipeline.load_components) only loads components that haven't been loaded yet and have a valid loading spec. This means if you've already set a component on the pipeline, calling [load_components()](/docs/diffusers/pr_12652/en/api/modular_diffusers/pipeline#diffusers.ModularPipeline.load_components) again won't reload it.

	## Updating components

	[update_components()](/docs/diffusers/pr_12652/en/api/modular_diffusers/pipeline#diffusers.ModularPipeline.update_components) replaces a component on the pipeline with a new one. When a component is updated, the loading specifications are also updated in the pipeline config and [load_components()](/docs/diffusers/pr_12652/en/api/modular_diffusers/pipeline#diffusers.ModularPipeline.load_components) will skip it on subsequent calls.

	### From AutoModel

	You can pass a model object loaded with `AutoModel.from_pretrained()`. Models loaded this way are automatically tagged with their loading information.

	```py
	from diffusers import AutoModel

	unet = AutoModel.from_pretrained(
	"RunDiffusion/Juggernaut-XL-v9", subfolder="unet", variant="fp16", torch_dtype=torch.float16
	)
	pipeline.update_components(unet=unet)
	```

	### From ComponentSpec

	Use [get_component_spec()](/docs/diffusers/pr_12652/en/api/modular_diffusers/pipeline#diffusers.ModularPipeline.get_component_spec) to get a copy of the current component specification, modify it, and load a new component.

	```py
	unet_spec = pipeline.get_component_spec("unet")

	# modify to load from a different repository
	unet_spec.pretrained_model_name_or_path = "RunDiffusion/Juggernaut-XL-v9"

	# load and update
	unet = unet_spec.load(torch_dtype=torch.float16)
	pipeline.update_components(unet=unet)
	```

	You can also create a [ComponentSpec](/docs/diffusers/pr_12652/en/api/modular_diffusers/pipeline_components#diffusers.ComponentSpec) from scratch.

	Not all components are loaded from pretrained weights — some are created from a config (listed under `pipeline.config_component_names`). For these, use [create()](/docs/diffusers/pr_12652/en/api/modular_diffusers/pipeline_components#diffusers.ComponentSpec.create) instead of [load()](/docs/diffusers/pr_12652/en/api/modular_diffusers/pipeline_components#diffusers.ComponentSpec.load).

	```py
	guider_spec = pipeline.get_component_spec("guider")
	guider_spec.config = {"guidance_scale": 5.0}
	guider = guider_spec.create()
	pipeline.update_components(guider=guider)
	```

	Or simply pass the object directly.

	```py
	from diffusers.guiders import ClassifierFreeGuidance

	guider = ClassifierFreeGuidance(guidance_scale=5.0)
	pipeline.update_components(guider=guider)
	```

	See the [Guiders](./guiders) guide for more details on available guiders and how to configure them.

	## Splitting a pipeline into stages

	Since blocks are composable, you can take a pipeline apart and reconstruct it into separate pipelines for each stage. The example below shows how we can separate the text encoder block from the rest of the pipeline, so you can encode the prompt independently and pass the embeddings to the main pipeline.

	```py
	from diffusers import ModularPipeline, ComponentsManager
	import torch

	device = "cuda"
	dtype = torch.bfloat16
	repo_id = "black-forest-labs/FLUX.2-klein-4B"

	# get the blocks and separate out the text encoder
	blocks = ModularPipeline.from_pretrained(repo_id).blocks
	text_block = blocks.sub_blocks.pop("text_encoder")

	# use ComponentsManager to handle offloading across multiple pipelines
	manager = ComponentsManager()
	manager.enable_auto_cpu_offload(device=device)

	# create separate pipelines for each stage
	text_encoder_pipeline = text_block.init_pipeline(repo_id, components_manager=manager)
	pipeline = blocks.init_pipeline(repo_id, components_manager=manager)

	# encode text
	text_encoder_pipeline.load_components(torch_dtype=dtype)
	text_embeddings = text_encoder_pipeline(prompt="a cat").get_by_kwargs("denoiser_input_fields")

	# denoise and decode
	pipeline.load_components(torch_dtype=dtype)
	output = pipeline(
	**text_embeddings,
	num_inference_steps=4,
	).images[0]
	```

	[ComponentsManager](/docs/diffusers/pr_12652/en/api/modular_diffusers/pipeline_components#diffusers.ComponentsManager) handles memory across multiple pipelines. Unlike the offloading strategies in [DiffusionPipeline](/docs/diffusers/pr_12652/en/api/pipelines/overview#diffusers.DiffusionPipeline) that follow a fixed order, [ComponentsManager](/docs/diffusers/pr_12652/en/api/modular_diffusers/pipeline_components#diffusers.ComponentsManager) makes offloading decisions dynamically each time a model forward pass runs, based on the current memory situation. This means it works regardless of how many pipelines you create or what order you run them in. See the [ComponentsManager](./components_manager) guide for more details.

	If pipeline stages share components (e.g., the same VAE used for encoding and decoding), you can use [update_components()](/docs/diffusers/pr_12652/en/api/modular_diffusers/pipeline#diffusers.ModularPipeline.update_components) to pass an already-loaded component to another pipeline instead of loading it again.

	## Modular repository

	A repository is required if the pipeline blocks use pretrained components. The repository supplies loading specifications and metadata.

	[ModularPipeline](/docs/diffusers/pr_12652/en/api/modular_diffusers/pipeline#diffusers.ModularPipeline) works with regular diffusers repositories out of the box. However, you can also create a modular repository for more flexibility. A modular repository contains a `modular_model_index.json` file containing the following 3 elements.

	- `library` and `class` shows which library the component was loaded from and its class. If `null`, the component hasn't been loaded yet.
	- `loading_specs_dict` contains the information required to load the component such as the repository and subfolder it is loaded from.

	The key advantage of a modular repository is that components can be loaded from different repositories. For example, [diffusers/flux2-bnb-4bit-modular](https://huggingface.co/diffusers/flux2-bnb-4bit-modular) loads a quantized transformer from `diffusers/FLUX.2-dev-bnb-4bit` while loading the remaining components from `black-forest-labs/FLUX.2-dev`.

	To convert a regular diffusers repository into a modular one, create the pipeline using the regular repository, and then push to the Hub. The saved repository will contain a `modular_model_index.json` with all the loading specifications.

	```py
	from diffusers import ModularPipeline

	# load from a regular repo
	pipeline = ModularPipeline.from_pretrained("stabilityai/stable-diffusion-xl-base-1.0")

	# push as a modular repository
	pipeline.save_pretrained("local/path", repo_id="my-username/sdxl-modular", push_to_hub=True)
	```

	A modular repository can also include custom pipeline blocks as Python code. This allows you to share specialized blocks that aren't native to Diffusers. For example, [diffusers/Florence2-image-Annotator](https://huggingface.co/diffusers/Florence2-image-Annotator) contains custom blocks alongside the loading configuration:

	```
	Florence2-image-Annotator/
	├── block.py # Custom pipeline blocks implementation
	├── config.json # Pipeline configuration and auto_map
	├── mellon_config.json # UI configuration for Mellon
	└── modular_model_index.json # Component loading specifications
	```

	The `config.json` file contains an `auto_map` key that tells [ModularPipeline](/docs/diffusers/pr_12652/en/api/modular_diffusers/pipeline#diffusers.ModularPipeline) where to find the custom blocks:

	```json
	{
	"_class_name": "Florence2AnnotatorBlocks",
	"auto_map": {
	"ModularPipelineBlocks": "block.Florence2AnnotatorBlocks"
	}
	}
	```

	Load custom code repositories with `trust_remote_code=True` as shown in [from_pretrained](#from_pretrained). See [Custom blocks](./custom_blocks) for how to create and share your own.

Xet Storage Details

Size:: 21.5 kB
Xet hash:: 048e7f5d4596d225601081e6a751c4335de530f087a3432d9e8f2ff441f6757b

Xet efficiently stores files, intelligently splitting them into unique chunks and accelerating uploads and downloads. More info.