Buckets:

hf-doc-build
/

doc-dev

Files

xet

hf-doc-build/doc-dev / diffusers /pr_12652 /en /modular_diffusers /components_manager.md

rtrm

16 days ago

preview code

download

raw

5.03 kB

	# ComponentsManager

	The [ComponentsManager](/docs/diffusers/pr_12652/en/api/modular_diffusers/pipeline_components#diffusers.ComponentsManager) is a model registry and management system for Modular Diffusers. It adds and tracks models, stores useful metadata (model size, device placement, adapters), and supports offloading.

	This guide will show you how to use [ComponentsManager](/docs/diffusers/pr_12652/en/api/modular_diffusers/pipeline_components#diffusers.ComponentsManager) to manage components and device memory.

	## Connect to a pipeline

	Create a [ComponentsManager](/docs/diffusers/pr_12652/en/api/modular_diffusers/pipeline_components#diffusers.ComponentsManager) and pass it to a [ModularPipeline](/docs/diffusers/pr_12652/en/api/modular_diffusers/pipeline#diffusers.ModularPipeline) with either [from_pretrained()](/docs/diffusers/pr_12652/en/api/modular_diffusers/pipeline#diffusers.ModularPipeline.from_pretrained) or [init_pipeline()](/docs/diffusers/pr_12652/en/api/modular_diffusers/pipeline_blocks#diffusers.ModularPipelineBlocks.init_pipeline).

	```py
	from diffusers import ModularPipeline, ComponentsManager
	import torch

	manager = ComponentsManager()
	pipe = ModularPipeline.from_pretrained("Tongyi-MAI/Z-Image-Turbo", components_manager=manager)
	pipe.load_components(torch_dtype=torch.bfloat16)
	```

	```py
	from diffusers import ModularPipelineBlocks, ComponentsManager
	import torch
	manager = ComponentsManager()
	blocks = ModularPipelineBlocks.from_pretrained("diffusers/Florence2-image-Annotator", trust_remote_code=True)
	pipe= blocks.init_pipeline(components_manager=manager)
	pipe.load_components(torch_dtype=torch.bfloat16)
	```

	Components loaded by the pipeline are automatically registered in the manager. You can inspect them right away.

	## Inspect components

	Print the [ComponentsManager](/docs/diffusers/pr_12652/en/api/modular_diffusers/pipeline_components#diffusers.ComponentsManager) to see all registered components, including their class, device placement, dtype, memory size, and load ID.

	The output below corresponds to the `from_pretrained` example above.

	```py
	Components:
	=============================================================================================================================
	Models:
	-----------------------------------------------------------------------------------------------------------------------------
	Name_ID \| Class \| Device: act(exec) \| Dtype \| Size (GB) \| Load ID
	-----------------------------------------------------------------------------------------------------------------------------
	text_encoder_140458257514752 \| Qwen3Model \| cpu \| torch.bfloat16 \| 7.49 \| Tongyi-MAI/Z-Image-Turbo\|text_encoder\|null\|null
	vae_140458257515376 \| AutoencoderKL \| cpu \| torch.bfloat16 \| 0.16 \| Tongyi-MAI/Z-Image-Turbo\|vae\|null\|null
	transformer_140458257515616 \| ZImageTransformer2DModel \| cpu \| torch.bfloat16 \| 11.46 \| Tongyi-MAI/Z-Image-Turbo\|transformer\|null\|null
	-----------------------------------------------------------------------------------------------------------------------------

	Other Components:
	-----------------------------------------------------------------------------------------------------------------------------
	ID \| Class \| Collection
	-----------------------------------------------------------------------------------------------------------------------------
	scheduler_140461023555264 \| FlowMatchEulerDiscreteScheduler \| N/A
	tokenizer_140458256346432 \| Qwen2Tokenizer \| N/A
	-----------------------------------------------------------------------------------------------------------------------------
	```

	The table shows models (with device, dtype, and memory info) separately from other components like schedulers and tokenizers. If any models have LoRA adapters, IP-Adapters, or quantization applied, that information is displayed in an additional section at the bottom.

	## Offloading

	The [enable_auto_cpu_offload()](/docs/diffusers/pr_12652/en/api/modular_diffusers/pipeline_components#diffusers.ComponentsManager.enable_auto_cpu_offload) method is a global offloading strategy that works across all models regardless of which pipeline is using them. Once enabled, you don't need to worry about device placement if you add or remove components.

	```py
	manager.enable_auto_cpu_offload(device="cuda")
	```

	All models begin on the CPU and [ComponentsManager](/docs/diffusers/pr_12652/en/api/modular_diffusers/pipeline_components#diffusers.ComponentsManager) moves them to the appropriate device right before they're needed, and moves other models back to the CPU when GPU memory is low.

	Call [disable_auto_cpu_offload()](/docs/diffusers/pr_12652/en/api/modular_diffusers/pipeline_components#diffusers.ComponentsManager.disable_auto_cpu_offload) to disable offloading.

	```py
	manager.disable_auto_cpu_offload()
	```

Xet Storage Details

Size:: 5.03 kB
Xet hash:: fa2fcc4dfeeba5ca4f27e43674ac00b46aa520a3d59432332f30ff25b67f8583

Xet efficiently stores files, intelligently splitting them into unique chunks and accelerating uploads and downloads. More info.