Diffusers
Safetensors
lsmpp's picture
Upload folder using huggingface_hub
1b7e5b9 verified

Caching methods

Cache methods speedup diffusion transformers by storing and reusing intermediate outputs of specific layers, such as attention and feedforward layers, instead of recalculating them at each inference step.

CacheMixin

[[autodoc]] CacheMixin

PyramidAttentionBroadcastConfig

[[autodoc]] PyramidAttentionBroadcastConfig

[[autodoc]] apply_pyramid_attention_broadcast

FasterCacheConfig

[[autodoc]] FasterCacheConfig

[[autodoc]] apply_faster_cache

FirstBlockCacheConfig

[[autodoc]] FirstBlockCacheConfig

[[autodoc]] apply_first_block_cache