Diffusers
Safetensors
How to use from the
Use from the
Diffusers library
pip install -U diffusers transformers accelerate
import torch
from diffusers import DiffusionPipeline

# switch to "mps" for apple devices
pipe = DiffusionPipeline.from_pretrained("FudanCVL/SceneDesigner", dtype=torch.bfloat16, device_map="cuda")

prompt = "Astronaut in a jungle, cold color palette, muted colors, detailed, 8k"
image = pipe(prompt).images[0]

SceneDesigner: Controllable Multi-Object Image Generation with 9-DoF Pose Manipulation

Zhenyuan Qin* · Xincheng Shuai* · Henghui Ding

Fudan University

HuggingFace

⚙️ Quick Start

1. Installation

  1. Install Python environment (recommended to use uv)

    uv sync
    

    Or alternatively:

    pip install -r requirements.txt
    
  2. Install Blender environment

    cd render
    python install.py
    

    If the automatic installation script fails, you can install manually:

    • First download Blender and extract it to the ./render directory
    • Then locate the Blender Python path and install the Python dependencies for Blender, for example:
    cd render
    blender-4.2.8-linux-x64/4.2/python/bin/python3.11 -m pip install -r blender_requirements.txt
    

2. Download Checkpoints

  1. Download the SceneDesigner weights to the checkpoints directory
  2. Download the Stable Diffusion 3.5 base model weights to the checkpoints directory

3. Run Demo

Launch the Gradio app:

python app.py \
  --blender_path render/blender/blender \
  --device cuda:0 \
  --port 7861 
  • Adjust the 9D pose of the cube in the Cube Controls panel
  • Enter text prompts in the Generation Config panel and click the Generate Images button to create images

✒️ Citation

If you find our work useful for your research and applications, please kindly cite using this BibTeX:

@inproceedings{SceneDesigner,
        title={SceneDesigner: Controllable Multi-Object Image Generation with 9-DoF Pose Manipulation},
        author={Qin, Zhenyuan and Shuai, Xincheng and Ding, Henghui},
        booktitle={NeurIPS},
        year={2025}
      }
Downloads last month
-
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Collection including FudanCVL/SceneDesigner

Paper for FudanCVL/SceneDesigner