Apply for community grant: Academic project (gpu)
Dear Hugging Face Team,
I am writing to apply for the GPU Community Grant for our academic project FloodDiffusion, a streaming motion generation demo for our CVPR 2026 paper.
Paper: FloodDiffusion: Tailored Diffusion Forcing for Streaming Motion Generation
Authors: Yiyi Cai, Yuhan Wu, Kunhang Li, You Zhou, Bo Zheng, Haiyang Liu
Affiliations: Shanda AI Research Tokyo, The University of Tokyo
Code: github.com/ShandaAI/FloodDiffusion
Model: ShandaAI/FloodDiffusionTiny
What this Space does
This is an interactive streaming demo that generates infinite-length 3D human motion in real time from text descriptions. Users can type a motion prompt (e.g., "walk forward", "dance"), watch the 3D skeleton animate in real time via Three.js, and change the text on the fly to seamlessly transition between motions. Multiple visitors can watch the same generation simultaneously (spectator mode).
To our knowledge, this is the first streaming motion generation demo on Hugging Face Spaces.
Why we need a persistent GPU (not ZeroGPU)
FloodDiffusion's core feature is streaming generation: the model maintains persistent internal state (diffusion forcing buffer, VAE cache, accumulated motion trajectory) across an unbounded number of generation steps. This requires a GPU that stays allocated continuously, which is incompatible with ZeroGPU's per-request allocation model:
- Stateful generation: the model accumulates state across every call; deallocating the GPU would lose all accumulated state
- No fixed endpoint: generation is infinite-length, so there is no natural point to release the GPU
- Real-time latency: cold-start would break the streaming experience
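To make the statefulness concrete, here is a minimal pure-Python sketch of the session pattern described above. The class and method names (`StreamingSession`, `stream_generate_step`) are illustrative stand-ins, not the actual FloodDiffusion API, and plain lists stand in for GPU tensors:

```python
# Hypothetical sketch: the session object accumulates state that must
# survive between generation steps for the stream to continue.

class StreamingSession:
    """Holds the per-stream state that would live on the GPU for the
    whole session (diffusion forcing buffer, accumulated trajectory)."""

    def __init__(self, prompt: str):
        self.prompt = prompt
        self.buffer = []        # stand-in for the diffusion forcing buffer
        self.trajectory = []    # stand-in for the accumulated motion trajectory

    def set_prompt(self, prompt: str):
        # Changing the text mid-stream keeps all accumulated state,
        # which is what enables seamless motion transitions.
        self.prompt = prompt

    def stream_generate_step(self):
        # Each step reads and extends the accumulated state; if the GPU
        # (and this object) were deallocated between steps, the stream
        # could not continue from where it left off.
        frame = f"{self.prompt}:{len(self.trajectory)}"
        self.buffer.append(frame)
        self.trajectory.append(frame)
        return frame

session = StreamingSession("walk forward")
first = session.stream_generate_step()   # "walk forward:0"
session.set_prompt("dance")              # on-the-fly prompt change
second = session.stream_generate_step()  # "dance:1"
```

Under ZeroGPU's per-request model, the equivalent of `session` would be destroyed after every `@spaces.GPU` call returns, which is the core incompatibility.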
We are currently running on t4-small and would greatly appreciate a community GPU grant to keep the demo accessible for the research community.
Hardware request
- GPU: t4-small (16GB VRAM, sufficient for FloodDiffusionTiny)
- Sleep: Enabled (auto-sleep after 30 min of inactivity to save resources)
Thank you for considering our application!
Best regards,
Haiyang Liu
Side question: Any ideas for ZeroGPU compatibility?
We'd also love to hear if the HF team has any ideas or plans that could make streaming/stateful inference work with ZeroGPU in the future.
Our main blocker is that streaming generation requires persistent GPU state across requests: the model's internal buffer, VAE cache, and diffusion forcing state must survive between consecutive stream_generate_step() calls. Currently, ZeroGPU deallocates the GPU after each @spaces.GPU function returns, which would destroy this state.
Some approaches we've considered but aren't sure about:
- Long-running @spaces.GPU session: is there a way to keep a ZeroGPU allocation alive for an extended interactive session (e.g., 2-5 minutes)?
- State checkpointing to CPU: move all GPU tensors to CPU between calls and restore them on the next call. Feasible, but adds significant latency per step.
- WebSocket + single long @spaces.GPU call: wrap the entire streaming session in one GPU call that communicates via WebSocket.
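For the second option, this is roughly the round-trip we have in mind. A pure-Python sketch with a `FakeTensor` stand-in, since the pattern is what matters; a real implementation would call `.cpu()` and `.to("cuda")` on torch tensors, and all names here are illustrative:

```python
# Sketch of "state checkpointing to CPU": persist session state across
# ZeroGPU deallocations by round-tripping it through host memory.

class FakeTensor:
    """Stand-in for a torch tensor; tracks which device it lives on."""
    def __init__(self, data, device="cuda"):
        self.data = data
        self.device = device

    def to(self, device):
        return FakeTensor(self.data, device)

def checkpoint_state(state: dict) -> dict:
    # Move every tensor to CPU before the @spaces.GPU call returns,
    # so the state survives the GPU being deallocated.
    return {k: v.to("cpu") for k, v in state.items()}

def restore_state(state: dict) -> dict:
    # Move the checkpointed state back to the GPU at the start of the
    # next call; this round-trip is the extra per-step latency.
    return {k: v.to("cuda") for k, v in state.items()}

state = {"buffer": FakeTensor([1, 2]), "vae_cache": FakeTensor([3])}
cpu_state = checkpoint_state(state)   # after the GPU call returns
gpu_state = restore_state(cpu_state)  # at the start of the next call
```

The per-step host/device transfer is what worries us latency-wise for a real-time stream.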
Would any of these work within ZeroGPU's current or planned architecture? Any other suggestions?
Thank you!
cc @hysts @merve @osanseviero: would appreciate your help with the GPU grant request above, and any thoughts on ZeroGPU compatibility for stateful streaming inference. Thank you!
(Sorry, correction: cc @hysts @merve @osanseviero for the GPU grant request and ZeroGPU question above. Thanks!)
Hi @H-Liu1997 , thanks for the detailed explanation. I've just assigned t4-small with a 30-minute sleep time.
Regarding the ZeroGPU compatibility questions, let me CC @cbensimon .
BTW, the next time you open a community grant request, please use the grant request flow described here instead of directly pinging people: https://huggingface.co/docs/hub/en/spaces-gpus#community-gpu-grants
That's the expected process. Also, Omar, whom you pinged, left HF years ago, so he isn't the right person to contact.