Post
214
I have update the vllm to the latest 0.16rc1 at https://hub.docker.com/repository/docker/hellohal2064/vllm-dgx-spark-gb10/general it will run all of the qwen3 models very well with thinking at 41 tok/s it is only setup to run on one spark. I think the documentation on DockerHub is up to date.