MiniMax M3 - SM100

#38
by celikburak - opened

Hi MiniMax team,

The open-sourced MSA repository appears to target SM100 / compute capability 10.0.
Will MiniMax-M3 support local inference on DGX Spark / GB10 (SM12.1)?
If yes, will there be a vLLM or SGLang deployment guide and compatible MSA kernel/runtime for GB10?

We are especially interested in TP=2 deployment, FP8 or FP4/NVFP4 weights, 1M context, and multimodal input.

FYI. The Spark Arena team (https://spark-arena.com | https://sparkrun.dev) will definitely be focused on enabling DGX Spark compatibility as soon as its released.

Sign up or log in to comment