This is a randomly initialized model built from a modified version of the Qwen/Qwen3-Next-80B-A3B-Thinking config. It is intended for debugging and testing only.
Because the necessary classes were already merged into 🤗 Transformers, we were able to release this before the official launch of Qwen3.5.
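
The card does not say which config fields were modified. As a minimal sketch only, the "modified config" step could look like the following, which works on a plain dictionary. The field values in `BASE_CONFIG` and `TINY_OVERRIDES`, and the helper `shrink_config`, are hypothetical placeholders for illustration and are not taken from the actual model.

```python
import copy

# Placeholder stand-in for a few fields of an original config.
# These values are illustrative assumptions, not the real model's settings.
BASE_CONFIG = {
    "hidden_size": 2048,
    "num_hidden_layers": 48,
    "num_attention_heads": 16,
    "vocab_size": 151936,
}

# Hypothetical overrides that shrink the model so tests run quickly.
TINY_OVERRIDES = {
    "hidden_size": 8,
    "num_hidden_layers": 2,
    "num_attention_heads": 2,
}


def shrink_config(base, overrides):
    """Return a copy of `base` with `overrides` applied; `base` is not mutated."""
    # Reject keys that do not exist in the base config, to catch typos early.
    unknown = set(overrides) - set(base)
    if unknown:
        raise KeyError(f"unknown config keys: {sorted(unknown)}")
    tiny = copy.deepcopy(base)
    tiny.update(overrides)
    return tiny


tiny_config = shrink_config(BASE_CONFIG, TINY_OVERRIDES)
print(tiny_config)
```

With 🤗 Transformers installed, the analogous workflow would presumably load the original config with `AutoConfig.from_pretrained`, override the chosen fields, and build the model with `AutoModelForCausalLM.from_config`, which creates randomly initialized weights instead of loading a checkpoint.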