Model Card for Seirenes-4B -Based on Qwen3-4B-Instruct-2507
Adversarial Self-Play with Evolving Distractions for LLM Reasoning
Model Description
This is a math reasoning model.
- Developed by: [Chi Zhang~1909zczc@gmail.com]
- Finetuned from model: [Qwen3-4B-Instruct-2507]
Model Sources
- Repository: [Seirenes]
- Paper: [More Information Needed]
Uses
VLLM or Sglang
Training Details
Training Data
Citation [optional]
- Downloads last month
- -
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support