Model Card for Seirenes-7B -Based on Qwen2.5-7B-Instruct
Adversarial Self-Play with Evolving Distractions for LLM Reasoning
Model Description
This is a math reasoning model.
- Developed by: [Chi Zhang~1909zczc@gmail.com]
- Finetuned from model: [Qwen2.5-7B-Instruct]
Model Sources
- Repository: [Seirenes]
- Paper: [More Information Needed]
Uses
VLLM or Sglang
Training Details
Training Data
Citation [optional]
- Downloads last month
- -
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support