murataksit34's picture
Normalize naming and remove duplicated dataset heading
a642e3c verified
---
language:
- en
license: apache-2.0
library_name: transformers
tags:
- qwen3
- english
- data-mining
- data-science
- instruction-tuning
- sft
- insight
datasets:
- zero9tech/data-scientist-insight-dialog-en-16.5k
---
# Qwen3-4B-Data-Science-Insight-16.5K
This model is tuned for decision-oriented data mining and applied data science assistance.
## Training Setup
1. Domain SFT: zero9tech/data-scientist-insight-dialog-en-16.5k.
## Dataset Test Highlights
- Total records: 16,463
- Split: train: 14,021 路 validation: 801 路 test: 1,641
- assistant_first_unique_ratio: 0.8408
- assistant_final_unique_ratio: 1.0000
## Usage Note
Model behavior is optimized for decision-focused responses (method choice, alternatives, risk signals, validation planning).
## Copyright
Copyright (c) Zero9 Tech
## License
Apache-2.0