murataksit34's picture
Normalize naming and remove duplicated dataset heading
a642e3c verified
metadata
language:
  - en
license: apache-2.0
library_name: transformers
tags:
  - qwen3
  - english
  - data-mining
  - data-science
  - instruction-tuning
  - sft
  - insight
datasets:
  - zero9tech/data-scientist-insight-dialog-en-16.5k

Qwen3-4B-Data-Science-Insight-16.5K

This model is tuned for decision-oriented data mining and applied data science assistance.

Training Setup

  1. Domain SFT: zero9tech/data-scientist-insight-dialog-en-16.5k.

Dataset Test Highlights

  • Total records: 16,463
  • Split: train: 14,021 · validation: 801 · test: 1,641
  • assistant_first_unique_ratio: 0.8408
  • assistant_final_unique_ratio: 1.0000

Usage Note

Model behavior is optimized for decision-focused responses (method choice, alternatives, risk signals, validation planning).

Copyright

Copyright (c) Zero9 Tech

License

Apache-2.0