RFEval: Benchmarking Reasoning Faithfulness under Counterfactual Reasoning Intervention in Large Reasoning Models Paper • 2602.17053 • Published Feb 19 • 1