Ratio-Variance Regularized Policy Optimization for Efficient LLM Fine-tuning Paper • 2601.03320 • Published Jan 6 • 2