codearena-rl / server /grader.py

Commit History

Finalizing CodeArena RL Benchmark: frontend improvements, GRPO training scripts, and cleaned environment
03a7eb9

havinashpatil commited on

Complete all tasks: Adaptive curriculum, GRPO, React frontend, LLM-as-a-judge
a448db8

havinashpatil commited on

Upload meta
03defc2

havinashpatil commited on

fix: reset task_id parsing, grader tuple crash fallback, and inference score output
646409d

adityanaikhpt commited on

fix: clamp scores to strict (0.001, 0.999) range
b3485c6

adityanaikhpt commited on

fix: openenv.yaml tasks graders + strict score bounds
9967cb5

adityanaikhpt commited on

fix: add safe_reward() clamp at every reward return point
f14f8d9

adityanaikhpt commited on

Add normalize_reward(); enforce strict (0,1) on all paths
b53855f

adityanaikhpt commited on

Update grading logic to strict (0,1) ratio fallback
78b7e6b

adityanaikhpt commited on