DCAgent2/medagentbench_g1_min_episodes_e1_gpt_long_thinking_tacc_Qwen3_32B_20260428_213325 Viewer • Updated 33 minutes ago • 900
DCAgent2/swebench_verified_random_100_folders_a2_rl_pymethods2test_v3_15_32B_20260428_181639 Viewer • Updated about 1 hour ago • 300
DCAgent2/financeagent_terminal_SWE_agent_LM_32B_20260428_084221 Viewer • Updated about 1 hour ago • 148
DCAgent2/bfcl_parity_g1_min_episodes_e1_gpt_long_thinking_tacc_Qwen3_32B_20260428_213221 Viewer • Updated about 2 hours ago • 369
DCAgent2/dev_set_v2_a2_rl_pymethods2test_v3_15_32B_20260428_181623 Viewer • Updated about 2 hours ago • 297
DCAgent2/swebench_verified_random_100_folders_g1_a1_top16_32b_step3600_20260428_181349 Viewer • Updated about 2 hours ago • 300
DCAgent2/terminal_bench_2_g1_weighted_100k_32b_cont_20260428_002638 Viewer • Updated about 4 hours ago • 262
DCAgent2/swebench_verified_daVinci_Dev_32B_20260427_232320 Viewer • Updated about 7 hours ago • 1.49k
DCAgent2/terminal_bench_2_a2_rl_defects4j_v3_20260428_051007 Viewer • Updated about 11 hours ago • 267
DCAgent2/GLM-4.7-SERAlike-swesmith-25k-full-3samples-131k Viewer • Updated about 11 hours ago • 4.57k
DCAgent2/terminal_bench_2_g1_a1_top16_32b_step2400_20260427_225638 Viewer • Updated about 11 hours ago • 262
DCAgent2/terminal_bench_2_g1_weighted_100k_32b_cont_step1800_20260427_233018 Viewer • Updated about 12 hours ago • 261
DCAgent2/swebench_verified_SWE_agent_LM_32B_20260427_232227 Viewer • Updated about 14 hours ago • 1.49k
DCAgent2/financeagent_terminal_Qwen3_Coder_30B_A3B_Instruct_20260428_061204 Viewer • Updated about 16 hours ago • 150
DCAgent2/swebench_verified_random_100_folders_a2_rl_defects4j_v3_20260428_050848 Viewer • Updated about 16 hours ago • 300
DCAgent2/swebench_verified_random_100_folders_a2_rl_detailed_20260428_050846 Viewer • Updated about 17 hours ago • 300
DCAgent2/swebench_verified_random_100_folders_g1_a1_top16_32b_step2400_20260427_225622 Viewer • Updated about 17 hours ago • 300
DCAgent2/terminal_bench_2_FourDatasetMixQwen3_8B_20260427_180833 Viewer • Updated about 17 hours ago • 265