RL-trained code search agents (1.7B, 4B, 14B) that outperform 2β18Γ larger models using only a Unix terminal. π arxiv.org/abs/2603.17829
-
OpenHands/CodeScout-14B
Text Generation β’ 15B β’ Updated β’ 104 β’ 2 -
OpenHands/CodeScout-4B
Text Generation β’ 4B β’ Updated β’ 101 β’ β’ 1 -
OpenHands/CodeScout-1.7B
Text Generation β’ 2B β’ Updated β’ 392 β’ β’ 1 -
OpenHands/CodeScout-1.7B-RFT
Text Generation β’ 2B β’ Updated β’ 56 β’ β’ 1