GitTaskBench: A Benchmark for Code Agents Solving Real-World Tasks Through Code Repository Leveraging Paper • 2508.18993 • Published Aug 26, 2025 • 5
Confucius Code Agent: An Open-sourced AI Software Engineer at Industrial Scale Paper • 2512.10398 • Published Dec 11, 2025 • 14