- OmniACT: A Dataset and Benchmark for Enabling Multimodal Generalist Autonomous Agents for Desktop and Web
  Paper • 2402.17553 • Published • 26
- Learning to Generate Unit Tests for Automated Debugging
  Paper • 2502.01619 • Published • 4
- gitbugactions/gitbug-java
  Viewer • Updated • 199 • 34 • 2
- rufimelo/defects4j
  Viewer • Updated • 467 • 231 • 3
Moshood Fakorede
thefabdev
AI & ML interests
None yet
Recent Activity
liked a dataset about 2 months ago
MobileDev-Bench/mobiledev-bench
upvoted a paper about 2 months ago
MobileDev-Bench: A Comprehensive Benchmark for Evaluating Language Models on Mobile Application Development
updated a collection about 2 months ago
Academic Papers