Exploratory Memory-Augmented LLM Agent via Hybrid On- and Off-Policy Optimization Paper • 2602.23008 • Published about 21 hours ago • 24
ReflAct: World-Grounded Decision Making in LLM Agents via Goal-State Reflection Paper • 2505.15182 • Published May 21, 2025 • 6
facebook/mbart-large-50-many-to-many-mmt Translation • 0.6B • Updated Sep 28, 2023 • 79.3k • • 404
CyberNative/Code_Vulnerability_Security_DPO Viewer • Updated Feb 29, 2024 • 4.66k • 1.07k • 147
dbmdz/bert-large-cased-finetuned-conll03-english Token Classification • 0.3B • Updated Sep 6, 2023 • 1.14M • • 94