15 17

Luke Robinson

loganm92

AI & ML interests

None yet

Recent Activity

upvoted a paper about 2 hours ago

DelTA: Discriminative Token Credit Assignment for Reinforcement Learning from Verifiable Rewards

liked a dataset about 14 hours ago

GGSheng/common-backup

liked a model 1 day ago

tencent/Hy-MT2-1.8B

View all activity

Organizations

None yet

upvoted a paper about 2 hours ago

DelTA: Discriminative Token Credit Assignment for Reinforcement Learning from Verifiable Rewards

Paper • 2605.21467 • Published 4 days ago • 173

liked a dataset about 14 hours ago

GGSheng/common-backup

Updated 23 minutes ago • 15.8k • 5

liked a model 1 day ago

tencent/Hy-MT2-1.8B

Translation • 2B • Updated 1 day ago • 2.56k • 403

upvoted a paper 1 day ago

Swift Sampling: Selecting Temporal Surprises via Taylor Series

Paper • 2605.22678 • Published 3 days ago • 6

liked a dataset 2 days ago

OpenAssistant/oasst1

Viewer • Updated May 2, 2023 • 88.8k • 24.5k • 1.52k

upvoted a paper 3 days ago

Anti-Self-Distillation for Reasoning RL via Pointwise Mutual Information

Paper • 2605.11609 • Published 12 days ago • 189

liked a model 5 days ago

sma1-rmarud/olmo3-7b-DPO-original-e2-rlvr-e-attack-stepfinal

7B • Updated 5 days ago • 66 • 1

liked a dataset 9 days ago

robot-learning-group47/eval2_bgr_g

Viewer • Updated 9 days ago • 2.4k • 161 • 1

liked a dataset 12 days ago

SpartanEngineer24798/orthogonal_data

Viewer • Updated 11 days ago • 310k • 18.7k • 2

liked a model 17 days ago

WayneW/images

Updated 17 days ago • 1

liked a model 22 days ago

deepseek-ai/Janus-Pro-7B

Any-to-Any • Updated Feb 1, 2025 • 24k • 3.61k

upvoted a paper 30 days ago

Beyond Text-Dominance: Understanding Modality Preference of Omni-modal Large Language Models

Paper • 2604.16902 • Published Apr 18 • 6

liked 2 models about 1 month ago

madhusudhan001/qwen2.5-0.5b-materials-science

Text Generation • 0.5B • Updated 23 days ago • 283 • • 1

tencent/HY-Embodied-0.5

Image-Text-to-Text • 4B • Updated Apr 14 • 858 • 906

upvoted 4 papers about 1 month ago

WildDet3D: Scaling Promptable 3D Detection in the Wild

Paper • 2604.08626 • Published Apr 9 • 246

Rethinking Generalization in Reasoning SFT: A Conditional Analysis on Optimization, Data, and Model Capability

Paper • 2604.06628 • Published Apr 8 • 325

Self-Execution Simulation Improves Coding Models

Paper • 2604.03253 • Published Mar 11 • 35

SkillClaw: Let Skills Evolve Collectively with Agentic Evolver

Paper • 2604.08377 • Published Apr 9 • 291

liked a dataset about 1 month ago

yangwang825/91e3d56b0a-part008

Updated Apr 14 • 312 • 1

upvoted a paper about 1 month ago

GrandCode: Achieving Grandmaster Level in Competitive Programming via Agentic Reinforcement Learning

Paper • 2604.02721 • Published Apr 3 • 629

Luke Robinson

AI & ML interests

Recent Activity

Organizations

loganm92's activity