3 55 3

Yumin Choi

YuminChoi

AI & ML interests

None yet

Recent Activity

upvoted a paper 3 days ago

Zone of Proximal Policy Optimization: Teacher in Prompts, Not Gradients

upvoted a paper 4 days ago

Who Should Lead Decoding Now? Tracking Reliable Trajectories for Ensembling Masked Diffusion Language Models

upvoted a paper 13 days ago

TIDE: Proactive Multi-Problem Discovery via Template-Guided Iteration

View all activity

Organizations

upvoted a paper 3 days ago

Zone of Proximal Policy Optimization: Teacher in Prompts, Not Gradients

Paper • 2606.18216 • Published 4 days ago • 51

upvoted a paper 4 days ago

Who Should Lead Decoding Now? Tracking Reliable Trajectories for Ensembling Masked Diffusion Language Models

Paper • 2606.16281 • Published 5 days ago • 31

upvoted a paper 13 days ago

TIDE: Proactive Multi-Problem Discovery via Template-Guided Iteration

Paper • 2606.04743 • Published 17 days ago • 45

upvoted a paper 22 days ago

OmniRetrieval: Unified Retrieval across Heterogeneous Knowledge Sources

Paper • 2605.29250 • Published 23 days ago • 76

authored a paper 23 days ago

HINT-SD: Targeted Hindsight Self-Distillation for Long-Horizon Agents

Paper • 2605.17873 • Published May 18 • 12

upvoted 3 papers 23 days ago

Learn from Weaknesses: Automated Domain Specialization for Small Computer-Use Agents

Paper • 2605.28775 • Published 24 days ago • 38

Agent Explorative Policy Optimization for Multimodal Agentic Reasoning

Paper • 2605.28774 • Published 24 days ago • 92

HINT-SD: Targeted Hindsight Self-Distillation for Long-Horizon Agents

Paper • 2605.17873 • Published May 18 • 12

authored a paper 29 days ago

It Takes Two: Complementary Self-Distillation for Contextual Integrity in LLMs

Paper • 2605.20258 • Published May 18 • 30

upvoted a paper about 1 month ago

It Takes Two: Complementary Self-Distillation for Contextual Integrity in LLMs

Paper • 2605.20258 • Published May 18 • 30

authored a paper about 1 month ago

PREPING: Building Agent Memory without Tasks

Paper • 2605.13880 • Published May 11 • 28

upvoted 3 papers about 1 month ago

Nudging Beyond the Comfort Zone: Efficient Strategy-Guided Exploration for RLVR

Paper • 2605.15726 • Published May 15 • 34

VibeProteinBench: An Evaluation Benchmark for Language-interfaced Vibe Protein Design

Paper • 2605.10978 • Published May 13 • 19

PREPING: Building Agent Memory without Tasks

Paper • 2605.13880 • Published May 11 • 28

submitted a paper to Daily Papers about 1 month ago

PREPING: Building Agent Memory without Tasks

Paper • 2605.13880 • Published May 11 • 28

upvoted 3 papers about 1 month ago

CollabVR: Collaborative Video Reasoning with Vision-Language and Video Generation Models

Paper • 2605.08735 • Published May 9 • 70

Soohak: A Mathematician-Curated Benchmark for Evaluating Research-level Math Capabilities of LLMs

Paper • 2605.09063 • Published May 9 • 81

RLDX-1 Technical Report

Paper • 2605.03269 • Published May 5 • 126

upvoted a paper 2 months ago

Memory Transfer Learning: How Memories are Transferred Across Domains in Coding Agents

Paper • 2604.14004 • Published Apr 15 • 30

authored a paper 3 months ago

T-MAP: Red-Teaming LLM Agents with Trajectory-aware Evolutionary Search

Paper • 2603.22341 • Published Mar 21 • 37

Yumin Choi

AI & ML interests

Recent Activity

Organizations

YuminChoi's activity