Byung-Kwan Lee

BK-Lee

https://sites.google.com/view/byungkwanlee

AI & ML interests

Vision-Language Models

Recent Activity

upvoted a paper 4 days ago

Why Does Self-Distillation (Sometimes) Degrade the Reasoning Capability of LLMs?

Paper • 2603.24472 • Published 5 days ago • 41

upvoted 3 papers 10 days ago

Efficient Reasoning with Balanced Thinking

Paper • 2603.12372 • Published 18 days ago • 143

Nemotron-Cascade 2: Post-Training LLMs with Cascade RL and Multi-Domain On-Policy Distillation

Paper • 2603.19220 • Published 11 days ago • 63

ProRL Agent: Rollout-as-a-Service for RL Training of Multi-Turn LLM Agents

Paper • 2603.18815 • Published 12 days ago • 14

authored a paper 28 days ago

Recursive Think-Answer Process for LLMs and VLMs

Paper • 2603.02099 • Published 28 days ago • 6

upvoted a paper 28 days ago

Recursive Think-Answer Process for LLMs and VLMs

Paper • 2603.02099 • Published 28 days ago • 6

submitted a paper to Daily Papers 28 days ago

Recursive Think-Answer Process for LLMs and VLMs

Paper • 2603.02099 • Published 28 days ago • 6

upvoted 2 papers 2 months ago

OpenVoxel: Training-Free Grouping and Captioning Voxels for Open-Vocabulary 3D Scene Understanding

Paper • 2601.09575 • Published Jan 14 • 26

Fast-ThinkAct: Efficient Vision-Language-Action Reasoning via Verbalizable Latent Planning

Paper • 2601.09708 • Published Jan 14 • 54

upvoted a paper 3 months ago

GDPO: Group reward-Decoupled Normalization Policy Optimization for Multi-reward RL Optimization

Paper • 2601.05242 • Published Jan 8 • 229

authored a paper 3 months ago

Masking Teacher and Reinforcing Student for Distilling Vision-Language Models

Paper • 2512.22238 • Published Dec 23, 2025 • 30

upvoted 3 papers 3 months ago

SurgWorld: Learning Surgical Robot Policies from Videos via World Modeling

Paper • 2512.23162 • Published Dec 29, 2025 • 14

Quantile Rendering: Efficiently Embedding High-dimensional Feature on 3D Gaussian Splatting

Paper • 2512.20927 • Published Dec 24, 2025 • 17

Masking Teacher and Reinforcing Student for Distilling Vision-Language Models

Paper • 2512.22238 • Published Dec 23, 2025 • 30

submitted a paper to Daily Papers 3 months ago

Masking Teacher and Reinforcing Student for Distilling Vision-Language Models

Paper • 2512.22238 • Published Dec 23, 2025 • 30

upvoted 4 papers 3 months ago

NVIDIA Nemotron 3: Efficient and Open Intelligence

Paper • 2512.20856 • Published Dec 24, 2025 • 43

Nemotron 3 Nano: Open, Efficient Mixture-of-Experts Hybrid Mamba-Transformer Model for Agentic Reasoning

Paper • 2512.20848 • Published Dec 23, 2025 • 42

JustRL: Scaling a 1.5B LLM with a Simple RL Recipe

Paper • 2512.16649 • Published Dec 18, 2025 • 27

Zoom-Zero: Reinforced Coarse-to-Fine Video Understanding via Temporal Zoom-in

Paper • 2512.14273 • Published Dec 16, 2025 • 10

upvoted a paper 4 months ago

DeepSeek-V3.2: Pushing the Frontier of Open Large Language Models

Paper • 2512.02556 • Published Dec 2, 2025 • 264
