19 13

William Scott

trueza2s

AI & ML interests

None yet

Recent Activity

upvoted a paper about 8 hours ago

Video Models Can Reason with Verifiable Rewards

upvoted a paper 1 day ago

Anti-Self-Distillation for Reasoning RL via Pointwise Mutual Information

upvoted a paper 1 day ago

DelTA: Discriminative Token Credit Assignment for Reinforcement Learning from Verifiable Rewards

View all activity

Organizations

None yet

upvoted a paper about 8 hours ago

Video Models Can Reason with Verifiable Rewards

Paper • 2605.15458 • Published 10 days ago • 11

upvoted 3 papers 1 day ago

Anti-Self-Distillation for Reasoning RL via Pointwise Mutual Information

Paper • 2605.11609 • Published 12 days ago • 189

DelTA: Discriminative Token Credit Assignment for Reinforcement Learning from Verifiable Rewards

Paper • 2605.21467 • Published 4 days ago • 138

Capturing LLM Capabilities via Evidence-Calibrated Query Clustering

Paper • 2605.17110 • Published 8 days ago • 2

liked a model 2 days ago

Neura-Tech-AI/Neuron-Distill-Qwen2-14B

Text Generation • 15B • Updated 2 days ago • 447 • 4

upvoted a paper 2 days ago

Video2GUI: Synthesizing Large-Scale Interaction Trajectories for Generalized GUI Agent Pretraining

Paper • 2605.14747 • Published 10 days ago • 142

upvoted a paper 5 days ago

Does Synthetic Layered Design Data Benefit Layered Design Decomposition?

Paper • 2605.15167 • Published 10 days ago • 8

upvoted a paper 9 days ago

BalCapRL: A Balanced Framework for RL-Based MLLM Image Captioning

Paper • 2605.07394 • Published 16 days ago • 4

liked a model 12 days ago

poor7/Other

Updated 27 minutes ago • 2

upvoted a paper 16 days ago

Leveraging Verifier-Based Reinforcement Learning in Image Editing

Paper • 2604.27505 • Published 24 days ago • 57

liked a model 16 days ago

hexgrad/Kokoro-82M

Text-to-Speech • Updated Apr 10, 2025 • 11M • • 6.19k

liked a model 22 days ago

cmz1024/olmo3-190m-zh-nano

22M • Updated 13 days ago • 191 • 1

liked a dataset 29 days ago

liuhaotian/LLaVA-Instruct-150K

Preview • Updated Jan 3, 2024 • 7.19k • 601

upvoted a paper about 1 month ago

Beyond Accuracy: Unveiling Inefficiency Patterns in Tool-Integrated Reasoning

Paper • 2604.05404 • Published Apr 7 • 43

liked 2 datasets about 1 month ago

harii66/gt

Updated about 1 hour ago • 13k • 4

waltgrace/fiber-optic-drones

Viewer • Updated Apr 8 • 2.26k • 86

upvoted 3 papers about 2 months ago

Brevity Constraints Reverse Performance Hierarchies in Language Models

Paper • 2604.00025 • Published Mar 11 • 23

Adam's Law: Textual Frequency Law on Large Language Models

Paper • 2604.02176 • Published Apr 2 • 503

CARLA-Air: Fly Drones Inside a CARLA World -- A Unified Infrastructure for Air-Ground Embodied Intelligence

Paper • 2603.28032 • Published Mar 30 • 342

liked a dataset about 2 months ago

teknium/OpenHermes-2.5

Viewer • Updated Apr 15, 2024 • 1M • 19.8k • 835

William Scott

AI & ML interests

Recent Activity

Organizations

trueza2s's activity