Cola Chen (SII)

141forever

2 5 1

https://141forever.github.io/

141forever

AI & ML interests

None yet

Recent Activity

upvoted a paper 20 days ago

ReNIO: Reweighting Negative Trajectory Importance for LLM On-Policy Distillation

upvoted a paper about 1 month ago

TreeSeeker: Tree-Structured Trial, Error, and Return in Deep Search

upvoted a paper about 1 month ago

POISE: Position-Aware Undetectable Skill Injection on LLM Agents

View all activity

Organizations

upvoted a paper 20 days ago

ReNIO: Reweighting Negative Trajectory Importance for LLM On-Policy Distillation

Paper • 2606.23104 • Published 23 days ago • 5

upvoted 2 papers about 1 month ago

TreeSeeker: Tree-Structured Trial, Error, and Return in Deep Search

Paper • 2606.11662 • Published Jun 10 • 10

POISE: Position-Aware Undetectable Skill Injection on LLM Agents

Paper • 2606.07943 • Published Jun 6 • 4

New activity in HuggingFaceH4/on-policy-distillation 4 months ago

How to reproduce the results in your blog?

#7 opened 6 months ago by

141forever

liked a Space 5 months ago

Unlocking On-Policy Distillation for Any Model Family

📝

118

Explore on-policy distillation visualization for any model

New activity in HuggingFaceH4/on-policy-distillation 6 months ago

About lr and evaluation

#6 opened 6 months ago by

141forever

upvoted an article 8 months ago

Article

Apriel-H1: The Surprising Key to Distilling Efficient Reasoning Models

ServiceNow-AI

•

Nov 19, 2025

• 34

upvoted a paper 9 months ago

Information Gain-based Policy Optimization: A Simple and Effective Approach for Multi-Turn LLM Agents

Paper • 2510.14967 • Published Oct 16, 2025 • 34

Cola Chen (SII)

AI & ML interests

Recent Activity

Organizations

141forever's activity

How to reproduce the results in your blog?

Unlocking On-Policy Distillation for Any Model Family

About lr and evaluation

Apriel-H1: The Surprising Key to Distilling Efficient Reasoning Models