shuo shen

hyperion-shuo

6 21

Hyperion-shuo

AI & ML interests

reinforcement learning

Recent Activity

upvoted a paper 2 days ago

Group Entropy-Controlled Policy Optimization

upvoted a paper 9 days ago

AdvancedMathBench: A Benchmark Suite for Advanced Mathematical Proof Generation and Verification

upvoted a paper 10 days ago

Scalable Visual Pretraining for Language Intelligence

View all activity

Organizations

upvoted a paper 2 days ago

Group Entropy-Controlled Policy Optimization

Paper • 2607.16850 • Published 6 days ago • 27

upvoted a paper 9 days ago

AdvancedMathBench: A Benchmark Suite for Advanced Mathematical Proof Generation and Verification

Paper • 2607.11849 • Published 11 days ago • 33

upvoted a paper 10 days ago

Scalable Visual Pretraining for Language Intelligence

Paper • 2607.09657 • Published 14 days ago • 56

upvoted a paper about 2 months ago

ThoughtFold: Folding Reasoning Chains via Introspective Preference Learning

Paper • 2606.03503 • Published Jun 2 • 25

liked a dataset 3 months ago

JunkaiZ/Rubrics

Viewer • Updated Mar 20 • 13k • 70 • 3

liked 2 datasets 4 months ago

garg-aayush/sft-cs336-assign5-datasets

Preview • Updated Jan 26 • 125 • 6

Maxwell-Jia/MATH

Viewer • Updated Dec 3, 2024 • 12.5k • 543 • 5

liked a model 5 months ago

Kwaipilot/KAT-Dev

Text Generation • 33B • Updated Oct 14, 2025 • 1.12k • • 215

liked a model 7 months ago

meta-llama/Llama-3.2-1B-Instruct

Text Generation • 1B • Updated Oct 24, 2024 • 10.5M • • 1.54k

liked a model 8 months ago

bird-of-paradise/deepseek-mla

Text Generation • Updated Feb 27, 2025 • 20

upvoted a collection 8 months ago

Olmo 3

Collection

Artifacts for the Olmo 3 release. • 7 items • Updated Mar 2 • 174

liked a dataset 8 months ago

karpathy/fineweb-edu-100b-shuffle

Viewer • Updated Sep 25, 2025 • 97.2M • 4.91k • 168

liked a model 11 months ago

jakegrigsby/metamon

Reinforcement Learning • Updated May 20 • 4

liked a dataset 11 months ago

jakegrigsby/metamon-parsed-replays

Viewer • Updated May 21 • 7 • 369 • 5

liked a model 11 months ago

tencent/Hunyuan-GameCraft-1.0

Image-to-Video • Updated Aug 19, 2025 • 160 • 491

upvoted an article 11 months ago

Article

KV Caching Explained: Optimizing Transformer Inference Efficiency

not-lain

•

Jan 30, 2025

• 379

liked a dataset 12 months ago

ZenMoore/RoleBench

Preview • Updated Nov 23, 2023 • 1.4k • 92

liked a dataset about 1 year ago

HuggingFaceFW/fineweb-2

Viewer • Updated Oct 27, 2025 • 4.48B • 73.5k • 845

liked a Space about 1 year ago

MTEB Leaderboard

📊

7.58k

Embedding Leaderboard

published a Space about 1 year ago

Mcp Sentiment

📈

mcp-sentiment