4 25 44

Chi PRO

ChilleD

AI & ML interests

Natural Language Processing.

Recent Activity

liked a model 5 days ago

zai-org/GLM-5.2-FP8

liked a model 5 days ago

zai-org/GLM-5.2

liked a dataset 8 days ago

badlogicgames/pi-mono

View all activity

Organizations

liked 2 models 5 days ago

zai-org/GLM-5.2-FP8

Text Generation • 753B • Updated 2 days ago • 138k • • 117

zai-org/GLM-5.2

Text Generation • 753B • Updated 2 days ago • 19.7k • • 1.72k

liked a dataset 8 days ago

badlogicgames/pi-mono

Preview • Updated Apr 6 • 2.94k • 159

upvoted a paper 9 days ago

EvoArena: Tracking Memory Evolution for Robust LLM Agents in Dynamic Environments

Paper • 2606.13681 • Published 10 days ago • 140

upvoted a paper 11 days ago

Retrospective Harness Optimization: Improving LLM Agents via Self-Preference over Trajectory Rollouts

Paper • 2606.05922 • Published 17 days ago • 67

upvoted 2 papers about 1 month ago

You Only Need Minimal RLVR Training: Extrapolating LLMs via Rank-1 Trajectories

Paper • 2605.21468 • Published May 20 • 50

AutoResearchClaw: Self-Reinforcing Autonomous Research with Human-AI Collaboration

Paper • 2605.20025 • Published May 19 • 190

published a dataset about 1 month ago

ChilleD/WebHarbor

Updated May 11 • 252

updated a dataset about 1 month ago

ChilleD/WebHarbor

Updated May 11 • 252

updated a collection about 1 month ago

SynthAgent

Collection

4 items • Updated May 11

upvoted a paper about 1 month ago

Can RL Teach Long-Horizon Reasoning to LLMs? Expressiveness Is Key

Paper • 2605.06638 • Published May 7 • 16

updated a Space about 2 months ago

Agent World Model Environment Server

🤖

Step through and monitor an OpenEnv environment via web UI

published a Space about 2 months ago

Agent World Model Environment Server

🤖

Step through and monitor an OpenEnv environment via web UI

liked a model about 2 months ago

deepseek-ai/DeepSeek-V4-Pro

Text Generation • 862B • Updated 13 days ago • 2.8M • • 4.99k

upvoted a collection about 2 months ago

DeepSeek-V4

Collection

4 items • Updated Apr 24 • 686

upvoted a paper 3 months ago

Strategic Navigation or Stochastic Search? How Agents and Humans Reason Over Document Collections

Paper • 2603.12180 • Published Mar 12 • 65

upvoted a paper 4 months ago

GUI-Libra: Training Native GUI Agents to Reason and Act with Action-aware Supervision and Partially Verifiable RL

Paper • 2602.22190 • Published Feb 25 • 17

Chi PRO

AI & ML interests

Recent Activity

Organizations

ChilleD's activity

Agent World Model Environment Server

Agent World Model Environment Server