Julian Kindel
JulianKindel
ยท
AI & ML interests
None yet
Recent Activity
upvoted a paper about 1 month ago
VESPO: Variational Sequence-Level Soft Policy Optimization for Stable Off-Policy LLM Training upvoted a paper 3 months ago
ProAct: Agentic Lookahead in Interactive Environments upvoted a paper 3 months ago
RLAnything: Forge Environment, Policy, and Reward Model in Completely Dynamic RL System