arxiv:2602.09555
DeyangKong
AI & ML interests
Natural Language Processing
Recent Activity
upvoted a paper 17 days ago
From P(y|x) to P(y): Investigating Reinforcement Learning in Pre-train Space
upvoted a paper 26 days ago
OpenWorldLib: A Unified Codebase and Definition of Advanced World Models
upvoted a paper about 1 month ago
DataFlex: A Unified Framework for Data-Centric Dynamic Training of Large Language Models