Lazy Beaver

Jayce-Ping

5 8 15

AI & ML interests

None yet

Recent Activity

upvoted a paper 6 days ago

OPID: On-Policy Skill Distillation for Agentic Reinforcement Learning

upvoted a paper 6 days ago

In-Context World Modeling for Robotic Control

upvoted a paper 6 days ago

Qwen-Image-Agent: Bridging the Context Gap in Real-World Image Generation

View all activity

Organizations

upvoted 3 papers 6 days ago

liked a model 15 days ago

zai-org/GLM-5.2

Text Generation • 753B • Updated 1 day ago • 191k • • 3.32k

authored a paper 22 days ago

Flow-DPPO: Divergence Proximal Policy Optimization for Flow Matching Models

Paper • 2606.11025 • Published 25 days ago • 41

updated 4 models 23 days ago

Tencent-Hunyuan-Multimodal-RL/FLUX2-klein-base-9b-GenEval2-Multi-Reward

Text-to-Image • Updated 23 days ago • 62 • 3

Tencent-Hunyuan-Multimodal-RL/FLUX2-klein-base-9b-GenEval2-Single-Reward

Text-to-Image • Updated 23 days ago • 54 • 1

Tencent-Hunyuan-Multimodal-RL/SD3.5-GenEval2-Multi-Reward

Text-to-Image • Updated 23 days ago • 58

Tencent-Hunyuan-Multimodal-RL/SD3.5-GenEval2-Single-Reward

Text-to-Image • Updated 23 days ago • 59

updated a collection 23 days ago

Flow-DPPO: GenEval2

Collection

Flow-DPPO-trained LoRA adapters (single- and multi-reward) for SD3.5 and FLUX.2-klein-9B optimized on GenEval2. • 5 items • Updated 23 days ago

upvoted a paper 23 days ago

Flow-DPPO: Divergence Proximal Policy Optimization for Flow Matching Models

Paper • 2606.11025 • Published 25 days ago • 41

upvoted a paper 24 days ago

Rethinking the Divergence Regularization in LLM RL

Paper • 2606.09821 • Published 26 days ago • 33

updated a collection 25 days ago

Flow-DPPO: GenEval2

Collection

Flow-DPPO-trained LoRA adapters (single- and multi-reward) for SD3.5 and FLUX.2-klein-9B optimized on GenEval2. • 5 items • Updated 23 days ago

published 4 models 25 days ago

Tencent-Hunyuan-Multimodal-RL/FLUX2-klein-base-9b-GenEval2-Multi-Reward

Text-to-Image • Updated 23 days ago • 62 • 3

Tencent-Hunyuan-Multimodal-RL/FLUX2-klein-base-9b-GenEval2-Single-Reward

Text-to-Image • Updated 23 days ago • 54 • 1

Tencent-Hunyuan-Multimodal-RL/SD3.5-GenEval2-Multi-Reward

Text-to-Image • Updated 23 days ago • 58

Tencent-Hunyuan-Multimodal-RL/SD3.5-GenEval2-Single-Reward

Text-to-Image • Updated 23 days ago • 59

Lazy Beaver

AI & ML interests

Recent Activity

Organizations

Jayce-Ping's activity