YijuGuo
AI & ML interests
LLM Alignment
Recent Activity
authored
a paper
about 11 hours ago
Controllable Preference Optimization: Toward Controllable
Multi-Objective Alignment
authored
a paper
about 11 hours ago
LaSeR: Reinforcement Learning with Last-Token Self-Rewarding
authored
a paper
about 11 hours ago
Learning to Focus: Causal Attention Distillation via Gradient-Guided
Token Pruning