Xiliang Yang's picture

2

Xiliang Yang

NoManDeRY

·

AI & ML interests

None yet

Recent Activity

upvoted a paper 5 days ago

MOOSE-Star: Unlocking Tractable Training for Scientific Discovery by Breaking the Complexity Barrier

authored a paper about 1 year ago

DPO-Shift: Shifting the Distribution of Direct Preference Optimization

updated a model about 1 year ago

NoManDeRY/DPO-Shift-Llama-3-8B-Ultrafeedback-fixed-0.95

View all activity

Organizations

None yet

upvoted a paper 5 days ago

MOOSE-Star: Unlocking Tractable Training for Scientific Discovery by Breaking the Complexity Barrier

Paper • 2603.03756 • Published 7 days ago • 85

authored a paper about 1 year ago

DPO-Shift: Shifting the Distribution of Direct Preference Optimization

Paper • 2502.07599 • Published Feb 11, 2025 • 15

updated 7 models about 1 year ago

NoManDeRY/DPO-Shift-Llama-3-8B-Ultrafeedback-fixed-0.95

Text Generation • 8B • Updated Feb 18, 2025 • 1

NoManDeRY/DPO-Shift-Llama-3-8B-Ultrafeedback-decrease_linear-1.0to0.95

Text Generation • 8B • Updated Feb 18, 2025 • 8

NoManDeRY/DPO-Shift-Llama-3-8B-Ultrafeedback-increase_linear_0.95to1.0

Text Generation • 8B • Updated Feb 18, 2025 • 8

NoManDeRY/DPO-Shift-Qwen-2-7B-Ultrafeedback-fixed-1.0

Text Generation • 8B • Updated Feb 18, 2025 • 2

NoManDeRY/DPO-Shift-Qwen-2-7B-Ultrafeedback-fixed-0.95

Text Generation • 8B • Updated Feb 18, 2025 • 3

NoManDeRY/DPO-Shift-Qwen-2-7B-UltraChat200K-SFT

Text Generation • 8B • Updated Feb 18, 2025 • 3

NoManDeRY/DPO-Shift-Llama-3-8B-Ultrafeedback-fixed-1.0

Text Generation • 8B • Updated Feb 18, 2025

upvoted a paper about 1 year ago

DPO-Shift: Shifting the Distribution of Direct Preference Optimization

Paper • 2502.07599 • Published Feb 11, 2025 • 15

published 7 models about 1 year ago

NoManDeRY/DPO-Shift-Llama-3-8B-Ultrafeedback-increase_linear_0.95to1.0

Text Generation • 8B • Updated Feb 18, 2025 • 8

NoManDeRY/DPO-Shift-Llama-3-8B-Ultrafeedback-decrease_linear-1.0to0.95

Text Generation • 8B • Updated Feb 18, 2025 • 8

NoManDeRY/DPO-Shift-Qwen-2-7B-Ultrafeedback-fixed-0.95

Text Generation • 8B • Updated Feb 18, 2025 • 3

NoManDeRY/DPO-Shift-Llama-3-8B-Ultrafeedback-fixed-1.0

Text Generation • 8B • Updated Feb 18, 2025

NoManDeRY/DPO-Shift-Llama-3-8B-Ultrafeedback-fixed-0.95

Text Generation • 8B • Updated Feb 18, 2025 • 1

NoManDeRY/DPO-Shift-Qwen-2-7B-UltraChat200K-SFT

Text Generation • 8B • Updated Feb 18, 2025 • 3

NoManDeRY/DPO-Shift-Qwen-2-7B-Ultrafeedback-fixed-1.0

Text Generation • 8B • Updated Feb 18, 2025 • 2