Ruiyang Sun

RuiyangSun

1 1 8

rockmagma02

AI & ML interests

Recent Activity

liked a dataset about 2 months ago

programbench/ProgramBench-Tests

authored a paper over 2 years ago

Safe RLHF: Safe Reinforcement Learning from Human Feedback

authored a paper almost 3 years ago

Baichuan 2: Open Large-scale Language Models

View all activity

Organizations

liked a dataset about 2 months ago

programbench/ProgramBench-Tests

Updated May 6 • 111k • 9

authored a paper over 2 years ago

Safe RLHF: Safe Reinforcement Learning from Human Feedback

Paper • 2310.12773 • Published Oct 19, 2023 • 28

authored a paper almost 3 years ago

Baichuan 2: Open Large-scale Language Models

Paper • 2309.10305 • Published Sep 19, 2023 • 22

liked a Space almost 3 years ago

MT Bench

📊

203

Explore and compare AI model answers on benchmark questions

liked a dataset about 3 years ago

PKU-Alignment/processed-hh-rlhf

Viewer • Updated Nov 24, 2023 • 168k • 64 • 11

authored a paper about 3 years ago

BeaverTails: Towards Improved Safety Alignment of LLM via a Human-Preference Dataset

Paper • 2307.04657 • Published Jul 10, 2023 • 6

upvoted a paper about 3 years ago

BeaverTails: Towards Improved Safety Alignment of LLM via a Human-Preference Dataset

Paper • 2307.04657 • Published Jul 10, 2023 • 6

liked 2 models about 3 years ago

PKU-Alignment/beaver-7b-v1.0-reward

Reinforcement Learning • 7B • Updated Apr 20, 2024 • 4.23k • 17

PKU-Alignment/beaver-7b-v1.0-cost

Reinforcement Learning • 7B • Updated Apr 20, 2024 • 4.4k • 10

liked a dataset about 3 years ago

BAAI/COIG-PC

Viewer • Updated Jun 14, 2024 • 540M • 854 • 271

updated a dataset about 3 years ago

OmniSafeAI/hh-prompts

Viewer • Updated Apr 22, 2023 • 169k • 13 • 1

liked 2 datasets over 3 years ago

allenai/prosocial-dialog

Viewer • Updated Feb 3, 2023 • 166k • 869 • 119

Anthropic/hh-rlhf

Viewer • Updated May 26, 2023 • 169k • 33.7k • 1.83k

Ruiyang Sun

AI & ML interests

Recent Activity

Organizations

RuiyangSun's activity

MT Bench