SJY8460

SJY23

https://sjy8460.github.io/

AI & ML interests

NLP/LLM

Recent Activity

authored a paper 1 day ago

PIKA: Expert-Level Synthetic Datasets for Post-Training Alignment from Scratch

authored a paper 1 day ago

Aligning Large Language Models via Fully Self-Synthetic Data

authored a paper 1 day ago

GRLO: Towards Generalizable Reinforcement Learning in Open-Ended Environments from Zero

Organizations

None yet

Papers 3

arxiv:2605.15464

arxiv:2510.06670

arxiv:2510.06652

Models 3

SJY23/Gemma-2-9B-it-SAO

9B • Updated Sep 12, 2024 • 2

SJY23/Qwen2-7B-Instruct-SAO

8B • Updated Sep 12, 2024 • 2

SJY23/LLama3-8B-Instruct-SAO

8B • Updated Sep 12, 2024 • 1

Datasets 2

SJY23/PiKa-SFT-30k

Viewer • Updated Apr 9 • 30k • 152 • 2

SJY23/SAO-Gemma-9B

Viewer • Updated Oct 28, 2024 • 80k • 26