Junxiao Yang

yangjunxiao2021

10 33 14

https://yangjunx21.github.io/

yangjunx21

AI & ML interests

Alignment/AI safety

Recent Activity

liked a dataset 11 days ago

aisa-group/InferenceBench-Trajectories

liked a dataset 11 days ago

aisa-group/PostTrainBench-Trajectories

liked a dataset 26 days ago

zhifeixie/StreamAudio-2M

liked 2 datasets 11 days ago

aisa-group/InferenceBench-Trajectories

Updated 13 days ago • 923 • 3

aisa-group/PostTrainBench-Trajectories

Updated 4 days ago • 16.2k • 7

liked a dataset 26 days ago

zhifeixie/StreamAudio-2M

Viewer • Updated Jun 3 • 381k • 4.45k • 27

New activity in thu-coai/Syncred-Bench 27 days ago

Add task categories, paper and code links

#2 opened 28 days ago by nielsr

authored a paper about 1 month ago

AgentDoG 1.5: A Lightweight and Scalable Alignment Framework for AI Agent Safety and Security

Paper • 2605.29801 • Published May 28 • 144

upvoted a paper about 1 month ago

SynCred-Bench: Benchmarking Synthetic Credibility in AI-Generated Visual Misinformation

Paper • 2606.03348 • Published Jun 2 • 2

submitted a paper to Daily Papers about 1 month ago

SynCred-Bench: Benchmarking Synthetic Credibility in AI-Generated Visual Misinformation

Paper • 2606.03348 • Published Jun 2 • 2

updated a dataset about 1 month ago

yangjunxiao2021/Syncred-Bench

Updated Jun 3 • 39 • 2

liked 2 datasets about 1 month ago

thu-coai/Syncred-Bench

Viewer • Updated 27 days ago • 2.1k • 115 • 2

yangjunxiao2021/Syncred-Bench

Updated Jun 3 • 39 • 2

published 2 datasets about 1 month ago

thu-coai/Syncred-Bench

Viewer • Updated 27 days ago • 2.1k • 115 • 2

yangjunxiao2021/Syncred-Bench

Updated Jun 3 • 39 • 2

upvoted a paper about 1 month ago

AgentDoG 1.5: A Lightweight and Scalable Alignment Framework for AI Agent Safety and Security

Paper • 2605.29801 • Published May 28 • 144

upvoted a paper about 2 months ago

Achieving Gold-Medal-Level Olympiad Reasoning via Simple and Unified Scaling

Paper • 2605.13301 • Published May 13 • 165

authored a paper 3 months ago

LASA: Language-Agnostic Semantic Alignment at the Semantic Bottleneck for LLM Safety

Paper • 2604.12710 • Published Apr 13 • 5

upvoted a paper 3 months ago

LASA: Language-Agnostic Semantic Alignment at the Semantic Bottleneck for LLM Safety

Paper • 2604.12710 • Published Apr 13 • 5

submitted a paper to Daily Papers 3 months ago

LASA: Language-Agnostic Semantic Alignment at the Semantic Bottleneck for LLM Safety

Paper • 2604.12710 • Published Apr 13 • 5

upvoted an article 3 months ago

Uncensor any LLM with abliteration

Article • mlabonne • Jun 13, 2024 • 871

upvoted a paper 4 months ago

CUDA Agent: Large-Scale Agentic RL for High-Performance CUDA Kernel Generation

Paper • 2602.24286 • Published Feb 27 • 100

liked a dataset 5 months ago

ronantakizawa/moltbook

Viewer • Updated Feb 2 • 6.23k • 174 • 55
