Princeton AI for Math

university

https://pli.princeton.edu/

Activity Feed Request to join this org

AI & ML interests

None defined yet.

Recent Activity

narutatsuri authored a paper 22 days ago

Rethinking On-Policy Self-Distillation for Thinking Models

narutatsuri authored a paper about 1 month ago

Do Thinking Tokens Help with Safety?

narutatsuri submitted a paper about 1 month ago

Do Thinking Tokens Help with Safety?

View all activity

narutatsuri

authored a paper 22 days ago

Rethinking On-Policy Self-Distillation for Thinking Models

Paper • 2607.05184 • Published 26 days ago • 2

narutatsuri

authored a paper about 1 month ago

Do Thinking Tokens Help with Safety?

Paper • 2606.25013 • Published Jun 23 • 1

narutatsuri

submitted a paper to Daily Papers about 1 month ago

Do Thinking Tokens Help with Safety?

Paper • 2606.25013 • Published Jun 23 • 1

shichengshuai98

authored 4 papers 3 months ago

MLS-Bench: A Holistic and Rigorous Assessment of AI Systems on Building Better AI

Paper • 2605.08678 • Published May 9 • 9

Building Math Agents with Multi-Turn Iterative Preference Learning

Paper • 2409.02392 • Published Sep 4, 2024 • 16

Provably Efficient Offline Reinforcement Learning with Perturbed Data Sources

Paper • 2306.08364 • Published Jun 14, 2023

Odysseus: Scaling VLMs to 100+ Turn Decision-Making in Games via Reinforcement Learning

Paper • 2605.00347 • Published May 1 • 16

narutatsuri

authored a paper 4 months ago

Self-Distillation Zero: Self-Revision Turns Binary Rewards into Dense Supervision

Paper • 2604.12002 • Published Apr 13 • 12

zrrr

authored a paper 5 months ago

Goedel-Prover-V2: Scaling Formal Theorem Proving with Scaffolded Data Synthesis and Self-Correction

Paper • 2508.03613 • Published Aug 5, 2025 • 16

zrrr

authored 3 papers about 1 year ago

Panacea: Pareto Alignment via Preference Adaptation for LLMs

Paper • 2402.02030 • Published Feb 3, 2024 • 10

From Uncertainty to Trust: Enhancing Reliability in Vision-Language Models with Uncertainty-Guided Dropout Decoding

Paper • 2412.06474 • Published Dec 9, 2024

Offline Reinforcement Learning for LLM Multi-Step Reasoning

Paper • 2412.16145 • Published Dec 20, 2024 • 38

narutatsuri

authored a paper about 1 year ago

Reranking-based Generation for Unbiased Perspective Summarization

Paper • 2506.15925 • Published Jun 19, 2025 • 5

stanleyrwei

authored a paper about 1 year ago

LiveCodeBench Pro: How Do Olympiad Medalists Judge LLMs in Competitive Programming?

Paper • 2506.11928 • Published Jun 13, 2025 • 25

narutatsuri

authored 2 papers over 1 year ago

Speak Easy: Eliciting Harmful Jailbreaks from LLMs with Simple Interactions

Paper • 2502.04322 • Published Feb 6, 2025 • 3

Latent Space Interpretation for Stylistic Analysis and Explainable Authorship Attribution

Paper • 2409.07072 • Published Sep 11, 2024

narutatsuri

authored 3 papers almost 2 years ago

Do Models Explain Themselves? Counterfactual Simulatability of Natural Language Explanations

Paper • 2307.08678 • Published Jul 17, 2023

Enhancing Few-shot Text-to-SQL Capabilities of Large Language Models: A Study on Prompt Design Strategies

Paper • 2305.12586 • Published May 21, 2023

Contrastive Loss is All You Need to Recover Analogies as Parallel Lines

Paper • 2306.08221 • Published Jun 14, 2023

AI & ML interests

Recent Activity

Team members 12

PLI-AI4Math's activity