Efficient Reasoning

Recent Activity

Leo-Dai authored a paper about 1 month ago

Small RL Controller, Large Language Model: RL-Guided Adaptive Sampling for Test-Time Scaling

ChengsongHuang submitted a paper about 1 month ago

Small RL Controller, Large Language Model: RL-Guided Adaptive Sampling for Test-Time Scaling

TongZheng1999 authored a paper about 2 months ago

LLMs Improving LLMs: Agentic Discovery for Test-Time Scaling

authored a paper about 1 month ago

Small RL Controller, Large Language Model: RL-Guided Adaptive Sampling for Test-Time Scaling

Paper • 2606.03102 • Published Jun 2 • 14

submitted a paper to Daily Papers about 1 month ago

Small RL Controller, Large Language Model: RL-Guided Adaptive Sampling for Test-Time Scaling

Paper • 2606.03102 • Published Jun 2 • 14

authored a paper about 2 months ago

LLMs Improving LLMs: Agentic Discovery for Test-Time Scaling

Paper • 2605.08083 • Published May 8 • 70

authored 3 papers 2 months ago

DeltaRubric: Generative Multimodal Reward Modeling via Joint Planning and Verification

Paper • 2605.09269 • Published May 10 • 6

Reinforcing Multimodal Reasoning Against Visual Degradation

Paper • 2605.09262 • Published May 10 • 7

G-Zero: Self-Play for Open-Ended Generation from Zero Data

Paper • 2605.09959 • Published May 11 • 17

submitted a paper to Daily Papers 2 months ago

G-Zero: Self-Play for Open-Ended Generation from Zero Data

Paper • 2605.09959 • Published May 11 • 17

authored a paper 2 months ago

LLMs Improving LLMs: Agentic Discovery for Test-Time Scaling

Paper • 2605.08083 • Published May 8 • 70

submitted a paper to Daily Papers 2 months ago

LLMs Improving LLMs: Agentic Discovery for Test-Time Scaling

Paper • 2605.08083 • Published May 8 • 70

authored 2 papers 2 months ago

Rethinking the Reranker: Boundary-Aware Evidence Selection for Robust Retrieval-Augmented Generation

Paper • 2602.03689 • Published Feb 3

Nonsense Helps: Prompt Space Perturbation Broadens Reasoning Exploration

Paper • 2605.05566 • Published May 7 • 38

submitted a paper to Daily Papers 2 months ago

Nonsense Helps: Prompt Space Perturbation Broadens Reasoning Exploration

Paper • 2605.05566 • Published May 7 • 38

authored a paper 4 months ago

MM-Zero: Self-Evolving Multi-Model Vision Language Models From Zero Data

Paper • 2603.09206 • Published Mar 10 • 54

authored 2 papers 5 months ago

Training Data Efficiency in Multimodal Process Reward Models

Paper • 2602.04145 • Published Feb 4 • 80

TTCS: Test-Time Curriculum Synthesis for Self-Evolving

Paper • 2601.22628 • Published Jan 30 • 35

authored a paper 5 months ago

Parallel-Probe: Towards Efficient Parallel Thinking via 2D Probing

Paper • 2602.03845 • Published Feb 3 • 27

submitted a paper to Daily Papers 5 months ago

Parallel-Probe: Towards Efficient Parallel Thinking via 2D Probing

Paper • 2602.03845 • Published Feb 3 • 27

submitted a paper to Daily Papers 5 months ago

TTCS: Test-Time Curriculum Synthesis for Self-Evolving

Paper • 2601.22628 • Published Jan 30 • 35

authored a paper 6 months ago

RelayLLM: Efficient Reasoning via Collaborative Decoding

Paper • 2601.05167 • Published Jan 8 • 31