Reasonning
updated
OmniThink: Expanding Knowledge Boundaries in Machine Writing through
Thinking
Paper
• 2501.09751
• Published
• 46
Towards Large Reasoning Models: A Survey of Reinforced Reasoning with Large Language Models
Paper
• 2501.09686
• Published
• 41
DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via
Reinforcement Learning
Paper
• 2501.12948
• Published
• 441
s1: Simple test-time scaling
Paper
• 2501.19393
• Published
• 124
Reward-Guided Speculative Decoding for Efficient LLM Reasoning
Paper
• 2501.19324
• Published
• 39
ZebraLogic: On the Scaling Limits of LLMs for Logical Reasoning
Paper
• 2502.01100
• Published
• 21
Satori: Reinforcement Learning with Chain-of-Action-Thought Enhances LLM
Reasoning via Autoregressive Search
Paper
• 2502.02508
• Published
• 22
LIMO: Less is More for Reasoning
Paper
• 2502.03387
• Published
• 62
A Probabilistic Inference Approach to Inference-Time Scaling of LLMs
using Particle-Based Monte Carlo Methods
Paper
• 2502.01618
• Published
• 10
Token Assorted: Mixing Latent and Text Tokens for Improved Language
Model Reasoning
Paper
• 2502.03275
• Published
• 18
MME-CoT: Benchmarking Chain-of-Thought in Large Multimodal Models for
Reasoning Quality, Robustness, and Efficiency
Paper
• 2502.09621
• Published
• 28
Logical Reasoning in Large Language Models: A Survey
Paper
• 2502.09100
• Published
• 24
Chain of Draft: Thinking Faster by Writing Less
Paper
• 2502.18600
• Published
• 50
Sketch-of-Thought: Efficient LLM Reasoning with Adaptive
Cognitive-Inspired Sketching
Paper
• 2503.05179
• Published
• 46
Efficient Reasoning Models: A Survey
Paper
• 2504.10903
• Published
• 21
xVerify: Efficient Answer Verifier for Reasoning Model Evaluations
Paper
• 2504.10481
• Published
• 85
VerifiAgent: a Unified Verification Agent in Language Model Reasoning
Paper
• 2504.00406
• Published
• 8
Could Thinking Multilingually Empower LLM Reasoning?
Paper
• 2504.11833
• Published
• 29
Thought Manipulation: External Thought Can Be Efficient for Large
Reasoning Models
Paper
• 2504.13626
• Published
• 7
Phi-4-reasoning Technical Report
Paper
• 2504.21318
• Published
• 54
Knowledge Augmented Complex Problem Solving with Large Language Models:
A Survey
Paper
• 2505.03418
• Published
• 9
Reasoning Models Better Express Their Confidence
Paper
• 2505.14489
• Published
• 20
Reflect, Retry, Reward: Self-Improving LLMs via Reinforcement Learning
Paper
• 2505.24726
• Published
• 277
RuleReasoner: Reinforced Rule-based Reasoning via Domain-aware Dynamic
Sampling
Paper
• 2506.08672
• Published
• 30
The Automated LLM Speedrunning Benchmark: Reproducing NanoGPT
Improvements
Paper
• 2506.22419
• Published
• 15
In-Context Learning Strategies Emerge Rationally
Paper
• 2506.17859
• Published
• 10