-
OpenResearcher: A Fully Open Pipeline for Long-Horizon Deep Research Trajectory Synthesis
Paper • 2603.20278 • Published • 94 -
Rethinking Token-Level Policy Optimization for Multimodal Chain-of-Thought
Paper • 2603.22847 • Published • 25 -
Omni-SimpleMem: Autoresearch-Guided Discovery of Lifelong Multimodal Agent Memory
Paper • 2604.01007 • Published • 29
DARYL LaMar MOORE
darylmooreNC
·
AI & ML interests
Agents, training, reasoning
Recent Activity
liked a model 2 days ago
dealignai/MiniMax-M2.5-JANG_4M-CRACK updated a collection 3 days ago
LLM Architectures updated a collection 3 days ago
Agentic AI Training and Tuning Organizations
None yet
LLM Reasoning
LLM Training Methodologies
-
Teaching Pretrained Language Models to Think Deeper with Retrofitted Recurrence
Paper • 2511.07384 • Published • 19 -
Believe Your Model: Distribution-Guided Confidence Calibration
Paper • 2603.03872 • Published • 40 -
WiT: Waypoint Diffusion Transformers via Trajectory Conflict Navigation
Paper • 2603.15132 • Published • 35
Agentic AI Training and Tuning
-
Tongyi DeepResearch Technical Report
Paper • 2510.24701 • Published • 103 -
Kimi Linear: An Expressive, Efficient Attention Architecture
Paper • 2510.26692 • Published • 132 -
Natural-Language Agent Harnesses
Paper • 2603.25723 • Published • 24 -
CORAL: Towards Autonomous Multi-Agent Evolution for Open-Ended Discovery
Paper • 2604.01658 • Published • 52
Agentic AI
-
The Landscape of Agentic Reinforcement Learning for LLMs: A Survey
Paper • 2509.02547 • Published • 238 -
Demystifying Reinforcement Learning in Agentic Reasoning
Paper • 2510.11701 • Published • 33 -
DeepAnalyze: Agentic Large Language Models for Autonomous Data Science
Paper • 2510.16872 • Published • 112
Large Language Models
-
Universal Deep Research: Bring Your Own Model and Strategy
Paper • 2509.00244 • Published • 14 -
The Landscape of Agentic Reinforcement Learning for LLMs: A Survey
Paper • 2509.02547 • Published • 238 -
Efficient Multi-modal Large Language Models via Progressive Consistency Distillation
Paper • 2510.00515 • Published • 42 -
DeepSearch: Overcome the Bottleneck of Reinforcement Learning with Verifiable Rewards via Monte Carlo Tree Search
Paper • 2509.25454 • Published • 148
Researcg
Multi-Agent Infrastructure
LLM Architectures
-
Kimi Linear: An Expressive, Efficient Attention Architecture
Paper • 2510.26692 • Published • 132 -
GLM-5: from Vibe Coding to Agentic Engineering
Paper • 2602.15763 • Published • 138 -
Believe Your Model: Distribution-Guided Confidence Calibration
Paper • 2603.03872 • Published • 40 -
OpenWorldLib: A Unified Codebase and Definition of Advanced World Models
Paper • 2604.04707 • Published • 195
Reinforcement Learning
-
Demystifying Reinforcement Learning in Agentic Reasoning
Paper • 2510.11701 • Published • 33 -
LoongRL:Reinforcement Learning for Advanced Reasoning over Long Contexts
Paper • 2510.19363 • Published • 63 -
Supervised Reinforcement Learning: From Expert Trajectories to Step-wise Reasoning
Paper • 2510.25992 • Published • 48 -
Teaching Pretrained Language Models to Think Deeper with Retrofitted Recurrence
Paper • 2511.07384 • Published • 19
Sports Predictive Modeling
Research AI
-
OpenResearcher: A Fully Open Pipeline for Long-Horizon Deep Research Trajectory Synthesis
Paper • 2603.20278 • Published • 94 -
Rethinking Token-Level Policy Optimization for Multimodal Chain-of-Thought
Paper • 2603.22847 • Published • 25 -
Omni-SimpleMem: Autoresearch-Guided Discovery of Lifelong Multimodal Agent Memory
Paper • 2604.01007 • Published • 29
Researcg
LLM Reasoning
Multi-Agent Infrastructure
LLM Training Methodologies
-
Teaching Pretrained Language Models to Think Deeper with Retrofitted Recurrence
Paper • 2511.07384 • Published • 19 -
Believe Your Model: Distribution-Guided Confidence Calibration
Paper • 2603.03872 • Published • 40 -
WiT: Waypoint Diffusion Transformers via Trajectory Conflict Navigation
Paper • 2603.15132 • Published • 35
LLM Architectures
-
Kimi Linear: An Expressive, Efficient Attention Architecture
Paper • 2510.26692 • Published • 132 -
GLM-5: from Vibe Coding to Agentic Engineering
Paper • 2602.15763 • Published • 138 -
Believe Your Model: Distribution-Guided Confidence Calibration
Paper • 2603.03872 • Published • 40 -
OpenWorldLib: A Unified Codebase and Definition of Advanced World Models
Paper • 2604.04707 • Published • 195
Agentic AI Training and Tuning
-
Tongyi DeepResearch Technical Report
Paper • 2510.24701 • Published • 103 -
Kimi Linear: An Expressive, Efficient Attention Architecture
Paper • 2510.26692 • Published • 132 -
Natural-Language Agent Harnesses
Paper • 2603.25723 • Published • 24 -
CORAL: Towards Autonomous Multi-Agent Evolution for Open-Ended Discovery
Paper • 2604.01658 • Published • 52
Reinforcement Learning
-
Demystifying Reinforcement Learning in Agentic Reasoning
Paper • 2510.11701 • Published • 33 -
LoongRL:Reinforcement Learning for Advanced Reasoning over Long Contexts
Paper • 2510.19363 • Published • 63 -
Supervised Reinforcement Learning: From Expert Trajectories to Step-wise Reasoning
Paper • 2510.25992 • Published • 48 -
Teaching Pretrained Language Models to Think Deeper with Retrofitted Recurrence
Paper • 2511.07384 • Published • 19
Agentic AI
-
The Landscape of Agentic Reinforcement Learning for LLMs: A Survey
Paper • 2509.02547 • Published • 238 -
Demystifying Reinforcement Learning in Agentic Reasoning
Paper • 2510.11701 • Published • 33 -
DeepAnalyze: Agentic Large Language Models for Autonomous Data Science
Paper • 2510.16872 • Published • 112
Sports Predictive Modeling
Large Language Models
-
Universal Deep Research: Bring Your Own Model and Strategy
Paper • 2509.00244 • Published • 14 -
The Landscape of Agentic Reinforcement Learning for LLMs: A Survey
Paper • 2509.02547 • Published • 238 -
Efficient Multi-modal Large Language Models via Progressive Consistency Distillation
Paper • 2510.00515 • Published • 42 -
DeepSearch: Overcome the Bottleneck of Reinforcement Learning with Verifiable Rewards via Monte Carlo Tree Search
Paper • 2509.25454 • Published • 148