RadAgent: A tool-using AI agent for stepwise interpretation of chest computed tomography
Abstract
RadAgent, a tool-using AI agent, enhances chest CT report generation through interpretable step-by-step reasoning traces that improve clinical accuracy, robustness, and faithfulness compared to existing 3D vision-language models.
Vision-language models (VLM) have markedly advanced AI-driven interpretation and reporting of complex medical imaging, such as computed tomography (CT). Yet, existing methods largely relegate clinicians to passive observers of final outputs, offering no interpretable reasoning trace for them to inspect, validate, or refine. To address this, we introduce RadAgent, a tool-using AI agent that generates CT reports through a stepwise and interpretable process. Each resulting report is accompanied by a fully inspectable trace of intermediate decisions and tool interactions, allowing clinicians to examine how the reported findings are derived. In our experiments, we observe that RadAgent improves Chest CT report generation over its 3D VLM counterpart, CT-Chat, across three dimensions. Clinical accuracy improves by 6.0 points (36.4% relative) in macro-F1 and 5.4 points (19.6% relative) in micro-F1. Robustness under adversarial conditions improves by 24.7 points (41.9% relative). Furthermore, RadAgent achieves 37.0% in faithfulness, a new capability entirely absent in its 3D VLM counterpart. By structuring the interpretation of chest CT as an explicit, tool-augmented and iterative reasoning trace, RadAgent brings us closer toward transparent and reliable AI for radiology.
Community
New agent for CT report generation that unlocks step-by-step, tool-based reasoning for more accurate and transparent reports. "Anthropic's Faithfulness" (37%) compared to 0% in 3D VLM baseline.
This is an automated message from the Librarian Bot. I found the following papers similar to this paper.
The following papers were recommended by the Semantic Scholar API
- A Reasoning-Enabled Vision-Language Foundation Model for Chest X-ray Interpretation (2026)
- EviAgent: Evidence-Driven Agent for Radiology Report Generation (2026)
- CT-Flow: Orchestrating CT Interpretation Workflow with Model Context Protocol Servers (2026)
- MedOpenClaw: Auditable Medical Imaging Agents Reasoning over Uncurated Full Studies (2026)
- MedVR: Annotation-Free Medical Visual Reasoning via Agentic Reinforcement Learning (2026)
- CXReasonAgent: Evidence-Grounded Diagnostic Reasoning Agent for Chest X-rays (2026)
- Evolving Medical Imaging Agents via Experience-driven Self-skill Discovery (2026)
Please give a thumbs up to this comment if you found it helpful!
If you want recommendations for any Paper on Hugging Face checkout this Space
You can directly ask Librarian Bot for paper recommendations by tagging it in a comment: @librarian-bot recommend
Get this paper in your agent:
hf papers read 2604.15231 Don't have the latest CLI?
curl -LsSf https://hf.co/cli/install.sh | bash Models citing this paper 1
Datasets citing this paper 0
No dataset linking this paper
Spaces citing this paper 0
No Space linking this paper
Collections including this paper 0
No Collection including this paper