Absorbing Complexity: An Interaction-Native Knowledge Harness for Financial LLM Agents
Abstract
Financial AI agents struggle with user complexity, but a new architecture called InKH addresses this by embedding complexity into the system through structured knowledge management and temporal memory mechanisms.
Financial AI agents often fail for a simple reason: they make users carry the complexity. A user must repeatedly restate goals, risk preferences, portfolio context, past judgments, and shifting market assumptions, while the agent answers, retrieves, acts, and forgets. In finance, this is not just inconvenient. In tasks such as market analysis, copy-trading review, and trade preparation, forgotten context and stale memory can create latency, repeated errors, weak auditability, and unsafe decisions. We propose the interaction-native knowledge harness (InKH), an architecture for financial LLM agents that absorbs complexity into the system. InKH converts user, market, portfolio, and tool events into structured operational knowledge. It uses passive knowledge injection to assemble a bounded working context buffer before the main model step, temporal graph memory for low-latency retrieval, a wiki audit surface for human-readable governance, and background extraction with maturity, decay, and write-time invalidation. We evaluate InKH on a reproducible controlled synthetic benchmark with 24 random seeds, 4 rounds, 80 episodes per round, and 6 baselines, producing 46,080 baseline-conditioned evaluations. InKH achieves mean task quality of 0.815 at 900 ms latency. Compared with agent-driven wiki-walk memory, it reduces latency by 82.95 percent, token cost by 82.29 percent, and stale-knowledge usage by 96.58 percent, while improving quality by 0.108 and traceability by 0.461. Compared with a temporal-graph system without invalidation, it improves quality by 0.050 and reduces stale-memory usage by 96.58 percent with comparable serving cost. The results support a design thesis for financial AI: adoption happens when complexity is absorbed by the system rather than transferred to the user. The benchmark validates architecture-level behavior, not live trading performance.
Community
Financial AI adoption is constrained not by model quality but by cognition friction: users repeatedly restate fragmented context, historical judgments, and risk preferences. Existing financial agents remain turn-based and workflow-disposable.
We propose InKH, a financial-agent architecture combining: an event-stream view of user/market/tool updates; a bounded working context buffer assembled by passive injection rather than agent-driven search; a temporal knowledge graph for low-latency retrieval; a wiki audit surface; and background extraction with maturity, decay, and write-time invalidation.
We report a reproducible benchmark with 24 seeds, 4 rounds, 80 episodes per round, and 6 baselines. InKH achieves 0.815 task quality at 900ms mean latency. Relative to agent-driven wiki-walk memory: 82.95% lower latency, 82.29% lower token cost, 96.58% less stale-knowledge usage, +0.108 task quality, +0.461 traceability.
Get this paper in your agent:
hf papers read 2606.01886 Don't have the latest CLI?
curl -LsSf https://hf.co/cli/install.sh | bash Models citing this paper 0
No model linking this paper
Datasets citing this paper 0
No dataset linking this paper
Spaces citing this paper 0
No Space linking this paper