Independent Researcher,
Xiamen University, Xiamen, China,
Yue’erwan Internet Hospital Co., Ltd.,
School of Biomedical Engineering / School of Clinical Medicine, Tsinghua University, Beijing, China,
National University of Singapore, Singapore,
Beijing Tsinghua Changgung Hospital
arXiv
(2026)
MemoryAgentRAG
📝 Paper Summary
Agentic RAG pipelineClinical Workflow Automation
A hospital-specific agent operating system that replaces vector retrieval with hierarchical document manifests and enforces safety through strictly isolated Linux-based execution environments.
Core Problem
General-purpose agents lack the security and longitudinal memory required for hospitals, while existing hospital systems are too rigid to handle variable clinical needs.
Why it matters:
Standard agents require broad permissions (file/network access) incompatible with healthcare privacy and audit regulations
Vector-based memory fragments patient history into isolated chunks, losing the temporal and causal structure needed for clinical reasoning
Fixed-function hospital systems cannot handle the 'long tail' of unique patient comorbidities that fall outside pre-programmed protocols
Concrete Example:A physician querying a patient's medication history might retrieve a dosage note from three years ago alongside a recent adverse event with no structural signal to distinguish them. Vector retrieval treats these as independent chunks, failing to capture the causal timeline needed for safe decision-making.
Key Novelty
Agentic Operating System for Hospital (AOSH)
Replaces vector embeddings with 'Page-Indexed Memory': agents navigate a tree of human-readable 'manifest' files to find documents, mirroring how clinicians scan medical charts
Enforces 'least-privilege execution' by isolating agents in restricted Linux namespaces where they can only invoke pre-approved skills, blocking arbitrary network or file access
Uses a 'document-mutation' coordination model where agents communicate solely by writing structured updates to clinical records, creating an automatic audit trail
Architecture
Overall architecture of the Agentic Operating System for Hospital
Breakthrough Assessment
7/10
Proposes a necessary architectural shift from 'model capabilities' to 'infrastructure design' for clinical AI. The abandonment of vector retrieval for manifest-based navigation is a significant conceptual pivot.
⚙️ Technical Details
Problem Definition
Setting: Deployment of autonomous LLM agents within regulated clinical environments
Inputs: Clinical requests (from physicians/patients) and longitudinal patient documents
Outputs: Structured document updates (reports, orders) and executed skill sequences
Executes reasoning and task planning within an isolated namespace
Model or implementation: Not explicitly reported in the provided text
Page-Indexed Memory
Organizes clinical context as a document tree with summary manifests
Model or implementation: LLM (for manifest summarization)
Medical Skills Library
Encapsulates specific clinical operations (e.g., vital sign aggregation)
Model or implementation: Code/API Connectors
Event Broker
Dispatches document mutation events to subscribed agents
Model or implementation: Messaging Infrastructure
Novel Architectural Elements
Page-Indexed Memory: Replaces vector stores with a manifest-based document tree that the agent navigates explicitly
Document-Mutation Coordination: Agents interact only via writing to shared documents (no direct message passing)
Linux-based User Isolation: Maps agent identities to restricted OS-level user accounts to enforce security boundaries
Modeling
Base Model: Not explicitly reported in the provided text
Compute: Not reported in the paper
Comparison to Prior Work
vs. MemGPT: MemGPT optimizes for conversational continuity; this work optimizes for document-structured longitudinal clinical records
vs. GraphRAG/RAPTOR: These require expensive offline index construction/re-indexing; this work handles live mutations naturally via local manifest updates
vs. Standard RAG: Abandonment of vector similarity in favor of LLM-driven manifest navigation to preserve temporal context
Limitations
No quantitative evaluation results provided in the text
Reliance on the agent's reasoning capability to navigate manifests may be slower than vector retrieval
Requires maintenance of manifest files (though described as local updates)
Specific details on the underlying LLM backbone are missing in the provided sections
Reproducibility
The paper describes a design grounded in the open-source OpenClaw framework, but does not provide a specific repository URL for the hospital-specific implementation or the curated medical skills library.
📊 Experiments & Results
Evaluation Setup
Not reported in the provided text (text ends at Section 4)
Metrics:
Statistical methodology: Not explicitly reported in the paper
Main Takeaways
The paper proposes an infrastructure-first approach to clinical AI, prioritizing safety and memory structure over raw model capability.
The 'Page-Indexed Memory' architecture offers a theoretical solution to the fragmentation problem of vector RAG by keeping documents in their hierarchical, temporal context.
The system handles the 'long tail' of clinical variability by allowing agents to compose skills ad-hoc, unlike fixed-function EHR systems.
📚 Prerequisite Knowledge
Prerequisites
Understanding of Retrieval-Augmented Generation (RAG) limitations
Basic knowledge of Operating System concepts (namespaces, permissions)
Familiarity with clinical workflows (EHRs, documentation)
Key Terms
OpenClaw: An open-source autonomous agent framework that structures capabilities as a curated library of discrete skills; the basis for this work
Page-Indexed Memory: A memory architecture where agents navigate a hierarchical tree of documents via human-readable summary files (manifests) rather than vector similarity
Manifest: A structured summary file (manifest.md) at each node of the document tree describing the scope and contents of child documents
Mutation Event: An append-only record of a change to a document, used to trigger agent actions and maintain an audit trail
RAG: Retrieval-Augmented Generation—AI systems that answer questions by first searching for relevant documents
Vector Embedding: A numerical representation of text used in standard RAG; this paper argues it fragments clinical context
Least-privilege execution: A security design where agents operate with the minimum necessary permissions, enforced here via Linux namespaces