Evaluation Setup
Evaluation on three distinct agents (Calendar, Database, Weather [inferred from text]) under query-only attack constraints
Benchmarks:
- Three diverse agents (Agentic reasoning and retrieval) [New]
Metrics:
- Memory Injection Success Rate (MISR)
- Attack Success Rate (ASR)
- Statistical methodology: Not explicitly reported in the paper
Main Takeaways
- MINJA achieves a high average Memory Injection Success Rate (98.2%) across diverse agents, proving that progressive shortening effectively bypasses the difficulty of generating malicious records from benign queries.
- The attack successfully translates to harmful outcomes, with a 76.8% Attack Success Rate, meaning the agent frequently adopts the malicious reasoning when the victim queries.
- The method functions under strict constraints where the attacker cannot modify the victim's query or the memory database directly, highlighting a critical vulnerability in shared-memory agent deployments.