← Back to Paper List

Monitor-Generate-Verify (MGV): Formalising Metacognitive Theory for Language Model Reasoning

Nick Oh, Fernand Gobet
Centre for Philosophy of, London School of Economics
arXiv (2025)
Reasoning Memory Agent

📝 Paper Summary

Cognitive Architectures for LLMs Metacognitive Reasoning Test-time Compute
MGV is a theoretical framework that extends reasoning architectures by adding a pre-generation 'Monitoring' phase based on psychological theories to prevent models from committing to suboptimal strategies.
Core Problem
Current 'Generate-Verify' architectures suffer from the 'prefix dominance trap,' where models commit early to suboptimal reasoning paths and rarely recover via verification.
Why it matters:
  • The prefix dominance trap causes roughly 20% accuracy loss because verification often cannot correct a fundamentally flawed initial strategy
  • Existing systems (CoT, Self-Refine) prioritize generation and verification but lack the 'monitoring' processes to assess task difficulty or select strategies *before* starting
  • Translating human metacognitive theories into computational terms is necessary to identify missing components in current AI reasoning systems
Concrete Example: In current systems, if a model immediately starts solving a trick math problem with the wrong formula (prefix dominance), subsequent self-correction steps merely refine the wrong path rather than switching strategies. MGV proposes assessing the 'Feeling-of-Difficulty' first to select the right strategy before generating any solution.
Key Novelty
Monitor-Generate-Verify (MGV) Framework
  • Computational translation of Flavell’s and Nelson & Narens’ psychological theories, treating metacognitive constructs (experiences, knowledge) as algorithmic primitives
  • Introduction of an explicit 'Monitoring' phase before generation that converts uncertainty into tractable signals (e.g., difficulty assessments) to guide strategy selection
  • Formalization of 'Metacognitive Experience' as a vector signal and 'Metacognitive Knowledge' as a tripartite datastore (Agent, Task, Strategy variables)
Breakthrough Assessment
4/10
Provides a rigorous theoretical vocabulary for missing components in LLM reasoning, but offers zero empirical validation, implementation, or results to prove the framework works.
×