System Prompt: The 'constitution' or initial set of instructions that defines an AI agent's behavior, tools, and constraints
Undirected Scouring: A multi-model evaluation process where different LLMs critique a prompt sequentially, each building on the previous model's findings to discover unknown failure modes
Directed Evaluation: Analysis using formal rules (predicates) to check for specific, known failure patterns like 'mandate-prohibition conflicts' or 'scope overlap'
AST: Abstract Syntax Tree—a hierarchical representation of the prompt's structure (sections, directives, lists) used for static analysis
Composition Seams: The boundaries between modular components (e.g., different sub-prompts) where integration bugs often occur due to undefined contracts
Monolithic Prompt: A single, large document containing all instructions (e.g., Claude Code), prone to contradictions as new features are added
Modular Prompt: A prompt assembled at runtime from smaller, independent pieces (e.g., Gemini CLI), prone to bugs in the interaction between modules
Feature Flags: Switches that toggle specific sections of the prompt on or off at runtime