PRONTOQA: A benchmark generator for first-order logic problems that allows controlling ontology (True/False) and graph structure to test reasoning
Steamroller problems: Logic puzzles involving multiple steps of deduction (hops) designed to challenge reasoning systems; often used to test ATPs
False Ontology: A problem setting where facts contradict real-world knowledge (e.g., 'polars bears are reptiles') to force the model to rely on logic rather than memory
ATP: Automated Theorem Prover—algorithms that use logical rules to search for proofs
Bottom Up: A reasoning strategy (Forward Chaining) starting from facts to derive new conclusions until the query is answered
Top Down: A reasoning strategy (Backward Chaining) starting from the query/goal and working backwards to find supporting facts
Magic Set Transformation: An optimization strategy combining Top Down filtering (to find relevant rules) with Bottom Up execution
Default Negation: The assumption that if a statement cannot be proven true (or the model is unsure), it is treated as false