bipartite graph: A graph where nodes are divided into two disjoint sets (here, responses and claims), and edges only connect nodes from different sets
entailment: A relationship where the truth of one statement (the response) guarantees the truth of another (the claim)
centrality metrics: Measures used to determine the importance of a node in a graph; examples include Degree (number of connections) and Closeness (average distance to others)
self-consistency (SC): A method that samples multiple reasoning paths or answers from an LLM and selects the most consistent one; shown here to be equivalent to Degree Centrality
AUPRC: Area Under the Precision-Recall Curve—a performance metric for classification tasks, particularly useful when classes are imbalanced
FactScore: A metric/benchmark for evaluating the factuality of long-form text generation by breaking text into atomic claims and verifying them
degree centrality: A simple measure of importance based on counting the number of direct connections a node has
closeness centrality: A measure of importance based on the average length of the shortest paths between a node and all other nodes in the graph
betweenness centrality: A measure of importance based on the number of times a node acts as a bridge along the shortest path between two other nodes
PageRank: An algorithm that measures node importance by counting the number and quality of links to the node (originally used by Google for web pages)
verbalized confidence (VC): Prompting the LLM to explicitly state its confidence (e.g., 'I am 90% sure') in its own output
greedy decoding: A generation strategy where the model always selects the highest-probability token at each step