RAG: Retrieval-Augmented Generation—enhancing model output by retrieving relevant documents (e.g., from Wikipedia) before generation
DoLa: Decoding by Contrasting Layers—a decoding strategy that contrasts outputs from different model layers to amplify factual knowledge and reduce hallucinations
FactScore: A metric that quantifies the ratio of correct atomic facts in a generated response compared to a reference source (e.g., Wikipedia)
FinMA: A multi-task fine-tuned version of LLaMA-1 specialized for financial tasks
Prompt-based tool learning: A method where the LLM is prompted to generate code/function calls (e.g., Python API wrappers) to fetch external data instead of relying on internal memory
MAE: Mean Absolute Error—measure of the average size of mistakes in a collection of predictions, without considering their direction
Greedy decoding: A decoding method where the model selects the most probable next token at each step
Few-shot prompting: Providing the model with a small number of example input-output pairs in the prompt to guide its generation