FinCoT: Financial Chain-of-Thought—a prompting strategy that embeds expert workflows as diagrams to guide LLM reasoning.
Mermaid: A text-based syntax for generating diagrams and charts, used here to encode reasoning blueprints that LLMs can parse.
ST-CoT: Structured Chain-of-Thought—prompting that enforces specific tags like <thinking> and <output> to organize reasoning.
UST-CoT: Unstructured Chain-of-Thought—standard free-form step-by-step reasoning prompting.
SP: Standard Prompting—zero-shot prompting where the model is asked to answer directly without explicit reasoning steps.
CFA: Chartered Financial Analyst—a professional designation; used here to refer to the difficulty and style of the evaluation questions.
FinNLP: Financial Natural Language Processing—applying NLP techniques specifically to the finance domain.
SFT: Supervised Fine-Tuning—training a model on a labeled dataset to adapt it to a specific task.
GRPO: Group Relative Policy Optimization—a reinforcement learning algorithm used for training some of the baseline models (e.g., DianJin-R1, Fin-o1).
semi-reflection: A simplified self-verification step included within the <output> block of FinCoT, avoiding a separate complex reflection phase.