MIMIC-RG4: A newly constructed dataset derived from MIMIC-CXR, specifically curated to align report content with four distinct input scenarios (e.g., removing prior comparisons if no prior report is input).
Input-agnostic hallucination: Generated text that describes information not present in the input sources, such as mentioning a prior procedure or comparison when no history is provided.
Longitudinal information: Data from a patient's previous medical examinations, specifically previous radiology reports in this context.
DiscBERT: A BERT-based discriminator trained by the authors to classify whether a report contains specific types of information (e.g., comparisons, views), used for dataset cleaning and evaluation.
CheXbert: A standardized tool for extracting disease labels from radiology reports, used here to ensure diagnostic consistency during data generation and for loss weighting.
Adaptive Token Fusion (ATF): A module that compresses features from various available modalities (images, text) into a fixed sequence length to maintain consistent input to the LLM.
Token-level Loss Weighting (TLW): A training strategy that increases the loss penalty for tokens corresponding to positive or uncertain findings, identified via Integrated Gradients on CheXbert outputs.
Integrated Gradients: An interpretability method used here to calculate attribution scores for tokens contributing to disease classifications.
Perceiver: A neural network architecture used to map high-dimensional inputs to a lower-dimensional latent space using cross-attention.