_comment: REQUIRED: Define ALL technical terms, acronyms, and method names used ANYWHERE in the entire summary. After drafting the summary, perform a MANDATORY POST-DRAFT SCAN: check every section individually (Core.one_sentence_thesis, evaluation_highlights, core_problem, Technical_details, Experiments.key_results notes, Figures descriptions and key_insights). HIGH-VISIBILITY RULE: Terms appearing in one_sentence_thesis, evaluation_highlights, or figure key_insights MUST be defined—these are the first things readers see. COMMONLY MISSED: PPO, DPO, MARL, dense retrieval, silver labels, cosine schedule, clipped surrogate objective, Top-k, greedy decoding, beam search, logit, ViT, CLIP, Pareto improvement, BLEU, ROUGE, perplexity, attention heads, parameter sharing, warm start, convex combination, sawtooth profile, length-normalized attention ratio, NTP. If in doubt, define it.
PTQ: Post-training Quantization—compressing a model after training without a full retraining process, usually using a small calibration dataset
QDiffBench: A new benchmark proposed in this paper that evaluates quantized diffusion models using same-domain data for FID and testing generalization on unseen prompts
FID: Fréchet Inception Distance—a metric used to assess the quality of images generated by a generative model by comparing the distribution of generated images to real images
CLIP Score: A metric evaluating how well an image matches its text caption, using the CLIP (Contrastive Language-Image Pre-training) model embeddings
Stable Diffusion XL: A large-scale (3.5B parameter) latent text-to-image diffusion model
W8A8: Quantization setting where Weights are 8-bit and Activations are 8-bit
W4A8: Quantization setting where Weights are 4-bit and Activations are 8-bit
PCR: Progressive Calibration and Relaxing—the authors' proposed method
Activation Relaxing: A mixed-precision strategy where a small subset of sensitive timesteps use higher bit-width (e.g., 10-bit) while others use lower bit-width
COCO: Common Objects in Context—a large-scale object detection, segmentation, and captioning dataset often used for benchmarking
Denoising steps: The iterative steps a diffusion model takes to convert noise into a clear image
Distribution gap: The difference in statistical characteristics between two datasets (e.g., COCO images vs. images generated by Stable Diffusion)
Calibration data: A small set of data used during PTQ to determine quantization parameters (like scaling factors)