ViT: Vision Transformer—a model architecture applying the Transformer mechanism directly to sequences of image patches
PTQ: Post-Training Quantization—quantizing a pre-trained model using only a small calibration set without full retraining
TanQ: Tan Quantizer—the proposed quantization function based on the tangent function to fit power-law distributions
MOSF: MAD-guided Optimal Scaling Factor—the proposed method to select scaling factors by minimizing Mean Absolute Deviation
MSA: Multi-Head Self-Attention—mechanism in Transformers that captures correlations between different input patches
LayerNorm: Layer Normalization, a technique that normalizes the activations within a layer to stabilize training
MAD: Mean Absolute Deviation—a robust measure of variability used here to detect and handle outliers in scaling factors
RepQ-ViT: A prior PTQ method for ViTs that uses scale reparameterization and log√2 (base-√2 logarithmic) quantization
Log2 Quantizer: A quantization scheme that allocates intervals logarithmically, giving more precision to small values
W4A4: 4-bit quantization for both Weights (W) and Activations (A)
W3A3: 3-bit quantization for both Weights (W) and Activations (A)