LDM: Latent Diffusion Model—a generative model that performs diffusion (adding/removing noise) in a compressed latent space rather than pixel space
VGG-19: A deep convolutional neural network often used as a feature extractor in style transfer to capture texture and content statistics
LoRA: Low-Rank Adaptation—a parameter-efficient fine-tuning technique that freezes pre-trained weights and trains small rank-decomposition matrices
AdaIN: Adaptive Instance Normalization—a style transfer technique that aligns the mean and variance of content features to match those of style features
BoxDiff: A training-free method for controlling object layout in diffusion models using spatial constraints on attention maps
Self-Attention (SA): A mechanism where the model attends to different parts of the image itself to maintain structural consistency and layout
Cross-Attention (CA): A mechanism where the model attends to the text prompt to align visual content with semantic descriptions
DreamBooth: A fine-tuning technique for personalization that updates model weights to associate a specific subject with a unique identifier