HyperNetwork: An auxiliary neural network that takes an input (like an image) and outputs the weights for another neural network (the main model)
LoRA: Low-Rank Adaptation—a technique that fine-tunes large models by optimizing small, low-rank decomposition matrices instead of the full weight matrix
LiDB: Lightweight DreamBooth—the authors' proposed method to further decompose LoRA weights using a fixed random orthogonal basis, reducing trainable parameters to ~100KB
rank-relaxed fine-tuning: A strategy where the model is initialized with a rank-1 prediction from the HyperNetwork, but then fine-tuned with a higher rank (e.g., rank > 1) to capture finer details
diffusion denoising loss: The standard loss function used to train diffusion models, measuring the difference between added noise and predicted noise
ViT: Vision Transformer—a model architecture that processes images as sequences of patches using self-attention mechanisms
T2I: Text-to-Image—models that generate images from text descriptions