NFA: Non-negative Factor Analysis—a statistical method that models data as a linear combination of non-negative parts, promoting interpretability and sparsity
Reward Hacking: When an RL agent exploits flaws or spurious correlations in a reward model to get high scores without actually satisfying the intended goal
ELBO: Evidence Lower Bound—a proxy objective function used in variational inference to approximate intractable posterior distributions
Amortized Inference: Using a neural network (encoder) to predict the parameters of a variational distribution for each data point, rather than optimizing parameters individually
Weibull Distribution: A continuous probability distribution used here to model non-negative latent variables because it supports sparsity and efficient reparameterization
Epistemic Uncertainty: Uncertainty stemming from the model's lack of knowledge (can be reduced with more data), modeled here by the global dictionary distribution
Aleatoric Uncertainty: Uncertainty stemming from inherent noise in the data, modeled here by the stochastic local latent variables