RAG: Retrieval-Augmented Generation—enhancing generative models by retrieving relevant external data to improve accuracy, robustness, and knowledge currency
AIGC: Artificial Intelligence Generated Content—content (text, image, video, etc.) produced by advanced generative models like LLMs or Diffusion models
Dense Retrieval: Retrieval based on semantic matching of vector embeddings rather than keyword matching
Sparse Retrieval: Retrieval based on keyword matching statistics (e.g., BM25, TF-IDF)
Latent Representation: Intermediate internal states or vectors within a neural network model
Logit: The raw, unnormalized prediction scores output by the last layer of a neural network before applying a softmax function
Process Augmentation: RAG paradigm where retrieved information influences the generation steps themselves (e.g., deciding to skip a step) rather than just the content