SoC: System-on-Chip—an integrated circuit that integrates all components of a computer or other electronic system (CPU, GPU, NPU, memory) on a single chip.
NPU: Neural Processing Unit—a specialized circuit designed to accelerate machine learning algorithms.
Agentic RAG: A RAG system where autonomous agents dynamically decide steps like rewriting queries or multiple retrieval rounds, creating a complex, evolving execution graph.
DAG: Directed Acyclic Graph—a representation of tasks and dependencies where edges point from earlier tasks to later ones without loops.
Critical Path: The sequence of dependent tasks that determines the minimum possible duration of the entire process.
SJF: Shortest Job First—a scheduling policy that selects the waiting process with the smallest execution time to execute next.
PU: Processing Unit—generic term for a computation core like a CPU, GPU, or NPU.
Sub-stage: A fine-grained partition of a logical RAG stage (e.g., processing a batch of documents or generating a group of tokens) to expose parallelism.