ACAP: Adaptive Compute Acceleration Platform—a hybrid architecture combining CPU, FPGA (PL), and vector processors (AIE)
AIE: AI Engine—a VLIW vector processor core on Xilinx Versal chips optimized for compute-intensive math
PL: Programmable Logic—the traditional FPGA fabric used here for data orchestration and non-MM layers
PLIO: Programmable Logic Input/Output—interfaces connecting the AIE array to the PL fabric
Monolithic Accelerator: A single large accelerator design that processes all layers sequentially, often requiring padding for smaller layers
VLIW: Very Long Instruction Word—a CPU architecture that executes multiple instructions in parallel per cycle
Tiling: Splitting large matrices into smaller blocks to fit into on-chip cache/memory for data reuse
CDSE: CHARM Design Space Exploration—module for optimizing single accelerator parameters
CDAC: CHARM Diverse Accelerator Composer—module for partitioning resources among multiple accelerators