Tensor rematerialization: Saving memory by deleting intermediate activations during forward pass and recomputing them during backward pass
Memory fragmentation: When free memory exists but is divided into small, non-contiguous blocks that cannot satisfy a large allocation request
DTR: Dynamic Tensor Rematerialization—a baseline method that greedily evicts tensors based on a heuristic (staleness, cost, size) without considering memory addresses
DTE: Dynamic Tensor Evicting—a method extending DTR that considers adjacent free blocks but still relies on heuristics that may fail to produce contiguous chunks
Sliding window algorithm: Coop's method to find a contiguous sequence of tensors to evict by moving a window over the memory address list
Cost density: A metric (compute cost / memory size) used to classify tensors as 'cheap' or 'expensive' for partitioning
Recomputable in-place: A mechanism allowing in-place mutation of tensors while maintaining the ability to recompute their original values, avoiding copy-on-write overhead