PPO: Proximal Policy Optimization—an RL algorithm that improves training stability by limiting how much the policy can change in each update using a clipped objective.
TabTransformer: A Transformer architecture designed for tabular data that uses self-attention to learn contextual embeddings for categorical features.
IIoT: Industrial Internet of Things—interconnected sensors, instruments, and other devices networked together with computers' industrial applications.
MitM: Man-in-the-Middle attack—a cyberattack where the attacker secretly relays and possibly alters the communications between two parties who believe they are directly communicating.
GAE: Generalized Advantage Estimation—a method in RL to reduce variance in advantage estimates by exponentially weighting rewards over time.
Macro F1-score: An average of F1-scores calculated for each class individually, treating all classes equally regardless of their frequency (good for imbalanced data).
F1-score: The harmonic mean of precision and recall, providing a balanced metric for classification performance.