Optimal Security Protection Strategy Selection Model Based on Q-Learning Particle Swarm Optimization

Xin Gao; Yang Zhou; Lijuan Xu; Dawei Zhao

doi:10.3390/e24121727

Optimal Security Protection Strategy Selection Model Based on Q-Learning Particle Swarm Optimization

Entropy (Basel). 2022 Nov 25;24(12):1727. doi: 10.3390/e24121727.

Authors

Xin Gao¹, Yang Zhou¹, Lijuan Xu¹, Dawei Zhao¹

Affiliation

¹ Shandong Provincial Key Laboratory of Computer Networks, Shandong Computer Science Center (National Supercomputer Center in Jinan), Qilu University of Technology (Shandong Academy of Sciences), Jinan 250014, China.

Abstract

With the rapid development of Industrial Internet of Things technology, the industrial control system (ICS) faces more and more security threats, which may lead to serious risks and extensive damage. Naturally, it is particularly important to construct efficient, robust, and low-cost protection strategies for ICS. However, how to construct an objective function of optimal security protection strategy considering both the security risk and protection cost, and to find the optimal solution, are all significant challenges. In this paper, we propose an optimal security protection strategy selection model and develop an optimization framework based on Q-Learning particle swarm optimization (QLPSO). The model performs security risk assessment of ICS by introducing the protection strategy into the Bayesian attack graph. The QLPSO adopts the Q-Learning to improve the local optimum, insufficient diversity, and low precision of the PSO algorithm. Simulations are performed on a water distribution ICS, and the results verify the validity and feasibility of our proposed model and the QLPSO algorithm.

Keywords: Bayesian attack graph; Q-Learning; optimal protection strategy; particle swarm optimization.

Abstract

Grants and funding