The role of value of information in multi-agent deep reinforcement learning for optimal decision-making under uncertainty
Item Type:Conference Paper
Citation:Mohammad Saifullah, Charalampos Andriotis, Konstantinos Papakonstantinou, The role of value of information in multi-agent deep reinforcement learning for optimal decision-making under uncertainty, 14th International Conference on Applications of Statistics and Probability in Civil Engineering (ICASP14), Dublin, Ireland, 2023.
submission_488.pdf (PDF) 1.189Mb
To preserve structural safety of deteriorating engineering systems through optimal maintenance, it is imperative to efficiently integrate structural health information with decision-making optimization frameworks. Although there may be abundance of available data, these are often uncertain and incomplete. In addition, joint inspection and maintenance (I&M) optimization is inherently complex due to high-dimensional state and action spaces, stochastic objectives, long planning horizons, and various constraints, among others. As shown recently, these computational challenges can be effectively addressed within the optimization principles of Partially Observable Markov Decision Processes (POMDPs) and constrained Deep Reinforcement Learning (DRL). The POMDP framework provides a way of updating the decision-maker's perception about the system state by naturally incorporating the Value of Information (VoI) in the optimality equations. As such, optimal observation-gathering actions are those which guide maintenance decisions towards reduced life-cycle costs and risks. The role of VoI in DRL-driven I&M has also been shown to be central to the formation of policy gradients, which are necessary to obtain the optimal I&M plan with deep learning actor-critic architectures. Leveraging this property, a recently devised DRL architecture is further examined in this work, consisting of fully decoupled 'maintainer' and 'inspector' actors, which allow for greater efficacy and interpretability in multi-agent DRL training and inference in I&M. Several numerical analyses are carried out to assess the performance of the relevant architectures on stochastic systems with a varying number of components, multiple maintenance-inspection actions per component, and system-level failure risks.
Other Titles:14th International Conference on Applications of Statistics and Probability in Civil Engineering(ICASP14)
Type of material:Conference Paper
Series/Report no:14th International Conference on Applications of Statistics and Probability in Civil Engineering(ICASP14)
Availability:Full text available