Reinforcement Learning Using Monte Carlo Policy Estimation for Disaster Mitigation

Mohammed Talat Khouj; Sarbjit Sarkaria; Cesar Lopez; Jose Marti

doi:10.1007/978-3-662-45355-1_11

Conference Papers Year : 2014

Reinforcement Learning Using Monte Carlo Policy Estimation for Disaster Mitigation

(1) , (1) , (1) , (1)

Mohammed Talat Khouj

Function : Author

University of British Columbia [Canada]

Sarbjit Sarkaria

Function : Author

University of British Columbia [Canada]

Cesar Lopez

Function : Author

University of British Columbia [Canada]

Jose Marti

Function : Author

University of British Columbia [Canada]

Abstract

Urban communities rely heavily on the system of interconnected critical infrastructures. The interdependencies in these complex systems give rise to vulnerabilities that must be considered in disaster mitigation planning. Only then will it be possible to address and mitigate major critical infrastructure disruptions in a timely manner.This paper describes an intelligent decision making system that optimizes the allocation of resources following an infrastructure disruption. The novelty of the approach arises from the application of Monte Carlo estimation for policy evaluation in reinforcement learning to draw on experiential knowledge gained from a massive number of simulations. This method enables a learning agent to explore and exploit the available trajectories, which lead to an optimum goal in a reasonable amount of time. The specific goal of the case study described in this paper is to maximize the number of patients discharged from two hospitals in the aftermath of an infrastructure disruption by intelligently utilizing the available resources. The results demonstrate that a learning agent, through interactions with an environment of simulated catastrophic scenarios, is capable of making informed decisions in a timely manner.

Keywords

Domains

Computer Science [cs]

Fichier principal

978-3-662-45355-1_11_Chapter.pdf (1.03 Mo)

Origin	Files produced by the author(s)

Hal Ifip : Connect in order to contact the contributor

https://inria.hal.science/hal-01386763

Submitted on : Monday, October 24, 2016-3:33:00 PM

Last modification on : Wednesday, December 11, 2024-2:58:03 PM

Dates and versions

hal-01386763 , version 1 (24-10-2016)

Licence

Attribution

Identifiers

HAL Id : hal-01386763 , version 1
DOI : 10.1007/978-3-662-45355-1_11

Cite

Mohammed Talat Khouj, Sarbjit Sarkaria, Cesar Lopez, Jose Marti. Reinforcement Learning Using Monte Carlo Policy Estimation for Disaster Mitigation. 8th International Conference on Critical Infrastructure Protection (ICCIP), Mar 2014, Arlington, United States. pp.155-172, ⟨10.1007/978-3-662-45355-1_11⟩. ⟨hal-01386763⟩

Reinforcement Learning Using Monte Carlo Policy Estimation for Disaster Mitigation

Abstract

Keywords

Domains

Dates and versions

Licence

Identifiers

Cite

Export

Collections

Altmetric

Share