Online Learning Methods for Border Patrol Resource Allocation

R Klima, C Kiekintveld, V Lisy

In Conference on Decision and Game Theory for Security (GameSec). 2014.

This is the author's version of the work.

Download

Abstract

We introduce a model for border security resource allocation with repeated interactions between attackers and defenders. The defender must learn the optimal resource allocation strategy based on historical apprehension data, balancing exploration and exploitation in the policy. We experiment with several solution methods for this online learning problem including UCB, sliding-window UCB, and EXP3. We test the learning methods against several different classes of attackers including attacker with randomly varying strategies and attackers who react adversarially to the defender's strategy. We present experimental data to identify the optimal parameter settings for these algorithms and compare the algorithms against the different types of attackers.