Fabio Persia (University of Naples, Italy) - Dr Multimedia Communications
Date: May 7th 2013 Location: Eurecom - Eurecom
Abstract: There are numerous applications where we wish to discover unexpected activities in a sequence of time-stamped observation data---for instance, we may want to detect inexplicable events in transactions at a web site or in video of an airport tarmac. In this approach, we start with a known set A of activities (both innocuous and dangerous) that we wish to monitor. However, in addition, we wish to identify "unexplained" subsequences in an observation sequence that are poorly explained (e.g., because they may contain occurrences of activities that have never been seen or anticipated before, i.e. they are not in A). We formally define the probability that a sequence of observations is unexplained (totally or partially) w.r.t. A. We develop efficient algorithms to identify the top-k Totally and Partially Unexplained Sequences w.r.t. A. These algorithms leverage theorems that enable us to speed up the search for totally/partially unexplained sequences. We describe experiments using real-world video and cyber security datasets showing that our approach works well in practice in terms of both running time and accuracy.