Explainable Inference on Sequential Data via Memory-Tracking
Biagio La Rosa, Roberto Capobianco, Daniele Nardi
Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence
Main track. Pages 2006-2013.
https://doi.org/10.24963/ijcai.2020/278
In this paper, we present a novel mechanism for
obtaining explanations that help to better understand
network predictions when dealing with sequential
data. Specifically, we adopt memory-based networks,
namely Differentiable Neural Computers (DNCs), to exploit
their capability of storing data in memory and
reusing it for inference. By tracking both the memory
accesses made at prediction time and the information
stored by the network at each step of the input
sequence, we can retrieve the input steps most
relevant to each prediction.
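To make the tracking idea concrete, here is a minimal, hypothetical sketch (the function name, the toy weightings, and the dot-product scoring are our illustrative assumptions, not the paper's implementation): each input step is scored by the overlap between the write weighting it produced and the read weighting used at prediction time.

```python
import numpy as np

def rank_input_steps(write_weightings, read_weighting):
    """Score each input step by how strongly the memory locations
    it wrote to are read at prediction time.

    write_weightings: (T, N) array, one write weighting over N
                      memory locations per input step.
    read_weighting:   (N,) array, the read weighting used at
                      prediction time.

    Returns step indices sorted from most to least relevant,
    plus the raw scores.
    """
    # Relevance of step t = overlap between what it wrote
    # and what the network reads when predicting.
    scores = write_weightings @ read_weighting  # shape (T,)
    return np.argsort(scores)[::-1], scores

# Toy example: 4 input steps, 8 memory locations.
rng = np.random.default_rng(0)
W = rng.dirichlet(np.ones(8), size=4)  # write weightings, rows sum to 1
r = np.zeros(8)
r[2], r[5] = 0.7, 0.3                  # read weighting at prediction time
order, scores = rank_input_steps(W, r)
print("steps ranked by relevance:", order)
```

In this toy setup, the steps whose write weightings concentrate on the locations the network reads (here, locations 2 and 5) come out on top.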
We validate our approach (1) on a modified T-maze, a
non-Markovian discrete control task that evaluates an
algorithm's ability to correlate events far apart in
its history, and (2) on the Story Cloze Test, a
commonsense reasoning framework for evaluating
story understanding, in which a system must
choose the correct ending to a four-sentence story.
Our results show that we are able to explain the agent's
decisions in (1) and to reconstruct the most relevant
sentences used by the network to select the story
ending in (2). Additionally, we show not only that
the network's prediction changes when those sentences
are removed, but also that they alone are sufficient to
reproduce the inference.
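These two checks can be sketched as a simple harness (hypothetical: `faithfulness_check`, `predict`, and the toy story are illustrative stand-ins for the trained DNC and the paper's experimental protocol):

```python
def faithfulness_check(sentences, relevant_idx, predict):
    """Two complementary tests of an explanation's faithfulness.

    sentences:    list of input sentences (the story).
    relevant_idx: set of indices flagged as relevant by memory-tracking.
    predict:      hypothetical stand-in for the trained model; maps
                  a list of sentences to a predicted ending.
    """
    full = predict(sentences)
    # Deletion test: removing the relevant sentences should change the output.
    without = [s for i, s in enumerate(sentences) if i not in relevant_idx]
    # Sufficiency test: the relevant sentences alone should reproduce it.
    only = [sentences[i] for i in sorted(relevant_idx)]
    return {
        "prediction_changes_on_removal": predict(without) != full,
        "relevant_alone_reproduces_it": predict(only) == full,
    }

# Toy stand-in: the "model" picks ending 1 iff the story mentions rain.
toy_predict = lambda story: 1 if any("rain" in s for s in story) else 0
story = ["Anna left home.", "It started to rain.",
         "She had no umbrella.", "She got soaked."]
print(faithfulness_check(story, {1}, toy_predict))
```

An explanation passes when both flags are True: the flagged sentences are necessary (removal flips the prediction) and sufficient (they reproduce it on their own).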
Keywords:
Machine Learning: Explainable Machine Learning
Machine Learning: Interpretability
Machine Learning: Deep Learning: Sequence Modeling
Machine Learning: Deep Learning