Learning Out-of-Vocabulary Words in Intelligent Personal Agents

Avik Ray, Yilin Shen, Hongxia Jin

Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence
Main track. Pages 4309-4315. https://doi.org/10.24963/ijcai.2018/599

Semantic parsers play a vital role in intelligent agents by converting natural language instructions into an actionable logical form representation. However, after deployment, these parsers suffer either from poor accuracy on encountering out-of-vocabulary (OOV) words, or from a significant accuracy drop on previously supported instructions after retraining. Learning OOV words while preserving accuracy on previously supported instructions is non-trivial. In this paper, we propose two novel neural network based parsers to learn OOV words: one incorporating a new hybrid paraphrase generation model, and the other an enhanced sequence-to-sequence model. Extensive experiments on both benchmark and custom datasets show that our new parsers achieve a significant accuracy gain on OOV words and phrases while maintaining accuracy on previously supported instructions.
Keywords:
Machine Learning: Neural Networks
Natural Language Processing: Natural Language Semantics
Natural Language Processing: NLP Applications and Tools
Machine Learning: Deep Learning