Combining Knowledge with Deep Convolutional Neural Networks for Short Text Classification

Jin Wang, Zhongyuan Wang, Dawei Zhang, Jun Yan

Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence
Main track. Pages 2915-2921. https://doi.org/10.24963/ijcai.2017/406

Text classification is a fundamental task in NLP applications. Most existing work relies on either explicit or implicit text representation to address this problem. While these techniques work well for sentences, they cannot easily be applied to short text because of its brevity and sparsity. In this paper, we propose a framework based on convolutional neural networks that combines explicit and implicit representations of short text for classification. We first conceptualize a short text as a set of relevant concepts using a large taxonomy knowledge base. We then obtain the embedding of the short text by coalescing the words and relevant concepts on top of pre-trained word vectors. We further incorporate character-level features into our model to capture fine-grained subword information. Experimental results on five commonly used datasets show that our proposed method significantly outperforms state-of-the-art methods.
Keywords:
Machine Learning: Deep Learning
Natural Language Processing: Text Classification
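
The pipeline described in the abstract (conceptualize the short text, coalesce word and concept embeddings, then convolve) can be illustrated with a minimal sketch. Everything named here is an assumption made for illustration and is not taken from the paper: the tiny `TOY_TAXONOMY` dictionary stands in for the large taxonomy knowledge base, `toy_vector` stands in for a pre-trained word vector lookup, and stacking weighted concept rows under the word rows is only one possible way to coalesce the two representations. The character-level branch is omitted.

```python
import random

# Hypothetical toy taxonomy: term -> [(concept, weight)].
# A stand-in for the large taxonomy knowledge base mentioned in the abstract.
TOY_TAXONOMY = {
    "apple": [("fruit", 0.6), ("company", 0.4)],
    "ipad": [("device", 0.9)],
}

DIM = 4  # illustrative embedding size, not a value from the paper


def toy_vector(token, dim=DIM):
    """Deterministic stand-in for a pre-trained embedding lookup."""
    rng = random.Random(token)  # seeding by token keeps the vector stable
    return [rng.uniform(-1.0, 1.0) for _ in range(dim)]


def conceptualize(tokens, taxonomy=TOY_TAXONOMY):
    """Sum concept weights over all tokens found in the taxonomy."""
    scores = {}
    for tok in tokens:
        for concept, weight in taxonomy.get(tok, []):
            scores[concept] = scores.get(concept, 0.0) + weight
    # Highest-weighted concepts first.
    return sorted(scores.items(), key=lambda kv: -kv[1])


def embed_short_text(tokens):
    """Stack word vectors, then weight-scaled concept vectors, as matrix rows."""
    rows = [toy_vector(t) for t in tokens]
    for concept, weight in conceptualize(tokens):
        rows.append([weight * x for x in toy_vector("concept:" + concept)])
    return rows


def conv_max_pool(matrix, kernel):
    """Apply one filter over sliding row windows, ReLU, then max-over-time pooling."""
    width = len(kernel)
    activations = []
    for start in range(len(matrix) - width + 1):
        total = sum(
            k * x
            for k_row, m_row in zip(kernel, matrix[start:start + width])
            for k, x in zip(k_row, m_row)
        )
        activations.append(max(0.0, total))
    return max(activations)


tokens = "apple ipad release".split()
matrix = embed_short_text(tokens)                        # 3 word rows + 3 concept rows
kernel = [toy_vector("filter-%d" % i) for i in range(2)]  # one filter spanning 2 rows
feature = conv_max_pool(matrix, kernel)                  # one scalar feature per filter
```

In a full model, many such filters of several widths would each yield one pooled feature, and the resulting feature vector would feed a classifier layer; the sketch shows only a single untrained filter to make the data flow concrete.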