End-to-End Multi-Perspective Matching for Entity Resolution
End-to-End Multi-Perspective Matching for Entity Resolution
Cheng Fu, Xianpei Han, Le Sun, Bo Chen, Wei Zhang, Suhui Wu, Hao Kong
Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence
Main track. Pages 4961-4967.
https://doi.org/10.24963/ijcai.2019/689
Entity resolution (ER) aims to identify data records referring to the same real-world entity. Due to the heterogeneity of entity attributes and the diversity of similarity measures, one main challenge of ER is how to select appropriate similarity measures for different attributes. Previous ER methods usually employ heuristic similarity selection algorithms, which are highly specialized to specific ER problems and are hard to be generalized to other situations. Furthermore, previous studies usually perform similarity learning and similarity selection independently, which often result in error propagation and are hard to be optimized globally. To resolve the above problems, this paper proposes an end-to-end multi-perspective entity matching model, which can adaptively select optimal similarity measures for heterogenous attributes by jointly learning and selecting similarity measures in an end-to-end way. Experiments on two real-world datasets show that our method significantly outperforms previous ER methods.
Keywords:
Natural Language Processing: Information Extraction
Multidisciplinary Topics and Applications: Databases
Natural Language Processing: Coreference Resolution