Network Schema Preserving Heterogeneous Information Network Embedding

Network Schema Preserving Heterogeneous Information Network Embedding

Jianan Zhao, Xiao Wang, Chuan Shi, Zekuan Liu, Yanfang Ye

Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence
Main track. Pages 1366-1372. https://doi.org/10.24963/ijcai.2020/190

As heterogeneous networks have become increasingly ubiquitous, Heterogeneous Information Network (HIN) embedding, aiming to project nodes into a low-dimensional space while preserving the heterogeneous structure, has drawn increasing attention in recent years. Many of the existing HIN embedding methods adopt meta-path guided random walk to retain both the semantics and structural correlations between different types of nodes. However, the selection of meta-paths is still an open problem, which either depends on domain knowledge or is learned from label information. As a uniform blueprint of HIN, the network schema comprehensively embraces the high-order structure and contains rich semantics. In this paper, we make the first attempt to study network schema preserving HIN embedding, and propose a novel model named NSHE. In NSHE, a network schema sampling method is first proposed to generate sub-graphs (i.e., schema instances), and then multi-task learning task is built to preserve the heterogeneous structure of each schema instance. Besides preserving pairwise structure information, NSHE is able to retain high-order structure (i.e., network schema). Extensive experiments on three real-world datasets demonstrate that our proposed model NSHE significantly outperforms the state-of-the-art methods.
Keywords:
Data Mining: Mining Graphs, Semi Structured Data, Complex Data
Data Mining: Feature Extraction, Selection and Dimensionality Reduction
Data Mining: Clustering, Unsupervised Learning
Machine Learning: Deep Learning: Convolutional networks