Social Media-based User Embedding: A Literature Review

Social Media-based User Embedding: A Literature Review

Shimei Pan, Tao Ding

Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence
Survey track. Pages 6318-6324. https://doi.org/10.24963/ijcai.2019/881

Automated representation learning is behind many recent success stories in machine learning. It is often used to transfer knowledge learned from a large dataset (e.g., raw text) to tasks for which only a small number of training examples are available. In this paper, we review recent advance in learning to represent social media users in low-dimensional embeddings. The technology is critical for creating high performance social media-based human traits and behavior models since the ground truth for assessing latent human traits and behavior is often expensive to acquire at a large scale. In this survey, we review typical methods for learning a unified user embeddings from heterogeneous user data (e.g., combines social media texts with images to learn a unified user representation). Finally we point out some current issues and future directions.
Keywords:
Humans and AI: Personalization and User Modeling
Machine Learning Applications: Applications of Unsupervised Learning
Natural Language Processing: Embeddings
Multidisciplinary Topics and Applications: Social Sciences