Approximating Discrete Probability Distribution of Image Emotions by Multi-Modal Features Fusion

Approximating Discrete Probability Distribution of Image Emotions by Multi-Modal Features Fusion

Sicheng Zhao, Guiguang Ding, Yue Gao, Jungong Han

Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence
Main track. Pages 4669-4675. https://doi.org/10.24963/ijcai.2017/651

Existing works on image emotion recognition mainly assigned the dominant emotion category or average dimension values to an image based on the assumption that viewers can reach a consensus on the emotion of images. However, the image emotions perceived by viewers are subjective by nature and highly related to the personal and situational factors. On the other hand, image emotions can be conveyed by different features, such as semantics and aesthetics. In this paper, we propose a novel machine learning approach that formulates the categorical image emotions as a discrete probability distribution (DPD). To associate emotions with the extracted visual features, we present a weighted multi-modal shared sparse leaning to learn the combination coefficients, with which the DPD of an unseen image can be predicted by linearly integrating the DPDs of the training images. The representation abilities of different modalities are jointly explored and the optimal weight of each modality is automatically learned. Extensive experiments on three datasets verify the superiority of the proposed method, as compared to the state-of-the-art.
Keywords:
Uncertainty in AI: Approximate Probabilistic Inference
Robotics and Vision: Vision and Perception