Cross-Domain Facial Expression Recognition via Disentangling Identity Representation

Cross-Domain Facial Expression Recognition via Disentangling Identity Representation

Tong Liu, Jing Li, Jia Wu, Lefei Zhang, Shanshan Zhao, Jun Chang, Jun Wan

Proceedings of the Thirty-Second International Joint Conference on Artificial Intelligence
Main Track. Pages 1213-1221. https://doi.org/10.24963/ijcai.2023/135

Most existing cross-domain facial expression recognition (FER) works require target domain data to assist the model in analyzing distribution shifts to overcome negative effects. However, it is often hard to obtain expression images of the target domain in practical applications. Moreover, existing methods suffer from the interference of identity information, thus limiting the discriminative ability of the expression features. We exploit the idea of domain generalization (DG) and propose a representation disentanglement model to address the above problems. Specifically, we learn three independent potential subspaces corresponding to the domain, expression, and identity information from facial images. Meanwhile, the extracted expression and identity features are recovered as Fourier phase information reconstructed images, thereby ensuring that the high-level semantics of images remain unchanged after disentangling the domain information. Our proposed method can disentangle expression features from expression-irrelevant ones (i.e., identity and domain features). Therefore, the learned expression features exhibit sufficient domain invariance and discriminative ability. We conduct experiments with different settings on multiple benchmark datasets, and the results show that our method achieves superior performance compared with state-of-the-art methods.
Keywords:
Computer Vision: CV: Recognition (object detection, categorization)
Computer Vision: CV: Adversarial learning, adversarial attack and defense methods
Computer Vision: CV: Representation learning