Who Looks like Me: Semantic Routed Image Harmonization

Jinsheng Sun, Chao Yao, Xiaokun Wang, Yu Guo, Yalan Zhang, Xiaojuan Ban

Proceedings of the Thirty-Third International Joint Conference on Artificial Intelligence
Main Track. Pages 1308-1316. https://doi.org/10.24963/ijcai.2024/145

Image harmonization, which aims to seamlessly blend extraneous foreground objects with background images, is a promising and challenging task. Ensuring that a synthetic image appears realistic requires maintaining consistency in visual characteristics, such as texture and style, across global and semantic regions. In this paper, we approach image harmonization as a semantic routed style transfer problem and propose an image harmonization model that routes semantic similarity explicitly to enhance the consistency of appearance characteristics. To refine the calculation of similarity between the composed foreground and background instances, we propose an Instance Similarity Evaluation Module (ISEM). To harness analogous semantic information effectively, we further introduce a Style Transfer Block (STB) to establish fine-grained foreground-background semantic correlation. Our method achieves excellent experimental results on existing datasets, and our model outperforms the state of the art by a margin of 0.45 dB on the iHarmony4 dataset.
Keywords:
Computer Vision: CV: Image and video synthesis and generation 
Computer Vision: CV: Computational photography
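
The idea of routing by semantic similarity described in the abstract can be illustrated with a minimal sketch. This is not the authors' ISEM or STB: it assumes a generic feature map, boolean instance masks, masked average pooling, and a cosine-similarity softmax, and the names `masked_mean`, `route_by_similarity`, and `temperature` are hypothetical choices made only for this illustration.

```python
import numpy as np

def masked_mean(features, mask):
    """Average the C-dimensional features over pixels where mask is True.

    features: (H, W, C) float array; mask: (H, W) boolean array (non-empty).
    """
    return features[mask].mean(axis=0)

def route_by_similarity(features, fg_mask, bg_masks, temperature=0.1):
    """Return softmax weights over background instances for one foreground.

    Assumption for illustration: similarity is the cosine between pooled
    region descriptors; the paper's actual module may differ.
    """
    fg = masked_mean(features, fg_mask)
    bgs = np.stack([masked_mean(features, m) for m in bg_masks])

    # Normalize descriptors so the dot product is a cosine similarity.
    fg_n = fg / (np.linalg.norm(fg) + 1e-8)
    bg_n = bgs / (np.linalg.norm(bgs, axis=1, keepdims=True) + 1e-8)
    sims = bg_n @ fg_n

    # Temperature-scaled softmax, shifted by the max for numerical stability.
    logits = sims / temperature
    logits = logits - logits.max()
    weights = np.exp(logits)
    return weights / weights.sum()
```

Under these assumptions, background instances whose pooled features point in a similar direction to the foreground's receive larger weights, so their appearance statistics would contribute more to any subsequent style transfer step.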