Structure-Aware Spatial-Temporal Interaction Network for Video Shadow Detection

Housheng Wei; Guanyu Xing; Jingwei Liao; Yanci Zhang; Yanli Liu

doi:10.24963/ijcai.2024/158

Structure-Aware Spatial-Temporal Interaction Network for Video Shadow Detection

Housheng Wei, Guanyu Xing, Jingwei Liao, Yanci Zhang, Yanli Liu

Proceedings of the Thirty-Third International Joint Conference on Artificial Intelligence

Main Track. Pages 1425-1433. https://doi.org/10.24963/ijcai.2024/158

PDF BibTeX

Video shadow detection faces significant challenges due to ambiguous semantics and variable shapes. Existing video shadow detection algorithms typically overlook the fine shadow details, resulting in inconsistent detection between consecutive frames in complex real-world video scenarios. To address this issue, we propose a spatial-temporal feature interaction strategy, which refines and enhances global shadow semantics with local prior features in the modeling of shadow relations between frames. Moreover, a structure-aware shadow prediction module is proposed, which focuses on modeling the distance relation between local shadow edges and regions. Quantitative experimental results demonstrate that our approach significantly outperforms the state-of-the-art methods, providing stable and consistent shadow detection results in complex video shadow scenarios.

Keywords:

Computer Vision: CV: Recognition (object detection, categorization)

Computer Vision: CV: Scene analysis and understanding

Computer Vision: CV: Segmentation

Computer Vision: CV: Video analysis and understanding