FreqFormer: Frequency-aware Transformer for Lightweight Image Super-resolution

Tao Dai; Jianping Wang; Hang Guo; Jinmin Li; Jinbao Wang; Zexuan Zhu

doi:10.24963/ijcai.2024/81

FreqFormer: Frequency-aware Transformer for Lightweight Image Super-resolution

Tao Dai, Jianping Wang, Hang Guo, Jinmin Li, Jinbao Wang, Zexuan Zhu

Proceedings of the Thirty-Third International Joint Conference on Artificial Intelligence

Main Track. Pages 731-739. https://doi.org/10.24963/ijcai.2024/81

PDF BibTeX

Transformer-based models have been widely and successfully used in various low-vision visual tasks, and have achieved remarkable performance in single image super-resolution (SR). Despite the significant progress in SR, Transformer-based SR methods (e.g., SwinIR) still suffer from the problems of heavy computation cost and low-frequency preference, while ignoring the reconstruction of rich high-frequency information, hence hindering the representational power of Transformers. To address these issues, in this paper, we propose a novel Frequency-aware Transformer (FreqFormer) for lightweight image SR. Specifically, a Frequency Division Module (FDM) is first introduced to separately handle high- and low-frequency information in a divide-and-conquer manner. Moreover, we present Frequency-aware Transformer Block (FTB) to extracting both spatial frequency attention and channel transposed attention to recover high-frequency details. Extensive experimental results on public datasets demonstrate the superiority of our FreqFormer over state-of-the-art SR methods in terms of both quantitative metrics and visual quality. Code and models are available at https://github.com/JPWang-CS/FreqFormer.

Keywords:

Computer Vision: CV: Applications

Computer Vision: CV: Image and video synthesis and generation

Computer Vision: CV: Interpretability and transparency

Computer Vision: CV: Machine learning for vision