Perception-Inspired Graph Convolution for Music Understanding Tasks
Perception-Inspired Graph Convolution for Music Understanding Tasks
Emmanouil Karystinaios, Francesco Foscarin, Gerhard Widmer
Proceedings of the Thirty-Third International Joint Conference on Artificial Intelligence
AI, Arts & Creativity. Pages 7681-7689.
https://doi.org/10.24963/ijcai.2024/850
We propose a new graph convolutional block, called MusGConv, specifically designed for the efficient processing of musical score data and motivated by general perceptual principles. It focuses on two fundamental dimensions of music, pitch and rhythm, and considers both relative and absolute representations of these components. We evaluate our approach on four different musical understanding problems: monophonic voice separation, harmonic analysis, cadence detection, and composer identification which, in abstract terms, translate to different graph learning problems, namely, node classification, link prediction, and graph classification. Our experiments demonstrate that MusGConv improves the performance on three of the aforementioned tasks while being conceptually very simple and efficient. We interpret this as evidence that it is beneficial to include perception-informed processing of fundamental musical concepts when developing graph network applications on musical score data. All code and models are released on https://github.com/manoskary/musgconv.
Keywords:
Application domains: Music and sound
Methods and resources: Computational implementations inspired by fields such as psychology or cognitive science
Methods and resources: Machine learning, deep learning, neural models, reinforcement learning