F. Weninger, B. Schuller, C. Liem, F. Kurth, and A. Hanjalic, “Music Information Retrieval: An Inspirational Guide to Transfer from Related Disciplines“, In Multimodal Music Processing, M. Müller, M. Goto, and M. Schedl (eds), 2012.
Abstract:
The emerging field of Music Information Retrieval (MIR) has been influenced by neighboring domains in signal processing and machine learning, including automatic speech recognition, image processing and text information retrieval. In this contribution, we start with concrete examples for methodology transfer between speech and music processing, oriented on the building blocks of pattern recognition: preprocessing, feature extraction, and classification/decoding. We then assume a higher level viewpoint when describing sources of mutual inspiration derived from text and image information retrieval. We conclude that dealing with the peculiarities of music in MIR research has contributed to advancing the state-of-the-art in other fields, and that many future challenges in MIR are strikingly similar to those that other research areas have been facing.