Deep Learning and Music Adversaries

Deep Learning and Music Adversaries
Corey Kereliuk, Bob L. Sturm, Jan Larsen
Abstract	An adversary is essentially an algorithm intent on making a classification system perform in some particular way given an input, e.g., increase the probability of a false negative. Recent work builds adversaries for deep learning systems applied to image object recognition, which exploits the parameters of the system to find the minimal perturbation of the input image such that the network misclassifies it with high confidence. We adapt this approach to construct and deploy an adversary of deep learning systems applied to music content analysis. In our case, however, the input to the systems is magnitude spectral frames, which requires special care in order to produce valid input audio signals from network-derived perturbations. For two different train-test partitionings of two benchmark datasets, and two different deep architectures, we find that this adversary is very effective in defeating the resulting systems. We find the convolutional networks are more robust, however, compared with systems based on a majority vote over individually classified audio frames. Furthermore, we integrate the adversary into the training of new deep systems, but do not find that this improves their resilience against the same adversary.
Keywords	deep nural networks, music information retrieval, content based processing, pattern recognition and claasicificastion
Type	Journal paper [With referee]
Journal	IEEE Transactions on Multimedia
Year	2015 Month November
Publisher	IEEE
ISBN / ISSN	DOI 10.1109/TMM.2015.2478068
Note	Aeepar in 'Deep Learning for Multimedia Computing' special section
Electronic version(s)	[pdf]
BibTeX data	[bibtex]
IMM Group(s)	Intelligent Signal Processing