Logo image
Multimodal Sparsity-Eager Support Vector Machines for Music Classification
Conference proceeding

Multimodal Sparsity-Eager Support Vector Machines for Music Classification

Kamelia Aryafar and Ali Shokoufandeh
2014 13th International Conference on Machine Learning and Applications, pp 405-408
Dec 2014

Abstract

Accuracy audio classification Multimedia communication multimodal Multiple signal classification Music Support vector machines Training Vectors
As the demand for multimedia grows, the development of information retrieval systems utilizing all available data modalities becomes of paramount importance. The provision of multiple modalities is motivated by usability, presence of noise in one modality and non-universality of a single modality. Radio stations and music TV channels hold archives of millions of music tapes and lyrics. Gigabytes of music files are also spread over the web along with the lyrics and metadata for each file. Searching and organizing large scale multimodal datasets is a challenging task. Supervised methods such as support vector machine (SVM) achieve state of the art performance for music classification on single modality, but suffer from over-fitting on training examples and limitations of single modality approaches. In this paper, we introduce a classifier fusion of multimodal audio and lyrics data to address these single modality classification limitations. We introduce the multimodal l 1 -SVM classifier, that utilizes sparse methods to deal with over-fitting for music classification. We compare the classification accuracy of the fusion classifier for a genre classification task in a large public dataset with single modality l 1 -SVM.

Metrics

10 Record Views
5 citations in Scopus

Details

UN Sustainable Development Goals (SDGs)

This publication has contributed to the advancement of the following goals:

#4 Quality Education

InCites Highlights

Data related to this publication, from InCites Benchmarking & Analytics tool:

Web of Science research areas
Computer Science, Artificial Intelligence
Computer Science, Interdisciplinary Applications
Logo image