Skip to main content

14.05.2024

Fisher Discriminative Embedding Low-Rank Sparse Representation for Music Genre Classification

verfasst von: Xin Cai, Hongjuan Zhang

Erschienen in: Circuits, Systems, and Signal Processing

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

This work focuses on a music genre classification method based on a sparse low-rank representation. Sparse low-rank representation is an effective method for learning classifiers, which aims to learn a row-sparse low-rank representation matrix to effectively ignore noise and identify subspace structures in data contaminated by outliers. However, these related methods fail to utilize the discriminative information to mine the rich supervision information available in the training samples. To address this issue, a novel Fisher Discriminative Embedding Low-Rank Sparse Representation (FDLRSR) classification algorithm is proposed based on the Fisher criterion, which results in stronger intra-class similarity and inter-class separability representation coefficients. Meanwhile, its two special cases, i.e., the Fisher Discriminative Embedding Low-Rank Representation (FDLR) and Fisher Discriminative Embedding Sparse Representation (FDSR) are also presented in this work. Specifically, the proposed classification method employs the FDLRSR algorithm coupled with the feature combinations consisting acoustic features and spectral features for music genre classification tasks by minimizing the residuals. Compared with the several state-of-the-art music genre classification methods, the proposed methods substantially improve the classification results on three widely used datasets, the GTZAN, ISMIR2004 and Homburg datasets, with the highest classification accuracies of 97.9% and 99.43%, which verify its effectiveness and availability.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

ATZelektronik

Die Fachzeitschrift ATZelektronik bietet für Entwickler und Entscheider in der Automobil- und Zulieferindustrie qualitativ hochwertige und fundierte Informationen aus dem gesamten Spektrum der Pkw- und Nutzfahrzeug-Elektronik. 

Lassen Sie sich jetzt unverbindlich 2 kostenlose Ausgabe zusenden.

ATZelectronics worldwide

ATZlectronics worldwide is up-to-speed on new trends and developments in automotive electronics on a scientific level with a high depth of information. 

Order your 30-days-trial for free and without any commitment.

Weitere Produktempfehlungen anzeigen
Literatur
2.
Zurück zum Zitat B.E. Boser, I.M. Guyon, V.N. Vapnik, A training algorithm for optimal margin classifiers. In Proceedings of the Fifth Annual Workshop on Computational Learning Theory, pages 144–152, (1992) B.E. Boser, I.M. Guyon, V.N. Vapnik, A training algorithm for optimal margin classifiers. In Proceedings of the Fifth Annual Workshop on Computational Learning Theory, pages 144–152, (1992)
3.
Zurück zum Zitat S. Boyd, N. Parikh, E. Chu, B. Peleato, J. Eckstein et al., Distributed optimization and statistical learning via the alternating direction method of multipliers. Found. Trends Mach.® learn. 3(1), 1–122 (2011) S. Boyd, N. Parikh, E. Chu, B. Peleato, J. Eckstein et al., Distributed optimization and statistical learning via the alternating direction method of multipliers. Found. Trends Mach.® learn. 3(1), 1–122 (2011)
4.
Zurück zum Zitat J.F. Cai, E.J. Candès, Z. Shen, A singular value thresholding algorithm for matrix completion. SIAM J. Optim. 20(4), 1956–1982 (2010)MathSciNetCrossRef J.F. Cai, E.J. Candès, Z. Shen, A singular value thresholding algorithm for matrix completion. SIAM J. Optim. 20(4), 1956–1982 (2010)MathSciNetCrossRef
5.
Zurück zum Zitat E.J. Candès, X. Li, Y. Ma, J. Wright, Robust principal component analysis. J. ACM (JACM) 58(3), 1–37 (2011)MathSciNetCrossRef E.J. Candès, X. Li, Y. Ma, J. Wright, Robust principal component analysis. J. ACM (JACM) 58(3), 1–37 (2011)MathSciNetCrossRef
6.
Zurück zum Zitat P. Cano, E. Gómez, F. Gouyon, P. Herrera, M. Koppenberger, B. Ong, X. Serra, S. Streich, N. Wack, ISMIR 2004 audio description contest. Tech. Report. Music Technol. Group, Bracelona, Spain 01, 2006 (2004) P. Cano, E. Gómez, F. Gouyon, P. Herrera, M. Koppenberger, B. Ong, X. Serra, S. Streich, N. Wack, ISMIR 2004 audio description contest. Tech. Report. Music Technol. Group, Bracelona, Spain 01, 2006 (2004)
8.
Zurück zum Zitat S.S. Chen, D.L. Donoho, M.A. Saunders, Atomic decomposition by basis pursuit. SIAM Rev. 43(1), 129–159 (2001)MathSciNetCrossRef S.S. Chen, D.L. Donoho, M.A. Saunders, Atomic decomposition by basis pursuit. SIAM Rev. 43(1), 129–159 (2001)MathSciNetCrossRef
9.
Zurück zum Zitat Z. Chen, W. XiaoJun, J. Kittler, Low-rank discriminative least squares regression for image classification. Signal Process. 173, 107485 (2020)CrossRef Z. Chen, W. XiaoJun, J. Kittler, Low-rank discriminative least squares regression for image classification. Signal Process. 173, 107485 (2020)CrossRef
10.
Zurück zum Zitat D.C. Corrèa, F.A. Rodrigues, A survey on symbolic data-based music genre classification. Expert Syst. Appl. 60, 190–210 (2016)CrossRef D.C. Corrèa, F.A. Rodrigues, A survey on symbolic data-based music genre classification. Expert Syst. Appl. 60, 190–210 (2016)CrossRef
11.
Zurück zum Zitat Y.M.G. Costa, L.S. Oliveira, A.L. Koerich, F. Gouyon, Music genre recognition using spectrograms. In 2011 18th International Conference on Systems, Signals and Image Processing, pages 1–4, (07 2011) Y.M.G. Costa, L.S. Oliveira, A.L. Koerich, F. Gouyon, Music genre recognition using spectrograms. In 2011 18th International Conference on Systems, Signals and Image Processing, pages 1–4, (07 2011)
12.
Zurück zum Zitat Y.M.G. Costa, L.S. Oliveira, A.L. Koerich, F. Gouyon, Music genre recognition using gabor filters and lpq texture descriptors. Prog. Pattern Recognit. Image Anal. Comput. Vis. and Appl. 8259, 67–74 (2013) Y.M.G. Costa, L.S. Oliveira, A.L. Koerich, F. Gouyon, Music genre recognition using gabor filters and lpq texture descriptors. Prog. Pattern Recognit. Image Anal. Comput. Vis. and Appl. 8259, 67–74 (2013)
13.
Zurück zum Zitat Y.M.G. Costa, L.S. Oliveira, A.L. Koerich, F. Gouyon, J.G. Martins, Music genre classification using lbp textural features. Signal Process. 92(11), 2723–2737 (2012)CrossRef Y.M.G. Costa, L.S. Oliveira, A.L. Koerich, F. Gouyon, J.G. Martins, Music genre classification using lbp textural features. Signal Process. 92(11), 2723–2737 (2012)CrossRef
14.
Zurück zum Zitat T. Cover, P. Hart, Nearest neighbor pattern classification. IEEE Trans. Inf. Theory 13(1), 21–27 (1967)CrossRef T. Cover, P. Hart, Nearest neighbor pattern classification. IEEE Trans. Inf. Theory 13(1), 21–27 (1967)CrossRef
15.
Zurück zum Zitat D. Haishun, Y. Wang, F. Zhang, Y. Zhou, Low-rank discriminative adaptive graph preserving subspace learning. Neural Process. Lett. 52(3), 2127–2149 (2020)CrossRef D. Haishun, Y. Wang, F. Zhang, Y. Zhou, Low-rank discriminative adaptive graph preserving subspace learning. Neural Process. Lett. 52(3), 2127–2149 (2020)CrossRef
16.
Zurück zum Zitat A. Elbir, N. Aydin, Music genre classification and music recommendation by using deep learning. Electron. Lett. 56(12), 627–629 (2020)CrossRef A. Elbir, N. Aydin, Music genre classification and music recommendation by using deep learning. Electron. Lett. 56(12), 627–629 (2020)CrossRef
17.
Zurück zum Zitat Z. Fu, G. Lu, K.M. Ting, D. Zhang, A survey of audio-based music classification and annotation. IEEE Trans. Multimedia 13(2), 303–319 (2011)CrossRef Z. Fu, G. Lu, K.M. Ting, D. Zhang, A survey of audio-based music classification and annotation. IEEE Trans. Multimedia 13(2), 303–319 (2011)CrossRef
19.
Zurück zum Zitat Y.F. Guo, S.J. Li, J.Y. Yang, T.T. Shu, W. LiDe, A generalized foley-sammon transform based on generalized fisher discriminant criterion and its application to face recognition. Pattern Recogn. Lett. 24(1–3), 147–158 (2003)CrossRef Y.F. Guo, S.J. Li, J.Y. Yang, T.T. Shu, W. LiDe, A generalized foley-sammon transform based on generalized fisher discriminant criterion and its application to face recognition. Pattern Recogn. Lett. 24(1–3), 147–158 (2003)CrossRef
20.
Zurück zum Zitat N. Han, W. Jigang, Y. Liang, X. Fang, W.K. Wong, S. Teng, Low-rank and sparse embedding for dimensionality reduction. Neural Netw. 108, 202–216 (2018)CrossRef N. Han, W. Jigang, Y. Liang, X. Fang, W.K. Wong, S. Teng, Low-rank and sparse embedding for dimensionality reduction. Neural Netw. 108, 202–216 (2018)CrossRef
21.
Zurück zum Zitat H. Homburg, I. Mierswa, B. Möller, K. Morik, M. Wurst, A benchmark dataset for audio classification and clustering. In ISMIR 2005, 528–531 (2005) H. Homburg, I. Mierswa, B. Möller, K. Morik, M. Wurst, A benchmark dataset for audio classification and clustering. In ISMIR 2005, 528–531 (2005)
22.
Zurück zum Zitat C.-H. Lee, J.-L. Shih, Yu. Kun-Ming, H.-S. Lin, Automatic music genre classification based on modulation spectral analysis of spectral and cepstral features. IEEE Trans. Multimedia 11, 670–682 (2009)CrossRef C.-H. Lee, J.-L. Shih, Yu. Kun-Ming, H.-S. Lin, Automatic music genre classification based on modulation spectral analysis of spectral and cepstral features. IEEE Trans. Multimedia 11, 670–682 (2009)CrossRef
23.
Zurück zum Zitat A. Li, D. Chen, W. Zhiqiang, G. Sun, K. Lin, Self-supervised sparse coding scheme for image classification based on low rank representation. PLoS ONE 13(6), e0199141 (2018)CrossRef A. Li, D. Chen, W. Zhiqiang, G. Sun, K. Lin, Self-supervised sparse coding scheme for image classification based on low rank representation. PLoS ONE 13(6), e0199141 (2018)CrossRef
24.
Zurück zum Zitat H. Li, T. Jiang, K. Zhang, Efficient and robust feature extraction by maximum margin criterion. IEEE Trans. Neural Netw. 17(1), 157–165 (2006)CrossRef H. Li, T. Jiang, K. Zhang, Efficient and robust feature extraction by maximum margin criterion. IEEE Trans. Neural Netw. 17(1), 157–165 (2006)CrossRef
25.
Zurück zum Zitat T. Li, M. Ogihara, Toward intelligent music information retrieval. IEEE Trans. Multimedia 8(3), 564–574 (2006)CrossRef T. Li, M. Ogihara, Toward intelligent music information retrieval. IEEE Trans. Multimedia 8(3), 564–574 (2006)CrossRef
26.
Zurück zum Zitat T.L. Li , A.B. Chan, Genre classification and the invariance of mfcc features to key and tempo. In International Conference on MultiMedia Modeling, pages 317–327. Springer (2011) T.L. Li , A.B. Chan, Genre classification and the invariance of mfcc features to key and tempo. In International Conference on MultiMedia Modeling, pages 317–327. Springer (2011)
27.
Zurück zum Zitat T. Lidy, A. Rauber, Evaluation of feature extractors and psycho-acoustic transformations for music genre classification. In Proceedings of the Sixth International Conference on Music Information Retrieval (ISMIR 2005), pages 34–41, September 11-15 (2005) T. Lidy, A. Rauber, Evaluation of feature extractors and psycho-acoustic transformations for music genre classification. In Proceedings of the Sixth International Conference on Music Information Retrieval (ISMIR 2005), pages 34–41, September 11-15 (2005)
28.
Zurück zum Zitat S. Lim, J. Lee, S. Jang, S. Lee, M.Y. Kim, Music-genre classification system based on spectro-temporal features and feature selection. IEEE Trans. Consum. Electron. 58(4), 1262–1268 (2012)CrossRef S. Lim, J. Lee, S. Jang, S. Lee, M.Y. Kim, Music-genre classification system based on spectro-temporal features and feature selection. IEEE Trans. Consum. Electron. 58(4), 1262–1268 (2012)CrossRef
29.
Zurück zum Zitat Z. Lin, M. Chen, Y. Ma, The augmented lagrange multiplier method for exact recovery of corrupted low-rank matrices. arXiv preprint arXiv:1009.5055, (2010) Z. Lin, M. Chen, Y. Ma, The augmented lagrange multiplier method for exact recovery of corrupted low-rank matrices. arXiv preprint arXiv:​1009.​5055, (2010)
30.
Zurück zum Zitat Z. Lin, R. Liu, Z. Su, Linearized alternating direction method with adaptive penalty for low-rank representation. arXiv preprint arXiv:1109.0367, (2011) Z. Lin, R. Liu, Z. Su, Linearized alternating direction method with adaptive penalty for low-rank representation. arXiv preprint arXiv:​1109.​0367, (2011)
31.
Zurück zum Zitat C. Liu, L. Feng, G. Liu, H. Wang, S. Liu, Bottom-up broadcast neural network for music genre classification. Multimed. Tools Appl. 80(5), 7313–7331 (2021)CrossRef C. Liu, L. Feng, G. Liu, H. Wang, S. Liu, Bottom-up broadcast neural network for music genre classification. Multimed. Tools Appl. 80(5), 7313–7331 (2021)CrossRef
32.
Zurück zum Zitat G. Liu, Z. Lin, J. Shuicheng Yan, Y.Y. Sun, Y. Ma, Robust recovery of subspace structures by low-rank representation. IEEE Trans. Pattern Anal. Mach. Intell. 35(1), 171–184 (2012)CrossRef G. Liu, Z. Lin, J. Shuicheng Yan, Y.Y. Sun, Y. Ma, Robust recovery of subspace structures by low-rank representation. IEEE Trans. Pattern Anal. Mach. Intell. 35(1), 171–184 (2012)CrossRef
33.
Zurück zum Zitat G. Liu, Z. Lin, Y. Yu, Robust subspace segmentation by low-rank representation. In Proceedings of the 27th International Conference on International Conference on Machine Learning, number 8 in ICML’10, page 663–670, Madison, WI, USA, (2010). Omnipress G. Liu, Z. Lin, Y. Yu, Robust subspace segmentation by low-rank representation. In Proceedings of the 27th International Conference on International Conference on Machine Learning, number 8 in ICML’10, page 663–670, Madison, WI, USA, (2010). Omnipress
35.
Zurück zum Zitat L. Canyi, J. Feng, S. Yan, Z. Lin, A unified alternating direction method of multipliers by majorization minimization. IEEE Trans. Pattern Anal. Mach. Intell. 40(3), 527–541 (2017) L. Canyi, J. Feng, S. Yan, Z. Lin, A unified alternating direction method of multipliers by majorization minimization. IEEE Trans. Pattern Anal. Mach. Intell. 40(3), 527–541 (2017)
37.
Zurück zum Zitat L. Ma, C. Wang, B. Xiao, W. Zhou, Sparse representation for face recognition based on discriminative low-rank dictionary learning. In 2012 IEEE conference on computer vision and pattern recognition, pages 2586–2593, (2012) L. Ma, C. Wang, B. Xiao, W. Zhou, Sparse representation for face recognition based on discriminative low-rank dictionary learning. In 2012 IEEE conference on computer vision and pattern recognition, pages 2586–2593, (2012)
38.
Zurück zum Zitat D. Mitrović, M. Zeppelzauer, C. Breiteneder, Features for content-based audio retrieval. In Adv. Comput. Improv. Web 78, 71–150 (2010)CrossRef D. Mitrović, M. Zeppelzauer, C. Breiteneder, Features for content-based audio retrieval. In Adv. Comput. Improv. Web 78, 71–150 (2010)CrossRef
39.
Zurück zum Zitat L. Nanni, Y.M.G. Costa, D.R. Lucio, C.N. Silla, S. Brahnam, Combining visual and acoustic features for audio classification tasks. Pattern Recogn. Lett. 88, 49–56 (2017)CrossRef L. Nanni, Y.M.G. Costa, D.R. Lucio, C.N. Silla, S. Brahnam, Combining visual and acoustic features for audio classification tasks. Pattern Recogn. Lett. 88, 49–56 (2017)CrossRef
40.
Zurück zum Zitat L. Nanni, Y.M.G. Costa, A. Lumini, M.Y. Kim, S.R. Baek, Combining visual and acoustic features for music genre classification. Expert Syst. Appl. 45, 108–117 (2016)CrossRef L. Nanni, Y.M.G. Costa, A. Lumini, M.Y. Kim, S.R. Baek, Combining visual and acoustic features for music genre classification. Expert Syst. Appl. 45, 108–117 (2016)CrossRef
41.
Zurück zum Zitat R. Nosaka, C.H. Suryanto, K. Fukui, Rotation invariant co-occurrence among adjacent lbps. In Jong-Il Park and Junmo Kim, editors, Computer Vision - ACCV 2012 Workshops, pages 15–25, (2013) R. Nosaka, C.H. Suryanto, K. Fukui, Rotation invariant co-occurrence among adjacent lbps. In Jong-Il Park and Junmo Kim, editors, Computer Vision - ACCV 2012 Workshops, pages 15–25, (2013)
42.
Zurück zum Zitat T. Ojala, M. Pietikainen, T. Maenpaa, Multiresolution gray-scale and rotation invariant texture classification with local binary patterns. IEEE Trans. Pattern Anal. Mach. Intell. 24(7), 971–987 (2002)CrossRef T. Ojala, M. Pietikainen, T. Maenpaa, Multiresolution gray-scale and rotation invariant texture classification with local binary patterns. IEEE Trans. Pattern Anal. Mach. Intell. 24(7), 971–987 (2002)CrossRef
43.
Zurück zum Zitat V. Ojansivu, J. Heikkilä, Blur insensitive texture classification using local phase quantization. In Abderrahim Elmoataz, Olivier Lezoray, Fathallah Nouboud, and Driss Mammass, editors, Image and Signal Processing, pages 236–243, (2008) V. Ojansivu, J. Heikkilä, Blur insensitive texture classification using local phase quantization. In Abderrahim Elmoataz, Olivier Lezoray, Fathallah Nouboud, and Driss Mammass, editors, Image and Signal Processing, pages 236–243, (2008)
44.
Zurück zum Zitat Y. Panagakis, C. Kotropoulos, Music genre classification via topology preserving non-negative tensor factorization and sparse representations. In 2010 IEEE International Conference on Acoustics, Speech and Signal Processing, pages 249–252, (2010) Y. Panagakis, C. Kotropoulos, Music genre classification via topology preserving non-negative tensor factorization and sparse representations. In 2010 IEEE International Conference on Acoustics, Speech and Signal Processing, pages 249–252, (2010)
45.
Zurück zum Zitat Y. Panagakis, C.L. Kotropoulos, G.R. Arce, Music genre classification via joint sparse low-rank representation of audio features. IEEE/ACM Trans. Audio, Speech, Lang. Process. 22(12), 1905–1917 (2014)CrossRef Y. Panagakis, C.L. Kotropoulos, G.R. Arce, Music genre classification via joint sparse low-rank representation of audio features. IEEE/ACM Trans. Audio, Speech, Lang. Process. 22(12), 1905–1917 (2014)CrossRef
46.
Zurück zum Zitat L. Qiu, S. Li, Y. Sung, 3D-DCDAE: Unsupervised music latent representations learning method based on a deep 3d convolutional denoising autoencoder for music genre classification. Mathematics 9(18), 2274 (2021)CrossRef L. Qiu, S. Li, Y. Sung, 3D-DCDAE: Unsupervised music latent representations learning method based on a deep 3d convolutional denoising autoencoder for music genre classification. Mathematics 9(18), 2274 (2021)CrossRef
47.
Zurück zum Zitat L. Qiu, S. Li, Y. Sung, DBTMPE: Deep bidirectional transformers-based masked predictive encoder approach for music genre classification. Mathematics 9(5), 530 (2021)CrossRef L. Qiu, S. Li, Y. Sung, DBTMPE: Deep bidirectional transformers-based masked predictive encoder approach for music genre classification. Mathematics 9(5), 530 (2021)CrossRef
48.
Zurück zum Zitat A. Schindler, A. Rauber, An audio-visual approach to music genre classification through affective color features. In Allan Hanbury, Gabriella Kazai, Andreas Rauber, and Norbert Fuhr, editors, Advances in Information Retrieval, pages 61–67, (04 2015) A. Schindler, A. Rauber, An audio-visual approach to music genre classification through affective color features. In Allan Hanbury, Gabriella Kazai, Andreas Rauber, and Norbert Fuhr, editors, Advances in Information Retrieval, pages 61–67, (04 2015)
49.
Zurück zum Zitat F. Song, D. Zhang, D. Mei, Z. Guo, A multiple maximum scatter difference discriminant criterion for facial feature extraction. IEEE Trans. Syst. Man Cybern. Part B (Cybernetics) 37(6), 1599–1606 (2007)CrossRef F. Song, D. Zhang, D. Mei, Z. Guo, A multiple maximum scatter difference discriminant criterion for facial feature extraction. IEEE Trans. Syst. Man Cybern. Part B (Cybernetics) 37(6), 1599–1606 (2007)CrossRef
50.
Zurück zum Zitat D.G. Stork, R.O. Duda, P.E. Hart, D. Stork, Pattern classification (A Wiley-Interscience Publication, Hoboken, 2001) D.G. Stork, R.O. Duda, P.E. Hart, D. Stork, Pattern classification (A Wiley-Interscience Publication, Hoboken, 2001)
51.
Zurück zum Zitat G. Tzanetakis, P. Cook, Musical genre classification of audio signals. IEEE Transactions on Speech and Audio Processing 10(5), 293–302 (2002)CrossRef G. Tzanetakis, P. Cook, Musical genre classification of audio signals. IEEE Transactions on Speech and Audio Processing 10(5), 293–302 (2002)CrossRef
52.
Zurück zum Zitat E. Van Den Berg, M.P. Friedlander, Probing the pareto frontier for basis pursuit solutions. SIAM J. Sci. Comput. 31(2), 890–912 (2009)MathSciNetCrossRef E. Van Den Berg, M.P. Friedlander, Probing the pareto frontier for basis pursuit solutions. SIAM J. Sci. Comput. 31(2), 890–912 (2009)MathSciNetCrossRef
53.
Zurück zum Zitat T.H. Vu, V. Monga, Fast low-rank shared dictionary learning for image classification. IEEE Trans. Image Process. 26(11), 5160–5175 (2017)MathSciNetCrossRef T.H. Vu, V. Monga, Fast low-rank shared dictionary learning for image classification. IEEE Trans. Image Process. 26(11), 5160–5175 (2017)MathSciNetCrossRef
54.
Zurück zum Zitat H. Wang, S. Yan, D. Xu, X. Tang, T. Huang, Trace ratio vs. ratio trace for dimensionality reduction. In 2007 IEEE Conference on Computer Vision and Pattern Recognition, pages 1–8, (2007) H. Wang, S. Yan, D. Xu, X. Tang, T. Huang, Trace ratio vs. ratio trace for dimensionality reduction. In 2007 IEEE Conference on Computer Vision and Pattern Recognition, pages 1–8, (2007)
55.
Zurück zum Zitat Z. Wen, B. Hou, L. Jiao, Discriminative dictionary learning with two-level low rank and group sparse decomposition for image classification. IEEE trans. cybern. 47(11), 3758–3771 (2017)CrossRef Z. Wen, B. Hou, L. Jiao, Discriminative dictionary learning with two-level low rank and group sparse decomposition for image classification. IEEE trans. cybern. 47(11), 3758–3771 (2017)CrossRef
56.
Zurück zum Zitat J. Wright, A.Y. Yang, A. Ganesh, S.S. Sastry, Y. Ma, Robust face recognition via sparse representation. IEEE Trans. Pattern Anal. Mach. Intell. 31(2), 210–227 (2009)CrossRef J. Wright, A.Y. Yang, A. Ganesh, S.S. Sastry, Y. Ma, Robust face recognition via sparse representation. IEEE Trans. Pattern Anal. Mach. Intell. 31(2), 210–227 (2009)CrossRef
57.
Zurück zum Zitat M. Wu, Z. Chen, J.R. Jang, J. Ren, Y. Li, C. Lu, Combining visual and acoustic features for music genre classification. In 2011 10th International Conference on Machine Learning and Applications and Workshops, volume 2, pages 124–129, (2011) M. Wu, Z. Chen, J.R. Jang, J. Ren, Y. Li, C. Lu, Combining visual and acoustic features for music genre classification. In 2011 10th International Conference on Machine Learning and Applications and Workshops, volume 2, pages 124–129, (2011)
58.
Zurück zum Zitat X. Huan, C. Caramanis, S. Sanghavi, Robust pca via outlier pursuit. IEEE Trans. Inf. Theory 58(5), 3047–3064 (2012)MathSciNetCrossRef X. Huan, C. Caramanis, S. Sanghavi, Robust pca via outlier pursuit. IEEE Trans. Inf. Theory 58(5), 3047–3064 (2012)MathSciNetCrossRef
59.
Zurück zum Zitat Y. Xu, W. Zhou, A deep music genres classification model based on cnn with squeeze & excitation block. In 2020 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC), pages 332–338, (2020) Y. Xu, W. Zhou, A deep music genres classification model based on cnn with squeeze & excitation block. In 2020 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC), pages 332–338, (2020)
60.
Zurück zum Zitat B.Q. Yang, X.P. Guan, J.W. Zhu, G. ChaoChen, W. KaiJie, X. JiaJie, Svms multi-class loss feedback based discriminative dictionary learning for image classification. Pattern Recogn. 112, 107690 (2021)CrossRef B.Q. Yang, X.P. Guan, J.W. Zhu, G. ChaoChen, W. KaiJie, X. JiaJie, Svms multi-class loss feedback based discriminative dictionary learning for image classification. Pattern Recogn. 112, 107690 (2021)CrossRef
61.
Zurück zum Zitat H. Yang, W.Q. Zhang, Music genre classification using duplicated convolutional layers in neural networks. In Interspeech, pages 3382–3386, (2019) H. Yang, W.Q. Zhang, Music genre classification using duplicated convolutional layers in neural networks. In Interspeech, pages 3382–3386, (2019)
62.
Zurück zum Zitat J. Yang, X. Yuan, Linearized augmented lagrangian and alternating direction methods for nuclear norm minimization. Math. Comput. 82(281), 301–329 (2013)MathSciNetCrossRef J. Yang, X. Yuan, Linearized augmented lagrangian and alternating direction methods for nuclear norm minimization. Math. Comput. 82(281), 301–329 (2013)MathSciNetCrossRef
63.
Zurück zum Zitat M. Yang, L. Zhang, X. Feng, D. Zhang, Sparse representation based fisher discrimination dictionary learning for image classification. Int. J. Comput. Vision 109(3), 209–232 (2014)MathSciNetCrossRef M. Yang, L. Zhang, X. Feng, D. Zhang, Sparse representation based fisher discrimination dictionary learning for image classification. Int. J. Comput. Vision 109(3), 209–232 (2014)MathSciNetCrossRef
64.
Zurück zum Zitat J. Ylioinas, A. Hadid, Y. Guo, M. Pietikäinen, Efficient image appearance description using dense sampling based local binary patterns. In Kyoung Mu Lee, Yasuyuki Matsushita, James M. Rehg, and Zhanyi Hu, editors, Computer Vision – ACCV 2012, pages 375–388, (2013) J. Ylioinas, A. Hadid, Y. Guo, M. Pietikäinen, Efficient image appearance description using dense sampling based local binary patterns. In Kyoung Mu Lee, Yasuyuki Matsushita, James M. Rehg, and Zhanyi Hu, editors, Computer Vision – ACCV 2012, pages 375–388, (2013)
65.
Zurück zum Zitat Yu. Yang, S. Luo, S. Liu, H. Qiao, Y. Liu, L. Feng, Deep attention based music genre classification. Neurocomputing 372, 84–91 (2020)CrossRef Yu. Yang, S. Luo, S. Liu, H. Qiao, Y. Liu, L. Feng, Deep attention based music genre classification. Neurocomputing 372, 84–91 (2020)CrossRef
66.
Zurück zum Zitat Y. Zhang, Z. Jiang, L.S. Davis, Learning structured low-rank representations for image classification. In Proceedings of the IEEE conference on computer vision and pattern recognition, pages 676–683, (2013) Y. Zhang, Z. Jiang, L.S. Davis, Learning structured low-rank representations for image classification. In Proceedings of the IEEE conference on computer vision and pattern recognition, pages 676–683, (2013)
67.
Zurück zum Zitat G. Zhao, T. Ahonen, J. Matas, M. Pietikainen, Rotation-invariant image and video description with local binary pattern features. IEEE Trans. Image Process. 21(4), 1465–1477 (2012)MathSciNetCrossRef G. Zhao, T. Ahonen, J. Matas, M. Pietikainen, Rotation-invariant image and video description with local binary pattern features. IEEE Trans. Image Process. 21(4), 1465–1477 (2012)MathSciNetCrossRef
68.
Zurück zum Zitat L. Zhuang, H. Gao, Z. Lin, Y. Ma, X. Zhang, N. Yu, Non-negative low rank and sparse graph for semi-supervised learning. In 2012 IEEE Conference on Computer Vision and Pattern Recognition, pages 2328–2335, (2012) L. Zhuang, H. Gao, Z. Lin, Y. Ma, X. Zhang, N. Yu, Non-negative low rank and sparse graph for semi-supervised learning. In 2012 IEEE Conference on Computer Vision and Pattern Recognition, pages 2328–2335, (2012)
Metadaten
Titel
Fisher Discriminative Embedding Low-Rank Sparse Representation for Music Genre Classification
verfasst von
Xin Cai
Hongjuan Zhang
Publikationsdatum
14.05.2024
Verlag
Springer US
Erschienen in
Circuits, Systems, and Signal Processing
Print ISSN: 0278-081X
Elektronische ISSN: 1531-5878
DOI
https://doi.org/10.1007/s00034-024-02696-0