MetrikaBox: An open framework for experimenting with audio classification

Abstract

This paper presents MetrikaBox, a general-purpose, open-source, and extensible audio classification package designed to facilitate the development of Deep Learning (DL) models for a wide range of audio processing tasks. The software manages all necessary preprocessing steps to build classification models capable of distinguishing between user-defined classes using advanced Artificial Intelligence (AI) techniques. MetrikaBox is well suited for tasks such as musical genre classification, voice-versus-music discrimination, and other audio classification or segmentation applications. Users can either employ the package as provided or extend it by integrating their own datasets, classification models, data loading systems, augmentation techniques, and more. The package has been tested in both commercial and academic settings, where it has produced models for industrial audio processing and served as a platform for proof-of-concept applications. Comprehensive documentation and practical examples included in the repository support users in integrating the system into their audio analysis projects. MetrikaBox is openly available and provides a user interface for convenient testing.

Publication
SoftwareX
Jorge Perianez-Pascual
Jorge Perianez-Pascual
Researcher

Software engineer and reasearcher at i3lab. Co-founder of MetrikaMedia.

Juan D. Gutiérrez
Juan D. Gutiérrez
Assistant Professor

Assistant Professor at Universidade de Santiago de Compostela. I enjoy computing but, above all, learning new things.

Fernando Sánchez-Figueroa
Fernando Sánchez-Figueroa
Full Professor

My research focuses on Web engineering, big data visualization, and MDD.

Roberto Rodriguez-Echeverria
Roberto Rodriguez-Echeverria
Associate Professor

Associate Professor at Universidad de Extremadura. Deep learner, MTB rider and father of 2.