TY - JOUR
T1 - A Music Playback Algorithm Based on Residual-Inception Blocks for Music Emotion Classification and Physiological Information
AU - Liao, Yi Jr
AU - Wang, Wei Chun
AU - Ruan, Shanq Jang
AU - Lee, Yu Hao
AU - Chen, Shih Ching
N1 - Funding Information:
Funding: This research was funded by the National Taiwan University of Science and Technology—Taipei Medical University Joint Research Program under the project “The design of musical canter algorithm based on deep learning“ (Grant No. NTUST-TMU-110-02), and “The effect of meditation therapy and singing therapy on autonomic nerve function in patients with Parkinson’s disease” (Grant No. TMU-NTUST-103-04).
Funding Information:
This research was funded by the National Taiwan University of Science and Technology?Taipei Medical University Joint Research Program under the project ?The design of musical canter algorithm based on deep learning? (Grant No. NTUST-TMU-110-02), and ?The effect of meditation therapy and singing therapy on autonomic nerve function in patients with Parkinson?s disease? (Grant No. TMU-NTUST-103-04).
Publisher Copyright:
© 2022 by the authors. Licensee MDPI, Basel, Switzerland.
PY - 2022/2/1
Y1 - 2022/2/1
N2 - Music can generate a positive effect in runners’ performance and motivation. However, the practical implementation of music intervention during exercise is mostly absent from the literature. Therefore, this paper designs a playback sequence system for joggers by considering music emotion and physiological signals. This playback sequence is implemented by a music selection module that combines artificial intelligence techniques with physiological data and emotional music. In order to make the system operate for a long time, this paper improves the model and selection music module to achieve lower energy consumption. The proposed model obtains fewer FLOPs and parameters by using logarithm scaled Mel-spectrogram as input features. The accuracy, computational complexity, trainable parameters, and inference time are evaluated on the Bi-modal, 4Q emotion, and Soundtrack datasets. The experimental results show that the proposed model is better than that of Sarkar et al. and achieves competitive performance on Bi-modal (84.91%), 4Q emotion (92.04%), and Soundtrack (87.24%) datasets. More specifically, the proposed model reduces the computational complexity and inference time while maintaining the classification accuracy, compared to other models. Moreover, the size of the proposed model for network training is small, which can be applied to mobiles and other devices with limited computing resources. This study designed the overall playback sequence system by considering the relationship between music emotion and physiological situation during exercise. The playback sequence system can be adopted directly during exercise to improve users’ exercise efficiency.
AB - Music can generate a positive effect in runners’ performance and motivation. However, the practical implementation of music intervention during exercise is mostly absent from the literature. Therefore, this paper designs a playback sequence system for joggers by considering music emotion and physiological signals. This playback sequence is implemented by a music selection module that combines artificial intelligence techniques with physiological data and emotional music. In order to make the system operate for a long time, this paper improves the model and selection music module to achieve lower energy consumption. The proposed model obtains fewer FLOPs and parameters by using logarithm scaled Mel-spectrogram as input features. The accuracy, computational complexity, trainable parameters, and inference time are evaluated on the Bi-modal, 4Q emotion, and Soundtrack datasets. The experimental results show that the proposed model is better than that of Sarkar et al. and achieves competitive performance on Bi-modal (84.91%), 4Q emotion (92.04%), and Soundtrack (87.24%) datasets. More specifically, the proposed model reduces the computational complexity and inference time while maintaining the classification accuracy, compared to other models. Moreover, the size of the proposed model for network training is small, which can be applied to mobiles and other devices with limited computing resources. This study designed the overall playback sequence system by considering the relationship between music emotion and physiological situation during exercise. The playback sequence system can be adopted directly during exercise to improve users’ exercise efficiency.
KW - Convolutional neural networks
KW - Deep learning
KW - Emotion classification
KW - Music selection module
KW - Physiological data
UR - http://www.scopus.com/inward/record.url?scp=85122959137&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85122959137&partnerID=8YFLogxK
U2 - 10.3390/s22030777
DO - 10.3390/s22030777
M3 - Article
AN - SCOPUS:85122959137
SN - 1424-8220
VL - 22
JO - Sensors
JF - Sensors
IS - 3
M1 - 777
ER -