Emotion4MIDI: a Lyrics-based Emotion-Labeled Symbolic Music Dataset
Our paper “Emotion4MIDI: a Lyrics-based Emotion-Labeled Symbolic Music Dataset” has been accepted to 2023 EPIA Conference on Artificial Intelligence (EPIA 2023), which took place in the beautiful Azores Islands in Portugal.
The paper is available on Springer Link and on ArXiv. The source code is available on Github. Our paper can be cited as:
@inproceedings{sulunEmotion4MIDILyricsBasedEmotionLabeled2023a,
title = {Emotion4MIDI: A Lyrics-Based Emotion-Labeled Symbolic Music Dataset},
booktitle = {Progress in Artificial Intelligence},
author = {Sulun, Serkan and Oliveira, Pedro and Viana, Paula},
editor = {Moniz, Nuno and Vale, Zita and Cascalho, Jos{\'e} and Silva, Catarina and Sebasti{\~a}o, Raquel},
year = {2023},
pages = {77--89},
publisher = {Springer Nature Switzerland},
address = {Cham},
isbn = {978-3-031-49011-8}}
Abstract: We present a new large-scale emotion-labeled symbolic music dataset consisting of 12k MIDI songs. To create this dataset, we first trained emotion classification models on the GoEmotions dataset, achieving state-of-the-art results with a model half the size of the baseline. We then applied these models to lyrics from two large-scale MIDI datasets. Our dataset covers a wide range of fine-grained emotions, providing a valuable resource to explore the connection between music and emotions and, especially, to develop models that can generate music based on specific emotions. Our code for inference, trained models, and datasets are available online.
Presentation at EPIA 2023:
Also at EPIA 2023, I gave a lightning talk on our previous paper on symbolic music generation, which was selected as one of the best lightning talks:
I also would like to thank the organizers of EPIA 2023 for the prize money, which I’ve already spent on swimming with the sharks in Azores: