Automatic Authorship Classification for German Lyrics Using Naïve Bayes
DOI:
https://doi.org/10.21248/jlcl.36.2023.242Keywords:
German Lyrics, Text Classification, Naïve Bayes, Machine LearningAbstract
Text classification is a prevalent and essential machine-learning task. Machine learning classifiers have developed immensely since their inception. The naïve Bayes classifier is one of the most prominent supervised machine learning classifiers. In this experiment, we highlight the performance of Naïve Bayes for classifying of authors/artists on the German lyrics corpus (“Songkorpus”) and compare the classification results with other classifier algorithms. The corpus of investigation consists of six artists with 970 songs in total. Bayes model evaluation measures revealed a precision of 0.91, recall of 0.94, and F1-measure of 0.9. Furthermore, the classification performance with other classifier algorithms did not reveal any statistically significant difference in performance. The results of the study add to the high volume of reports on the classification accuracy of Naive Bayes for the task of lyrical classification.
Downloads
Published
How to Cite
Issue
Section
License
Copyright (c) 2023 Akshay Mendhakar, Mesian Tilmatine
This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.