Publication Date
2011-07-21
Availability
Open access
Embargo Period
2011-07-21
Degree Name
Master of Science (MS)
Department
Computer Science (Arts and Sciences)
Date of Defense
2011-07-15
First Committee Member
Mitsunori Ogihara
Second Committee Member
Burton Rosenberg
Third Committee Member
Dimitris Papamichail
Abstract
In the field of Music Data Mining, Mood and Topic information has been considered as a high level metadata. The extraction of mood and topic information is difficult but is regarded as very valuable. The immense growth of Web 2.0 resulted in Social Tags being a direct interaction with users (humans) and their feedback through tags can help in classification and retrieval of music. One of the major shortcomings of the approaches that have been employed so far is the improper filtering of social tags. This thesis delves into the topic of information extraction from songs’ tags and lyrics. The main focus is on removing all erroneous and unwanted tags with help of other features. The hierarchical clustering method is applied to create clusters of tags. The clusters are based on semantic information any given pair of tags share. The lyrics features are utilized by employing CLOPE clustering method to form lyrics clusters, and Naïve Bayes method to compute probability values that aid in classification process. The outputs from classification are finally used to estimate the accuracy of a tag belonging to the song. The results obtained from the experiments all point towards the success of the method proposed and can be utilized by other research projects in the similar field.
Keywords
social tags; lyrics classification; hierarchical clustering; Naive Bayes; CLOPE clustering; Last.fm
Recommended Citation
Chawla, Rahul, "Filtering Social Tags for Songs based on Lyrics using Clustering Methods" (2011). Open Access Theses. 272.
http://scholarlyrepository.miami.edu/oa_theses/272