Please use this identifier to cite or link to this item:
https://rda.sliit.lk/handle/123456789/1453
Title: | Sentiment Classification of Sinhala Content in Social Media: A Comparison between Stemmers and N-gram Features |
Authors: | Jayasuriya, P Munasinghe, R Thelijjagoda, S |
Keywords: | Sentiment Classification Sinhala Content Social Media Comparison between Stemmers N-gram Features |
Issue Date: | 9-Dec-2021 |
Publisher: | IEEE |
Citation: | P. Jayasuriya, R. Munasinghe and S. Thelijjagoda, "Sentiment Classification of Sinhala Content in Social Media: A Comparison between Stemmers and N-gram Features," 2021 IEEE 16th International Conference on Industrial and Information Systems (ICIIS), 2021, pp. 134-139, doi: 10.1109/ICIIS53135.2021.9660711. |
Series/Report no.: | 2021 IEEE 16th International Conference on Industrial and Information Systems (ICIIS);Pages 134-139 |
Abstract: | Sentiment classification for non-English languages has gained significant attention from researchers in the past few years with the increasing use of non-English scripts and Romanized scripts for expressing sentiments over social media. In this study, we begin by classifying Sinhala sentiments on social media into positive and negative polarity classes using N-gram feature extraction. N-grams are a contiguous sequence of words or characters of a text. Then we focus on improving the classification accuracy by employing different stemming methods. Stemming is generally used to reduce the dimensionality of the feature set - something which needs to be carried out with great care as over reducing feature dimensionality causes the classification accuracy to decrease. Finally, we compare the accuracy and efficiency of N-gram feature extraction and stemming based sentiment analysis models. |
URI: | http://rda.sliit.lk/handle/123456789/1453 |
ISSN: | 2164-7011 |
Appears in Collections: | Department of Information Management-Scopes Research Papers Research Papers - Dept of Information of Management Research Papers - SLIIT Staff Publications |
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
Sentiment_Classification_of_Sinhala_Content_in_Social_Media_A_Comparison_between_Stemmers_and_N-gram_Features.pdf Until 2050-12-31 | 449.18 kB | Adobe PDF | View/Open Request a copy |
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.