Please use this identifier to cite or link to this item: https://rda.sliit.lk/handle/123456789/1453
Title: Sentiment Classification of Sinhala Content in Social Media: A Comparison between Stemmers and N-gram Features
Authors: Jayasuriya, P
Munasinghe, R
Thelijjagoda, S
Keywords: Sentiment Classification
Sinhala Content
Social Media
Comparison between Stemmers
N-gram Features
Issue Date: 9-Dec-2021
Publisher: IEEE
Citation: P. Jayasuriya, R. Munasinghe and S. Thelijjagoda, "Sentiment Classification of Sinhala Content in Social Media: A Comparison between Stemmers and N-gram Features," 2021 IEEE 16th International Conference on Industrial and Information Systems (ICIIS), 2021, pp. 134-139, doi: 10.1109/ICIIS53135.2021.9660711.
Series/Report no.: 2021 IEEE 16th International Conference on Industrial and Information Systems (ICIIS);Pages 134-139
Abstract: Sentiment classification for non-English languages has gained significant attention from researchers in the past few years with the increasing use of non-English scripts and Romanized scripts for expressing sentiments over social media. In this study, we begin by classifying Sinhala sentiments on social media into positive and negative polarity classes using N-gram feature extraction. N-grams are a contiguous sequence of words or characters of a text. Then we focus on improving the classification accuracy by employing different stemming methods. Stemming is generally used to reduce the dimensionality of the feature set - something which needs to be carried out with great care as over reducing feature dimensionality causes the classification accuracy to decrease. Finally, we compare the accuracy and efficiency of N-gram feature extraction and stemming based sentiment analysis models.
URI: http://rda.sliit.lk/handle/123456789/1453
ISSN: 2164-7011
Appears in Collections:Department of Information Management-Scopes
Research Papers
Research Papers - Dept of Information of Management
Research Papers - SLIIT Staff Publications

Files in This Item:
File Description SizeFormat 
Sentiment_Classification_of_Sinhala_Content_in_Social_Media_A_Comparison_between_Stemmers_and_N-gram_Features.pdf
  Until 2050-12-31
449.18 kBAdobe PDFView/Open Request a copy


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.