Sentiment Analysis Using Random Forest Algorithm-Online Social Media Based

Nfn Bahrawi

Abstract

Every day, billions of pieces of text data flood the internet, whether from forums, blogs, social media, or review sites. Sentiment analysis can transform this previously unstructured data into structured data and turn it into useful information. The data can describe public opinions or sentiments about products, brands, community services, politics, or other topics. Sentiment analysis is a field of Natural Language Processing (NLP) that builds systems for recognizing and extracting opinions from text. At the most basic level, the goal is to extract emotions or 'feelings' from a collection of texts or sentences. Sentiment analysis, also called 'opinion mining', always involves some form of data mining to obtain the text that is later used to train the machine learning model being built. This study conducts sentiment analysis on data from Twitter using the Random Forest algorithm and evaluates the performance of that algorithm. The measured accuracy in this study is around 75%. The model is good enough, but we suggest trying other algorithms in further research.
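The approach summarized above (text in, ensemble of trees, accuracy out) can be illustrated with a small sketch. This is not the study's implementation: it is a toy bagged ensemble of single-word decision stumps, a much-simplified stand-in for a Random Forest that keeps only its three core ideas (bootstrap samples, random feature subsets, majority vote). The example tweets, labels, function names, and parameters are all hypothetical, and a real pipeline would typically use a library such as scikit-learn together with proper text preprocessing.

```python
import random
from collections import Counter

def tokenize(text):
    # Lowercase whitespace tokenization; a real pipeline would also clean and stem.
    return set(text.lower().split())

def train_stump(samples, vocab, rng):
    """Fit a one-word decision stump using a random subset of the vocabulary."""
    candidates = rng.sample(vocab, max(1, int(len(vocab) ** 0.5)))
    best = None
    for word in candidates:
        with_word = Counter(y for tokens, y in samples if word in tokens)
        without = Counter(y for tokens, y in samples if word not in tokens)
        # Number of samples a majority-label rule on each side classifies correctly.
        correct = max(with_word.values(), default=0) + max(without.values(), default=0)
        if best is None or correct > best[0]:
            best = (correct, word, with_word, without)
    _, word, with_word, without = best
    fallback = Counter(y for _, y in samples).most_common(1)[0][0]
    yes = with_word.most_common(1)[0][0] if with_word else fallback
    no = without.most_common(1)[0][0] if without else fallback
    return word, yes, no

def train_forest(texts, labels, n_trees=25, seed=0):
    """Train n_trees stumps, each on its own bootstrap sample of the data."""
    rng = random.Random(seed)
    samples = [(tokenize(t), y) for t, y in zip(texts, labels)]
    vocab = sorted(set().union(*(tokens for tokens, _ in samples)))
    forest = []
    for _ in range(n_trees):
        bootstrap = [rng.choice(samples) for _ in samples]  # sample with replacement
        forest.append(train_stump(bootstrap, vocab, rng))
    return forest

def predict(forest, text):
    """Majority vote over all stumps in the ensemble."""
    tokens = tokenize(text)
    votes = Counter(yes if word in tokens else no for word, yes, no in forest)
    return votes.most_common(1)[0][0]

def accuracy(y_true, y_pred):
    """Fraction of predictions that match the true labels."""
    return sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)

# Hypothetical toy data, invented for illustration only.
texts = [
    "great service love it",
    "love this product",
    "great phone really happy",
    "terrible service hate it",
    "hate this product",
    "terrible phone really bad",
]
labels = ["positive", "positive", "positive", "negative", "negative", "negative"]

forest = train_forest(texts, labels)
print(predict(forest, "love this great phone"))
print(accuracy([1, 0, 1, 1], [1, 0, 0, 1]))  # 3 of 4 correct -> 0.75
```

Accuracy here is simply the share of correct predictions, so a score of 0.75 means three out of every four texts received the right label. In practice it should be measured on held-out data rather than on the training texts.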

 


Keywords


sentiment analysis; random forest algorithm; classification; machine learning






DOI: http://dx.doi.org/10.30818/jitu.2.2.2695



Copyright (c) 2019 Journal of Information Technology and Its Utilization

Creative Commons License
This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License.