Dinda Ayu Muthia


Nowadays consumers are increasingly making their opinions and experiences online. Reading those reviews are time-consuming, but, if only few reviews were read, the evaluation would be biased. Sentiment analysis aims to solve this problem by automatically classifying user reviews into positive or negative opinions. Naive Bayes classifier is a popular machine learning technique for text classification, because it is so simple, efficient and it has a great performance in many domains. However, it has a lack that it is highly sensitive to the high number of feature. Therefore, in this research the concatenation of feature selection methods is used, that is Information gain and Genetic algorithm that could increase the accuracy of Naive Bayes classifier.This research turns out text classification in the form of positive or negative from book reviews. The measurement is based on the accuracy of Naive Bayes before and after adding the feature selection method. Evaluation was performed using 10 fold cross validation. Whereas the measurement of accuracy was measured by using confusion matrix and ROC curve. The result of this research is the improvement of accuracy of Naive Bayes from 75.50% to 84.50%.

Full Text:



Basari, A. S. H., Hussin, B., Ananta, I. G. P., & Zeniarja, J. (2013). Opinion Mining of Movie Review using Hybrid Method of Support Vector Machine and Particle Swarm Optimization. Procedia Engineering, 53, 453–462.

Chen, J., Huang, H., Tian, S., & Qu, Y. (2009). Feature selection for text classification with Naïve Bayes. Expert Systems with Applications, 36(3), 5432–5435.

Feldman, R. (2013). Techniques and applications for sentiment analysis. Communications of the ACM, 56(4), 82.

Gorunescu, F. (2011). Data Mining Concept Model Technique.

Gunal, S. (2012). Hybrid feature selection for text classification ¨, 20.

Haddi, E., Liu, X., & Shi, Y. (2013). The Role of Text Pre-processing in Sentiment Analysis. Procedia Computer Science, 17, 26–32.

Han, J., & Kamber, M. (2007). Data Mining Concepts and Techniques.

Markov, Z., & Daniel, T. (2007). Uncovering Patterns in.

Moraes, R., Valiati, J. F., & Gavião Neto, W. P. (2013).

Document-level sentiment classification: An empirical comparison between SVM and ANN. Expert Systems with Applications, 40(2), 621–633.

Santoso, Budi. 2007. Data Mining Teknik Pemanfaatan Data Untuk Keperluan Bisnis. Yogyakarta: Graha Ilmu.

Uysal, A. K., & Gunal, S. (2012). A novel probabilistic feature selection method for text classification. Knowledge-Based Systems, 36, 226–235.

V, S. R. R., Somayajulu, D. V. L. N., & Dani, A. R. (2010). Classification of Movie Reviews Using Complemented Naive Bayesian Classifier, 1(4), 162–167.

Ye, Q., Zhang, Z., & Law, R. (2009). Expert Systems with Applications Sentiment classification of online reviews to travel destinations by supervised machine learning approaches. Expert Systems With Applications, 36(3), 6527–6535.

Yessenov, K. (2009). Sentiment Analysis of Movie Review Comments 6.863, 1–17.

Zhang, Z., Ye, Q., Zhang, Z., & Li, Y. (2011). Sentiment classification of Internet restaurant reviews written in Cantonese. Expert Systems with Applications, 38(6), 7674–7682.


Copyright (c) 2016 Dinda Ayu Muthia

Creative Commons License
This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.


Dipublikasikan oleh LPPM Universitas Bina Sarana Informatika

Jl. Kramat Raya No.98, Kwitang, Kec. Senen, Kota Jakarta Pusat, DKI Jakarta 10450
Telepon: 021-21231170, ext. 704 / 705
Creative Commons License
This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License sundaempire787 Poskobet