An Aspect-Sentiment Pair Extraction Approach Based on Latent Dirichlet Allocation

Ekin Ekinci, Sevinc Ilhan Omurca
  • Sevinc Ilhan Omurca
    Kocaeli University, Turkey

Abstract

Online user reviews have a great influence on decision-making process of customers and product sales of companies. However, it is very difficult to obtain user sentiments among huge volume of data on the web consequently; sentiment analysis has gained great importance in terms of analyzing data automatically. On the other hand, sentiment analysis divides itself into branches and can be performed better with aspect level analysis. In this paper, we proposed to extract aspect-sentiment pairs from a Turkish reviews dataset. The proposed task is the fundamental and indeed the critical step of the aspect level sentiment analysis. While extracting aspect-sentiment pairs, an unsupervised topic model Latent Dirichlet Allocation (LDA) is used. With LDA, aspect-sentiment pairs from user reviews are extracted with 0.86 average precision based on ranked list. The aspect-sentiment pair extraction problem is first time realized with LDA on a real-world Turkish user reviews dataset. The experimental results show that LDA is effective and robust in aspect-sentiment pair extraction from user reviews.

Keywords

Aspect-Sentiment Pair Extraction; Latent Dirichlet Allocation (LDA); Sentiment Analysis; Turkish User Reviews

Full Text:

PDF
Submitted: 2018-07-18 11:07:29
Published: 2018-09-26 07:04:22
Search for citations in Google Scholar
Related articles: Google Scholar

References

A. Bagheri, M. Saraee, and F. de Jong, “Care more about customers: Unsupervised domain-independent aspect detection for sentiment analysis of customer reviews,” Knowl-Based Syst., vol. 52, pp. 201–213, Aug. 2013.

B. Liu, “Sentiment Analysis and Opinion Mining,” in Synthesis Lectures on Human Language Technologies, vol. 5, G. Hirst, Ed. Morgan & Claypool Publishers 2012, pp. 1–167.

N. S. Joshi, and S. A. Itkat, “A Survey on Feature Level Sentiment Analysis,” IJCSIT, vol. 5, no. 4, pp. 5422–5425, Aug. 2014.

H. Türkmen, E.Ekinci, and S. İlhan Omurca, “A Novel Method for Extracting Feature Opinion Pairs for Turkish,” in Lecture Notes in Computer Science, vol. 9883, C. Dichev, G. Agre, Eds. Springer, Cham, 2016, pp. 162–171.

M. Hu, and B. Liu, “Mining Opinion Features in Customer Reviews,” in Proc. 19th National Conference on Artificial Intelligence, San Jose, California, USA, 2004, pp. 755-760.

M. Hu, and B. Liu, “Mining and Summarizing Customer Reviews,” in Proc. International Conference on Knowledge Discovery and Data Mining, Seattle, WA, USA, 2004, pp 168-177.

A. M. Popescu, and O. Etzioni, “Mining and Summarizing Customer Reviews,” in Proc. Human Language Technology Conference and Conference on Empirical Methods in Natural Language Processing, Vancouver, B.C., Canada, 2005, pp 339-346.

K. T. Chan, and I. King, “Let’s Tango – Finding the Right Couple for Feature-Opinion Association in Sentiment Analysis,” in Lecture Notes in Computer Science, vol. 5476, T. Theeramunkong, B. Kijsirikul, N. Cercone, and T. B. Ho, Eds. Springer, Berlin, Heidelberg, 2009, pp. 741-748.

Y. Huang, Z. He, and H. Wang, “Optimization of Feature-Opinion Pairs in Chinese Customer Reviews,” in Lecture Notes in Computer Science, vol. 5579, B. C. Chien, T. P. Hong, S. M. Chen, and M. Ali, Eds. Springer, Berlin, Heidelberg, 2009, pp. 747-756.

S. Brody, and N. Elhadad, “An Unsupervised Aspect-Sentiment Model for Online Reviews” in Proc. Human Language Technologies: The 2010 Annual Conference of the North American Chapter of the ACL, Los Angeles, California, USA, 2010, pp. 804-812.

Y. Jo, and A. Oh, “Aspect and Sentiment Unification Model for Online Review Analysis,” in Proc. Fourth ACM International Conference on Web Search and Data Mining, Hong Kong, China, 2011, pp. 815-824.

S. Kim, J. Zhang, Z. Chen, A. Oh, and S. Liu, “A Hierarchical Aspect-Sentiment Model for Online Reviews,” in Proc. Twenty-Seventh AAAI Conference on Artificial Intelligence, Bellevue, Washington, USA, 2013, pp. 526-533.

P. Yin, H. Wang, and K. Guo, “Feature–opinion pair identification of product reviews in Chinese: a domain ontology modelling method,” New Rev. Hypermedia M., vol. 19, no. 1, pp. 3-24, Apr. 2013.

R. Klinger, and P. Climiano, “”Joint and Pipeline Probabilistic Models for Fine-Grained Sentiment Analysis: Extracting Aspects, Subjective Phrases and their Relations,” in Proc. 2013 IEEE 13th International Conference on Data Mining Workshops, Dallas, TX, USA, 2013, pp. 937-944.

E. Zhou, X. Luo and Z. Qin, “Incorporating Language Patterns and Domain Knowledge into Feature-Opinion Extraction,” in Lecture Notes in Computer Science, vol. 8655, P. Sojka, A. Horàk, I. Kopeček, and K. Pala, Eds. Springer, Cham, 2014, pp. 209-216.

C. Quan, and F. Ren, “Unsupervised product feature extraction for feature-oriented opinion determination,” Inform. Sciences, vol. 272, pp. 16-28, July 2014.

S. Wang, Z. Chen, and B. Liu, “Mining Aspect-Specific Opinion using a Holistic Lifelong Topic Model,” in Proc. International World Wide Web Conference Committee, Montréal, Québec, Canada, 2016, pp. 167-176.

R. K. Amplayo, and S. Hwang, “Aspect Sentiment Model for Micro Reviews,” in Proc. 2017 IEEE International Conference on Data Mining, New Orleans, USA, 2017, pp. 727-732.

D. M. Blei, “Probabilistic Topic Models,” Commun. ACM, vol. 55, no. 4, pp. 77-84, Apr. 2012.

D. M. Blei, A. Y. Ng, and M. I. Jordan, “Latent Dirichlet Allocation,” J. Mach. Learn. Res., vol. 3, pp. 993-1022, Jan. 2003.

A. Agrawal, W. Fu, and T. Menzies, “What is wrong with topic modeling? And how to fix it using search-based software engineering,” Inform. Software Tech., vol. 98, pp. 74-88, June 2018.

E. Ekinci, and S. İlhan Omurca, “Extracting Implicit Aspects based on Latent Dirichlet Allocation,” in Proc. Doctoral Consortium – DCAART-(ICAART 2017), Porto, Portugal, 2017, pp. 17-23.

M. D. Akın, and A. A. Akın, “Türk Dilleri için Açık Kaynaklı Doğal Dil İşleme Kütüphanesi: Zemberek,” Elektrik Mühendisliği, vol. 431, pp. 38-44, Aug. 2007.

E. Ekinci, H. Türkmen, and S. İlhan Omurca, “Determining Multi Word Aspects by Using Apriori Algorithm and Syntactic Rules for Turkish Hotel Reviews,” in Proc. 8th Language & Technology Conference: Human Language Technologies as a Challenge for Computer Science and Linguistics, Poznan, Poland, 2017, pp. 225-229.

A. Moro, A. Raganato, and R. Navigli, “Entity Linking meets Word Sense Disambiguation: a Unified Approach,” TACL, vol. 2, pp. 231–244, May 2014.

Abstract views:
81

Views:
PDF
64




Copyright (c) 2018 International Journal of Intelligent Systems and Applications in Engineering

Creative Commons License
This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.
 
© Prof.Dr. Ismail SARITAS 2013-2018     -    Address: Selcuk University, Faculty of Technology 42031 Selcuklu, Konya/TURKEY.