A COMBINED DEEP LEARNING MODEL FOR PERSIAN SENTIMENT ANALYSIS

Zahra Bokaee Nezhad; Mohammad Ali Deihimi

doi:10.31436/iiumej.v20i1.1036

Authors

Zahra Bokaee Nezhad Department of Informatin and Computer Science, Zand university, Shiraz, Iran https://orcid.org/0000-0001-7533-4732
Mohammad Ali Deihimi https://orcid.org/0000-0001-5903-9960

DOI:

https://doi.org/10.31436/iiumej.v20i1.1036

Keywords:

Sentiment Analysis, Natural Language Processing, Hybrid Architecture, Persian, Machine Learning

Abstract

With increasing members in social media sites today, people tend to share their views about everything online. It is a convenient way to convey their messages to end users on a specific subject. Sentiment Analysis is a subfield of Natural Language Processing (NLP) that refers to the identification of users’ opinions toward specific topics. It is used in several fields such as marketing, customer services, etc. However, limited works have been done on Persian Sentiment Analysis. On the other hand, deep learning has recently become popular because of its successful role in several Natural Language Processing tasks. The objective of this paper is to propose a novel hybrid deep learning architecture for Persian Sentiment Analysis. According to the proposed model, local features are extracted by Convolutional Neural Networks (CNN) and long-term dependencies are learned by Long Short Term Memory (LSTM). Therefore, the model can harness both CNN's and LSTM's abilities. Furthermore, Word2vec is used for word representation as an unsupervised learning step. To the best of our knowledge, this is the first attempt where a hybrid deep learning model is used for Persian Sentiment Analysis. We evaluate the model on a Persian dataset that is introduced in this study. The experimental results show the effectiveness of the proposed model with an accuracy of 85%.

ABSTRAK: Hari ini dengan ahli yang semakin meningkat di laman media sosial, orang cenderung untuk berkongsi pandangan mereka tentang segala-galanya dalam talian. Ini adalah cara mudah untuk menyampaikan mesej mereka kepada pengguna akhir mengenai subjek tertentu. Analisis Sentimen adalah subfield Pemprosesan Bahasa Semula Jadi yang merujuk kepada pengenalan pendapat pengguna ke arah topik tertentu. Ia digunakan dalam beberapa bidang seperti pemasaran, perkhidmatan pelanggan, dan sebagainya. Walau bagaimanapun, kerja-kerja terhad telah dilakukan ke atas Analisis Sentimen Parsi. Sebaliknya, pembelajaran mendalam baru menjadi popular kerana peranannya yang berjaya dalam beberapa tugas Pemprosesan Bahasa Asli (NLP). Objektif makalah ini adalah mencadangkan senibina pembelajaran hibrid yang baru dalam Analisis Sentimen Parsi. Menurut model yang dicadangkan, ciri-ciri tempatan ditangkap oleh Rangkaian Neural Convolutional (CNN) dan ketergantungan jangka panjang dipelajari oleh Long Short Term Memory (LSTM). Oleh itu, model boleh memanfaatkan kebolehan CNN dan LSTM. Selain itu, Word2vec digunakan untuk perwakilan perkataan sebagai langkah pembelajaran tanpa pengawasan. Untuk pengetahuan yang terbaik, ini adalah percubaan pertama di mana model pembelajaran mendalam hibrid digunakan untuk Analisis Sentimen Persia. Kami menilai model pada dataset Persia yang memperkenalkan dalam kajian ini. Keputusan eksperimen menunjukkan keberkesanan model yang dicadangkan dengan ketepatan 85%.

Downloads

Download data is not yet available.

Metrics

Metrics Loading ...

References

Rajadesingan, A., R. Zafarani and H. Liu (2015). Sarcasm Detection on Twitter: A Behavioural Modeling Approach. In Proceedings of the Eighth ACM International Conference on Web Search and Data Mining, (2015): 97-106.

Jandail, R. R. S., P. Sharma and C. Agrawal (2014). A Survey on Sentiment Analysis and Opinion Mining: A need for an Organization and Requirement of a customer. In International Conference on Trends in Mechanical, Aeronautical, Computer, Civil, Electrical and Electronics Engineering, India, (2016):17-24.

Feldman, R. (2013). Techniques and applications for sentiment analysis. Communications of the ACM, 56: 82-89.

Mir, J. and M. Usman (2015). An effective model for aspect based opinion mining for social reviews. In Tenth International Conference on Digital Information Management (ICDIM), South Korea, (2015): 49-56.

Bagheri, A., M. Saraee and F. d. Jong (2013). Sentiment classification in Persian: Introducing a mutual information-based. In 21st Iranian Conference on Electrical Engineering (ICEE), Iran, (2013):1-6.

Le, Q. and T. Mikolov (2014). Distributed Representations of Sentences and Documents. In Proceedings of the 31 st International Conference on Machine Learning, China, (2014): 1-9.

Socher, R., J. Pennington, E. H. Huang, A. Y. Ng and C. D. Manning (2011). Semi-supervised recursive auto encoders for predicting sentiment distributions. In EMNLP '11 Proceedings of the Conference on Empirical Methods in Natural Language Processing, United Kingdom, (2011):151-161.

Lecun, Y., L. Bottou, Y. Bengio and P. Haffner (1998). Gradient-based learning applied to document recognition. In Proceedings of the IEEE, 86: 2278 – 2324.

Mikolov, T., M. Karafi´at, L. Burget, J. H. Cernocky and S. Khudanpur (2010). Recurrent neural network based language model. In INTERSPEECH, Japan, (2010): 1045-1048.

Hochreiter, H. and J. Schmidhuber (1997). LONG SHORT-TERM MEMORY. Neural computation, 9(8): 1735-1780.

Pang, B., L. Lee and S. Vaithyanathan (2002). Thumbs up? Sentiment Classification using Machine Learning Techniques. In Conference on Empirical Methods in Natural Language Processing (EMNLP), Philadelphia, (2002): 79-86.

Wang, S. and C. D. Manning (2012). Baselines and Bigrams: Simple, Good Sentiment and Topic Classification. In Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics, Republic of Korea, (2012): 90-94.

Maas, A. L., R. E. Daly, P. T. Pham, D. Huang, A. Y. Ng and C. Potts (2011). Learning Word Vectors for Sentiment Analysis. In Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, Portland, Oregon, (2011): 1-9.

Socher, R., A. Perelygin, J. Y. Wu, J. Chuang, C. D. Mannin, A. Y. Ng and C. Potts (2013). Recursive Deep Models for Semantic Compositionality Over a Sentiment Treebank. In Proceeding of the conference on empirical methods in natural language processing (EMNLP), (2013): 1631–1642.

Socher, R., B. Huval, C. D. Manning and A. Y. Ng (2013). Semantic compositionality through recursive matrix-vector spaces. In Proceedings of the 2012 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning, Korea, (2013): 1201–1211.

Bagheri, A., M. Saraee and d. J. Franciska (2013). Sentiment classification in Persian: Introducing a mutual information-based method for feature selection. In 21st Iranian Conference on Electrical Engineering (ICEE), Mashhad, (2013): 1-6.

Alimardani, S. and A. Aghaei (2015). Opinion Mining in Persian Language Using Supervised Algorithms. Journal of Information Systems and Telecommunication, 3:135-141.

Hajmohammadi, M. S. and R. Ibrahim (2013). A SVM-Based Method for Sentiment Analysis in Persian Language. In International Conference on Graphic and Image Processing, Singapore. DOI: https://doi.org/10.1117/12.2010940

Basiri, M. E. and A. Kabiri (2017). Sentence-Level Sentiment Analysis in Persian. In 3rd International Conference on Pattern Recognition and Image Analysis (IPRIA 2017), Iran, (2017): 84-89.

Roshanfekr, B., S. Khadivi and M. Rahmati (2017). Sentiment analysis using Deep learning on Persian Texts. In 25th Iranian Conference on Electrical Engineering (ICEE2017), Iran, (2017):1503-1508.

Kim, Y. (2014). Convolutional neural networks for sentence classification. In Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP), Qatar, (2014): 1746–1751.

Mikolov, T., K. Chen, G. Corrado and J. Dean (2013). Efficient estimation of word representation in vector space, In arXiv preprint arXiv.

Chung, J., C. Gulcehre, K. Cho and Y. Bengio (2014). Empirical evaluation of gated recurrent neural networks on sequence modeling. In arXiv preprint arXiv.