SENTIMENT ANALYSIS FOR SOCIAL MEDIA: A SURVEY
Keywords:sentiment analysis, sentiment classification, types of sentiment analysis, challenges in sentiment analysis
With the rapid development of the Internet industry, an increasing number of social media platforms have been developed. These social media platforms have become the main channels for communication among most users. Opinions from social media platforms provide the most updated and inclusive information. Sentiments from opinions are a valuable data source for solving many issues. Therefore, sentiment analysis has developed into one of the most popular natural language processing fields. Hence, improving the performance of sentiment analysis methods or discovering new problems related to these methods is essential. In this context, we must be aware of the general information relevant to this area. This survey presents a summary of the necessary stages for building a complete model to be used in sentiment analysis. For each procedure, we list the popular techniques that have been widely used in recent years. In addition, discussions and comparisons related to these methods are provided. Additionally, we discuss the challenges and possible research directions for future research in this field.
A. F. Agarap and P. Grafilon, “Statistical analysis on e-commerce reviews, with sentiment classification using bidirectional recurrent neural network (rnn),” arXiv preprint arXiv:1805.03687,
M. S. Akhtar, T. Garg, and A. Ekbal, “Multi-task learning for aspect term extraction and aspect sentiment classification,” Neurocomputing, 2020.
N. Al-Twairesh and H. Al-Negheimish, “Surface and deep features ensemble for sentiment analysis of arabic tweets,” IEEE Access, vol. 7, pp. 84 122–84 131, 2019.
O. Araque, I. Corcuera-Platas, J. F. S´anchez-Rada, and C. A. Iglesias, “Enhancing deep learning sentiment analysis with ensemble techniques in social applications,” Expert Systems with
Applications, vol. 77, pp. 236–246, 2017.
K. Atkinson, “Gnu aspell 0.60. 4,” 2006.
S. Baccianella, A. Esuli, and F. Sebastiani, “Sentiwordnet 3.0: an enhanced lexical resource for sentiment analysis and opinion mining.” in Lrec, vol. 10, no. 2010, 2010, pp. 2200–2204.
P. Bojanowski, E. Grave, A. Joulin, and T. Mikolov, “Enriching word vectors with subword information,” Transactions of the Association for Computational Linguistics, vol. 5, pp. 135–
D. Bollegala, T. Mu, and J. Y. Goulermas, “Cross-domain sentiment classification using sentiment sensitive embeddings,” IEEE Transactions on Knowledge and Data Engineering, vol. 28,
no. 2, pp. 398–410, 2015.
D. Bollegala, D. Weir, and J. Carroll, “Cross-domain sentiment classification using a sentiment sensitive thesaurus,” IEEE transactions on knowledge and data engineering, vol. 25, no. 8, pp. 1719–1731, 2012.
E. Cambria and A. Hussain, “Sentic computing,” Cognitive Computation, vol. 7, no. 2, pp. 183–185, 2015.
E. Cambria, D. Olsher, and D. Rajagopal, “Senticnet 3: a common and common-sense knowledge base for cognition-driven sentiment analysis,” in Proceedings of the twenty-eighth AAAI
conference on artificial intelligence, 2014, pp. 1515–1521.
K. Chakraborty, S. Bhattacharyya, and R. Bag, “A survey of sentiment analysis from social media data,” IEEE Transactions on Computational Social Systems, vol. 7, no. 2, pp. 450–464, 2020.
L. Derczynski, A. Ritter, S. Clark, and K. Bontcheva, “Twitter part-of-speech tagging for all: Overcoming sparse and noisy data,” in Proceedings of the International Conference Recent Advances in Natural Language Processing RANLP 2013, 2013, pp. 198–206.
Y. Dong, Y. Fu, L. Wang, Y. Chen, Y. Dong, and J. Li, “A sentiment analysis method of capsule
network based on bilstm,” IEEE Access, vol. 8, pp. 37 014–37 020, 2020.
Y. Du, M. He, L. Wang, and H. Zhang, “Wasserstein based transfer network for cross-domain sentiment classification,” Knowledge-Based Systems, vol. 204, p. 106162, 2020.
M. K. Elhadad, K. F. Li, and F. Gebali, “Detecting misleading information on covid-19,” IEEE Access, vol. 8, pp. 165 201–165 215, 2020.
H. Fei and P. Li, “Cross-lingual unsupervised sentiment classification with multi-view transfer learning,” in Proceedings of the 58th Annual Meeting of the Association for Computational
Linguistics, 2020, pp. 5759–5771.
M. Ganapathibhotla and B. Liu, “Mining opinions in comparative sentences,” in Proceedings of the 22nd International Conference on Computational Linguistics (Coling 2008), 2008, pp.
M. Ghorbani, M. Bahaghighat, Q. Xin, and F. Ozen, “Convlstmconv network: a deep learning ¨approach for sentiment analysis in cloud computing,” Journal of Cloud Computing, vol. 9, no. 1, pp. 1–12, 2020.
M. Giatsoglou, M. G. Vozalis, K. Diamantaras, A. Vakali, G. Sarigiannidis, and K. C. Chatzisavvas, “Sentiment analysis leveraging emotions and word embeddings,” Expert Systems with Applications, vol. 69, pp. 214–224, 2017.
H. Gong, S. Bhat, and P. Viswanath, “Enriching word embeddings with temporal and spatial information,” arXiv preprint arXiv:2010.00761, 2020.
P. Grover, A. K. Kar, Y. K. Dwivedi, and M. Janssen, “Polarization and acculturation in us election 2016 outcomes–can twitter analytics predict changes in voting preferences,” Technological Forecasting and Social Change, vol. 145, pp. 438–460, 2019.
S. Guo and N. Yao, “Polyseme-aware vector representation for text classification,” IEEE Access, vol. 8, pp. 135 686–135 699, 2020.
I. Gupta and N. Joshi, “Enhanced twitter sentiment analysis using hybrid approach and by accounting local contextual semantic,” Journal of Intelligent Systems, vol. 29, no. 1, pp. 1611–
Y. Han, M. Liu, and W. Jing, “Aspect-level drug reviews sentiment analysis based on double bigru and knowledge transfer,” IEEE Access, vol. 8, pp. 21 314–21 325, 2020.
S.-U. Hassan, N. R. Aljohani, N. Idrees, R. Sarwar, R. Nawaz, E. Mart´ınez-C´amara, S. Ventura, and F. Herrera, “Predicting literature’s early impact with sentiment analysis in twitter,”
Knowledge-Based Systems, vol. 192, p. 105383, 2020.
A. Hatefi Ghahfarrokhi and M. Shamsfard, “Tehran stock exchange prediction using sentiment analysis of online textual opinions,” Intelligent Systems in Accounting, Finance and Management, vol. 27, no. 1, pp. 22–37, 2020.
F. Hemmatian and M. K. Sohrabi, “A survey on classification techniques for opinion mining and sentiment analysis,” Artificial Intelligence Review, pp. 1–51, 2019.
A. Ishaq, S. Asghar, and S. A. Gillani, “Aspect-based sentiment analysis using a hybridized approach based on cnn and ga,” IEEE Access, vol. 8, pp. 135 499–135 512, 2020.
A. Jain, S. Tripathi, H. DharDwivedi, and P. Saxena, “Forecasting price of cryptocurrencies using tweets sentiment analysis,” in 2018 Eleventh International Conference on Contemporary Computing (IC3). IEEE, 2018, pp. 1–7.
Z. Jianqiang, G. Xiaolin, and Z. Xuejun, “Deep convolution neural networks for twitter sentiment analysis,” IEEE Access, vol. 6, pp. 23 253–23 260, 2018.
M. Y. Kabir and S. Madria, “A deep learning approach for tweet classification and rescue scheduling for effective disaster management,” in Proceedings of the 27th ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems, 2019, pp. 269–278.
L. Kong, N. Schneider, S. Swayamdipta, A. Bhatia, C. Dyer, and N. A. Smith, “A dependency parser for tweets,” in Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP), 2014, pp. 1001–1012.
A. Kumar, V. T. Narapareddy, V. A. Srikanth, L. B. M. Neti, and A. Malapati, “Aspect-based sentiment classification using interactive gated convolutional network,” IEEE Access, vol. 8, pp.
445–22 453, 2020.
F. M. Kundi, S. Ahmad, A. Khan, and M. Z. Asghar, “Detection and scoring of internet slangs for sentiment analysis using sentiwordnet,” Life Science Journal, vol. 11, no. 9, pp. 66–72, 2014.
Z. Li, Y. Wei, Y. Zhang, X. Zhang, and X. Li, “Exploiting coarse-to-fine task transfer for aspect level sentiment classification,” in Proceedings of the AAAI Conference on Artificial Intelligence, vol. 33, 2019, pp. 4253–4260.
B. Liu, “Sentiment analysis and opinion mining,” Synthesis lectures on human language technologies, vol. 5, no. 1, pp. 1–167, 2012.
K. McManus, E. K. Mallory, R. L. Goldfeder, W. A. Haynes, and J. D. Tatum, “Mining twitter data to improve detection of schizophrenia,” AMIA Summits on Translational Science Proceedings, vol. 2015, p. 122, 2015.
G. A. Miller, “Wordnet: a lexical database for english,” Communications of the ACM, vol. 38, no. 11, pp. 39–41, 1995.
M. E. Mowlaei, M. S. Abadeh, and H. Keshavarz, “Aspect-based sentiment analysis using adaptive aspect-based lexicons,” Expert Systems with Applications, vol. 148, p. 113234, 2020.
R. Narayanan, B. Liu, and A. Choudhary, “Sentiment analysis of conditional sentences,” in Proceedings of the 2009 conference on empirical methods in natural language processing, 2009,
H.-T. Nguyen and L.-M. Nguyen, “Ilwaanet: An interactive lexicon-aware word-aspect attention network for aspect-level sentiment classification on social networking,” Expert Systems with
Applications, vol. 146, p. 113065, 2020.
J. Ni, J. Li, and J. McAuley, “Justifying recommendations using distantly-labeled reviews and fine-grained aspects,” in Proceedings of the 2019 Conference on Empirical Methods in Natural
Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP), 2019, pp. 188–197.
B. O’Connor, M. Krieger, and D. Ahn, “Tweetmotif: exploratory search and topic summarization for twitter.” in ICWSM, 2010, pp. 384–385.
B. O’dea, S. Wan, P. J. Batterham, A. L. Calear, C. Paris, and H. Christensen, “Detecting suicidality on twitter,” Internet Interventions, vol. 2, no. 2, pp. 183–188, 2015.
O. Owoputi, B. O’Connor, C. Dyer, K. Gimpel, N. Schneider, and N. A. Smith, “Improved part-of-speech tagging for online conversational text with word clusters,” in Proceedings of the
conference of the North American chapter of the association for computational linguistics: human language technologies, 2013, pp. 380–390.
C. D. Paice, “Another stemmer,” in ACM Sigir Forum, vol. 24, no. 3. ACM New York, NY, USA, 1990, pp. 56–61.
H. T. Phan, N. T. Nguyen, D. Hwang et al., “A new approach for predicting an important user on a topic on twitter,” in 2020 International Conference on INnovations in Intelligent SysTems
and Applications (INISTA). IEEE, 2020, pp. 1–6.
H. T. Phan, N. T. Nguyen, V. C. Tran, and D. Hwang, “A method for detecting and analyzing the sentiment of tweets containing conditional sentences,” in Asian Conference on Intelligent Information and Database Systems. Springer, 2019, pp. 177–188.
——, “A sentiment analysis method of objects by integrating sentiments from tweets,” Journal of Intelligent & Fuzzy Systems, vol. 37, no. 6, pp. 7251–7263, 2019.
——, “An approach for a decision-making support system based on measuring the user satisfaction level on twitter,” Information Sciences, 2021. [Online]. Available:
H. T. Phan, N. T. Nguyen, T. Van Cuong, and D. Hwang, “A method for detecting and analyzing the sentiment of tweets containing fuzzy sentiment phrases,” in 2019 IEEE International
Symposium on INnovations in Intelligent SysTems and Applications (INISTA). IEEE, 2019,pp. 1–6.
H. T. Phan, V. C. Tran, N. T. Nguyen, and D. Hwang, “Decision-making support method based on sentiment analysis of objects and binary decision tree mining,” in International Conference
on Industrial, Engineering and Other Applications of Applied Intelligent Systems. Springer, 2019, pp. 753–767.
——, “Detecting the degree of risk in online market based on satisfaction of twitter users,” in Computational Collective Intelligence - 12th International Conference, ICCCI 2020, Da Nang, Vietnam, November 30 - December 3, 2020, Proceedings, ser. Lecture Notes in Computer Science, N. T. Nguyen, B. H. Hoang, C. P. Huynh, D. Hwang, B. Trawinski, and G. Vossen, Eds., vol. 12496. Springer, 2020, pp. 58–70. [Online]. Available:
——, “A framework for detecting user’s psychological tendencies on twitter based on tweets sentiment analysis,” in International Conference on Industrial, Engineering and Other Applications of Applied Intelligent Systems. Springer, 2020, pp. 357–372.
——, “Improving the performance of sentiment analysis of tweets containing fuzzy sentiment using the feature ensemble model,” IEEE Access, vol. 8, pp. 14 630–14 641, 2020.
M. F. Porter, “Snowball: A language for stemming algorithms,” 2001.
N. Pr¨ollochs, S. Feuerriegel, B. Lutz, and D. Neumann, “Negation scope detection for sentiment analysis: A reinforcement learning framework for replicating human interpretations,” Information Sciences, 2020.
C. M. Pulido, B. Villarejo-Carballido, G. Redondo-Sama, and A. G´omez, “Covid-19 infodemic: More retweets for science-based information on coronavirus than for false information,” International Sociology, p. 0268580920914755, 2020.
K. Ravi and V. Ravi, “A survey on opinion mining and sentiment analysis: tasks, approaches and applications,” Knowledge-based systems, vol. 89, pp. 14–46, 2015.
H. Rehioui and A. Idrissi, “New clustering algorithms for twitter sentiment analysis,” IEEE Systems Journal, vol. 14, no. 1, pp. 530–537, 2020.
A. U. Rehman, A. K. Malik, B. Raza, and W. Ali, “A hybrid cnn-lstm model for improving accuracy of movie reviews sentiment analysis,” Multimedia Tools and Applications, vol. 78, no. 18,
pp. 26 597–26 613, 2019.
S. M. Rezaeinia, R. Rahmani, A. Ghodsi, and H. Veisi, “Sentiment analysis based on improved pre-trained word embeddings,” Expert Systems with Applications, vol. 117, pp. 139–147, 2019.
J. Samuel, G. Ali, M. Rahman, E. Esawi, Y. Samuel et al., “Covid-19 public sentiment insights and machine learning for tweets classification,” Information, vol. 11, no. 6, p. 314, 2020.
A. Sarker, S. Lakamana, W. Hogg-Bremer, A. Xie, M. A. Al-Garadi, and Y.-C. Yang, “Selfreported covid-19 symptoms on twitter: An analysis and a research resource,” medRxiv, 2020.
K. Shuang, Q. Yang, J. Loo, R. Li, and M. Gu, “Feature distillation network for aspect-based sentiment analysis,” Information Fusion, 2020.
C. Strapparava, A. Valitutti et al., “Wordnet affect: an affective extension of wordnet.” in Lrec, vol. 4, no. 1083-1086. Citeseer, 2004, p. 40.
L. Svoboda and T. Brychcın, “Improving word meaning representations using wikipedia categories,” Neural Network World, vol. 523, p. 534, 2018.
L. Svoboda and T. Brychc´ın, “Enriching word embeddings with global information and testing on highly inflected language,” Computaci´on y Sistemas, vol. 23, no. 3, 2019.
M. Taboada, J. Brooke, M. Tofiloski, K. Voll, and M. Stede, “Lexicon-based methods for sentiment analysis,” Computational linguistics, vol. 37, no. 2, pp. 267–307, 2011.
S. Tedmori and A. Awajan, “Sentiment analysis main tasks and applications: A survey.” JIPS, vol. 15, no. 3, pp. 500–519, 2019.
M. Thelwall and K. Buckley, “Topic-based sentiment analysis for the social web: The role of mood and issue-related words,” Journal of the American Society for Information Science and
Technology, vol. 64, no. 8, pp. 1608–1617, 2013.
M. Thelwall, K. Buckley, and G. Paltoglou, “Sentiment strength detection for the social web,” Journal of the American Society for Information Science and Technology, vol. 63, no. 1, pp.
K. Toutanova, D. Klein, C. D. Manning, and Y. Singer, “Feature-rich part-of-speech tagging with a cyclic dependency network,” in Proceedings of the 2003 Human Language Technology
Conference of the North American Chapter of the Association for Computational Linguistics, 2003, pp. 252–259.
C. Wan, Y. Peng, K. Xiao, X. Liu, T. Jiang, and D. Liu, “An association-constrained lda model for joint extraction of product aspects and opinions,” Information Sciences, vol. 519, pp. 243–
T. Wang, K. Lu, K. P. Chow, and Q. Zhu, “Covid-19 sensing: Negative sentiment analysis on social media in china via bert model,” Ieee Access, vol. 8, pp. 138 162–138 169, 2020.
W. Wang, S. J. Pan, D. Dahlmeier, and X. Xiao, “Recursive neural conditional random fields for aspect-based sentiment analysis,” arXiv preprint arXiv:1603.06679, 2016.
J. Wei, J. Liao, Z. Yang, S. Wang, and Q. Zhao, “Bilstm with multi-polarity orthogonal attention for implicit sentiment analysis,” Neurocomputing, vol. 383, pp. 165–173, 2020.
J. Wiebe, T. Wilson, and C. Cardie, “Annotating expressions of opinions and emotions in language,” Language resources and evaluation, vol. 39, no. 2-3, pp. 165–210, 2005.
Z. Wu, Y. Li, J. Liao, D. Li, X. Li, and S. Wang, “Aspect-context interactive attention representation for aspect-level sentiment classification,” IEEE Access, vol. 8, pp. 29 238–29 248,
W. Yang, L. Mu, and Y. Shen, “Effect of climate and seasonality on depressed mood among twitter users,” Applied Geography, vol. 63, pp. 184–191, 2015.
H. Ye, Z. Yan, Z. Luo, and W. Chao, “Dependency-tree based convolutional neural networks for aspect term extraction,” in Pacific-Asia Conference on Knowledge Discovery and Data Mining.
Springer, 2017, pp. 350–362.
Z. Ye, F. Li, and T. Baldwin, “Encoding sentiment information into word vectors for sentiment analysis,” in Proceedings of the 27th International Conference on Computational Linguistics,
, pp. 997–1007.
Y. Yin, F. Wei, L. Dong, K. Xu, M. Zhang, and M. Zhou, “Unsupervised word and dependency path embeddings for aspect term extraction,” arXiv preprint arXiv:1605.07843, 2016.
L. Yue, W. Chen, X. Li, W. Zuo, and M. Yin, “A survey of sentiment analysis in social media,” Knowledge and Information Systems, pp. 1–47, 2019.
N. Yussupova, M. Boyko, D. Bogdanova, and A. Hilbert, “A decision support approach based on sentiment analysis combined with data mining for customer satisfaction research,” Int. J. Adv.
Intell. Syst, vol. 8, pp. 145–158, 2015.
C. Zhao, S. Wang, and D. Li, “Multi-source domain adaptation with joint learning for crossdomain sentiment classification,” Knowledge-Based Systems, vol. 191, p. 105254, 2020.
License1. We hereby assign copyright of our article (the Work) in all forms of media, whether now known or hereafter developed, to the Journal of Computer Science and Cybernetics. We understand that the Journal of Computer Science and Cybernetics will act on my/our behalf to publish, reproduce, distribute and transmit the Work.
2. This assignment of copyright to the Journal of Computer Science and Cybernetics is done so on the understanding that permission from the Journal of Computer Science and Cybernetics is not required for me/us to reproduce, republish or distribute copies of the Work in whole or in part. We will ensure that all such copies carry a notice of copyright ownership and reference to the original journal publication.
3. We warrant that the Work is our results and has not been published before in its current or a substantially similar form and is not under consideration for another publication, does not contain any unlawful statements and does not infringe any existing copyright.
4. We also warrant that We have obtained the necessary permission from the copyright holder/s to reproduce in the article any materials including tables, diagrams or photographs not owned by me/us.