Mutual Information Approach For Sentiment Analysis Using Deep Machine Learning Convolution Neural Network (CNN) Model
Keywords:
Natural Language Processing ,Machine Learning, Convolutional Neural Network (CNN),Sentiment Analysis, LSTM, Word Processing.Abstract
Sentiment Analysis is a field of Natural Language Processing which addresses the problem of extracting sentiment or, more generally, opinion from text. Obtaining deeper insights on that topic can be very valuable for a range of fields such as finance, marketing, politics and business. Previous research has shown how sentiment and public opinion can affect stock markets, product sales, polls as well as public health. This thesis researches the message sentiment polarity classification problem in Twitter aiming to classify messages based on the polarity of the sentiment towards a specific topic, where the tweets and the topics are always given. The dataset analysed and the evaluation metrics considered are provided conference and the 4th task about "Sentiment Analysis in Twitter". This task includes five subtasks, two of which were eventually engaged in this research according to the implemented approach. First, subtask B is a binary classification task, where the goal is to classify messages into two classes, positive and negative regarding the sentiment towards the topic. Following, subtask C where the target is to classify messages in a five-scale sentiment polarity from highly negative, negative, neutral, positive to highly positive, based on the sentiment towards a given topic. We are implemented and experimented with two deep learning models for both subtasks; a Convolutional Neural Network (CNN) and a state-of-the-art Recurrent Neural Network with context attention CNN LSTM model. We compare both models to the baselines of the challenge and show that the CNN LSTM outperforms the other models, in both subtasks, with all evaluation measures but one.
References
- Al-Rfou, Rami et al. 2020. “Theano: A Python Framework for Fast Computation of Mathematical Expressions.” CoRR abs/1605.0.
- Appel, Orestes, Francisco Chiclana, Jenny Carter, and Hamido Fujita. 2019. “A Hybrid Approach to the Sentiment Analysis Problem at the Sentence Level.” Knowledge-Based Systems 108:110-24.
- Arias, Marta, Argimiro Arratia, and Ramon Xuriguera. 2019. “Forecasting with Twitter Data.”ACM Transactions on Intelligent Systems and Technology 5(1):1-24.
- Asur, Sitaram and Bernardo A. Huberman. 2010. “Predicting the Future with Social Media.” Pp.492-99 in 2019 IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology, WI-IAT ’19. Washington, DC, USA: IEEE Computer Society.
- Baziotis, Christos, Nikos Pelekis, and Christos Doulkeridis. 2017. “DataStories at SemEval-2018 Task 4: Deep LSTM with Attention for Message-Level and Topic-Based Sentiment Analysis.” Proceedings of the 11th International Workshop on Semantic Evaluation (SemEval-2018) (1):747-54.
- Bollen, Johan, Huina Mao, and Xiaojun Zeng. 2018. “Twitter Mood Predicts the Stock Market.”Journal of Computational Science 2(1):1-8.
- Ceron, Andrea, Luigi Curini, and M. Stefano. 2019. “Tweet Your Vote: How Content Analysis of Social Networks Can Improve Our Knowledge of Citizens’ Policy Preferences. An Application to Italy and France.” New Media & Society 16:1-24.
- Chevalier, Judith A. and Dina Mayzlin. 2006. “The Effect of Word of Mouth on Sales: Online Book Reviews.” Journal of Marketing Research 43(3):345-54.
- Collobert, Ronan et al. 2019. “Natural Language Processing (Almost) from Scratch.” CoRRabs/1103.0398.
Downloads
Published
Issue
Section
License
Copyright (c) IJSRSET

This work is licensed under a Creative Commons Attribution 4.0 International License.