Sentiment Analysis using Improved Novel Convolutional Neural Network (SNCNN)

Authors

  • M. Kalaiarasu Sri Ramakrishna Engineering College, Coimbatore, India
  • C. Ranjeeth Kumar Sri Ramakrishna Engineering College, Coimbatore, India

DOI:

https://doi.org/10.15837/ijccc.2022.2.4351

Keywords:

Sentiment Analysis (SA), Improved Novel Convolutional Neural Network (INCNN), TF-IDF is Term Frequency-Inverse Document Frequency, SVM is Support Vector Machine, Information Gain (IG), Pearson’s Correlation Coefficient (PCC).

Abstract

Sentiment Analysis is an important method in which many researchers are working on the automated approach for extraction and analysis of huge volumes of user achieved data, which are accessible on social networking websites. This approach helps in analyzing the direct falls under the domain of SA. SA comprises the vast field of effective classification of user-initiated text under defined polarities. The proposed work includes four major steps for solving these issues: the first step is preprocessing which holds tokenization, stop word removal, stemming, cleaning up of unwanted text information like removing of Ads from Web pages, Text normalization for converting binary format. Secondly, the Feature extraction is based on the Bag words, Word2Vec and TF-ID which is a Term Frequency-Inverse Document Frequency. Thirdly, this feature selection includes the procedure for examining semantic gaps along with source features using teaching models and this involves target task characteristic application for Improved Novel Convolutional Neural Network (INCNN). The Feature Selection accompanies the procedure of Information Gain (IG) and PCC which is a Pearson Correlation Coefficient. Finally, the classification step INCNN gives out sentiment posts and responses for the user-based post aspects which helps in enhancing the system performance. The experimental outcome proposes the INCNN algorithm and provides higher performance rather than the existing approach. The proposed INCNN classifier results in highest accuracy.

References

[1] R. Feldman, 2013. "Techniques and applications for sentiment analysis", Communications of the ACM, Vol.56, No.4, Pp.82-89. https://doi.org/10.1145/2436256.2436274

[2] M. Ghiassi, J. Skinner and D. Zimbra, 2013. "Twitter brand sentiment analysis: A hybrid system using n-gram analysis and dynamic artificial neural network", Expert Systems with applications, Vol.40, No.16, Pp.6266-6282. https://doi.org/10.1016/j.eswa.2013.05.057

[3] N. Zainuddin, A. Selamat and R. Ibrahim, 2016. "Improving twitter aspect-based sentiment analysis using hybrid approach", Asian conference on intelligent information and database systems, Pp.151-160. https://doi.org/10.1007/978-3-662-49381-6_15

[4] D. Antenucci, M. Cafarella, M. Levenstein, C. Ré and M.D. Shapiro, 2014. "Using social media to measure labor market flows", National Bureau of Economic Research, Pp. 1-50. https://doi.org/10.3386/w20010

[5] N. Zainuddin, A. Selamat and R. Ibrahim, 2018. "Hybrid sentiment classification on twitter aspect-based sentiment analysis", Applied Intelligence, Vol.48, No.5, Pp.1218-1232. https://doi.org/10.1007/s10489-017-1098-6

[6] G. Wang, J. Sun, J. Ma, K. Xu and J. Gu, 2014. "Sentiment classification: The contribution of ensemble learning", Decision support systems, Vol.57, Pp.77-93, 2014. https://doi.org/10.1016/j.dss.2013.08.002

[7] O. Kolchyna, T.T. Souza, P. Treleaven and T. Aste, 2015. "Twitter sentiment analysis: Lexicon method, machine learning method and their combination", Computation and Language (cs.CL), Pp.1-32.

[8] N. í–ztürk and S. Ayvaz, 2018. "Sentiment Analysis on Twitter: A Text Mining Approach to the Syrian Refugee Crisis", Telematics and Informatics, Vol.35, No.1, Pp.136-147. https://doi.org/10.1016/j.tele.2017.10.006

[9] K. Philander and Y. Zhong, 2016. "Twitter sentiment analysis: Capturing sentiment from integrated resort tweets", International Journal of Hospitality Management, Vol.55, Pp.16-24. https://doi.org/10.1016/j.ijhm.2016.02.001

[10] A. Hasan, S. Moin, A. Karim and S. Shamshir band, 2018. "Machine learning-based sentiment analysis for twitter accounts", Mathematical and Computational Applications, Vol.23, No.1, Pp.1-15. https://doi.org/10.3390/mca23010011

[11] A. Alnawas and N. Arici, 2019. "Sentiment analysis of Iraqi Arabic dialect on Facebook based on distributed representations of documents", ACM Transactions on Asian and Low-Resource Language Information Processing (TALLIP), Vol.18, No.3, Pp.1-17. https://doi.org/10.1145/3278605

[12] S.A. Salloum, C. Mhamdi, M. Al-Emran and K. Shaalan, 2017. "Analysis and classification of Arabic newspapers' Facebook pages using text mining techniques", International Journal of Information Technology and Language Studies, Vol.1, No.2, Pp.8-17.

[13] K.M. Nahar, A. Jaradat, M.S. Atoum and F. Ibrahim, 2020. "Sentiment analysis and classification of arabjordanian facebook comments for jordanian telecom companies using lexicon-based approach and machine learning", Jordanian Journal of Computers and Information Technology (JJCIT), Vol.6, No.03, Pp.247-263. https://doi.org/10.5455/jjcit.71-1586289399

[14] H. Tran and M. Shcherbakov, 2016. "Detection and prediction of users attitude based on realtime and batch sentiment analysis of facebook comments", International conference on computational social networks, Pp.273-284. https://doi.org/10.1007/978-3-319-42345-6_24

[15] T.H. Soliman, M.A. Elmasry, A. Hedar and M.M. Doss, 2014. "Sentiment analysis of Arabic slang comments on facebook", International Journal of Computers & Technology, Vol.12, No.5, Pp.3470-3478. https://doi.org/10.24297/ijct.v12i5.2917

[16] M. Meire, M. Ballings and D. Van den Poel, 2016. "The added value of auxiliary data in sentiment analysis of Facebook posts", Decision Support Systems, Vol.89, Pp.98-112. https://doi.org/10.1016/j.dss.2016.06.013

[17] A. Ortigosa, J.M. Martí­n and R.M. Carro, 2014. "Sentiment analysis in Facebook and its application to e-learning", Computers in human behavior, Vol.31, Pp.527-541. https://doi.org/10.1016/j.chb.2013.05.024

[18] F. Millstein, 2020."Natural Language Processing With Python: Natural Language Processing Using NLTK", Frank Millstein, Pp.1-116. https://doi.org/10.1007/978-1-4842-4354-1_1

[19] A. Jabbar, S. Iqbal, A. Akhunzada and Q. Abbas, 2018. "An improved Urdu stemming algorithm for text mining based on multi-step hybrid approach", Journal of Experimental & Theoretical Artificial Intelligence, Vol.30, No.5, Pp.703-723. https://doi.org/10.1080/0952813X.2018.1467495

[20] M.A. Fauzi, 2018. "Word2Vec model for sentiment analysis of product reviews in Indonesian language", International Journal of Electrical and Computer Engineering, Vol.9, No.1, Pp.525- 530. https://doi.org/10.11591/ijece.v9i1.pp525-530

[21] A.I. Pratiwi, 2018. "On the feature selection and classification based on information gain for document sentiment analysis", Applied Computational Intelligence and Soft Computing, Pp. 1-5. https://doi.org/10.1155/2018/1407817

[22] S.V. Georgakopoulos, S.K. Tasoulis, A.G. Vrahatis and V.P. Plagianakos.2018., "Convolutional neural networks for toxic comment classification", Proceedings of the 10th Hellenic Conference on Artificial Intelligence, Pp.1-6. https://doi.org/10.1145/3200947.3208069

[23] https://www.kaggle.com/mchirico/cheltenham-s-facebook-group

[24] Ragab et al., (2020). A Novel One-Dimensional CNN with Exponential Adaptive Gradients for Air Pollution Index Prediction, sustainability, 12(23): 10090. https://doi.org/10.3390/su122310090

[25] Nasir et al. (2020). Pearson Correlation-Based Feature Selection for Document Classification Using Balanced Training, sensors, 20(23): 6793. https://doi.org/10.3390/s20236793

[26] Huang, et al. (2019). Signal status recognition based on 1DCNN and its feature extraction mechanism analysis. Sensors, 19(9). https://doi.org/10.3390/s19092018

[27] Meliboev et al. (2020). CNN Based Network Intrusion Detection with Normalization on Imbalanced Data, International Conference on Artificial Intelligence in Information and Communication, 19-21 Feb. 2020, Japan.

[28] Li, et al. (2020). A hybrid CNN-LSTM model for forecasting particulate matter (PM2.5), IEEE Access, 8, 26933-26940. https://doi.org/10.1109/ACCESS.2020.2971348

Additional Files

Published

2022-03-18

Most read articles by the same author(s)

Obs.: This plugin requires at least one statistics/report plugin to be enabled. If your statistics plugins provide more than one metric then please also select a main metric on the admin's site settings page and/or on the journal manager's settings pages.