SENTIMENT ANALYSIS OF TEXTS USING RECURRENT NEURAL NETWORKS OF THE TRANSFORMER ARCHITECTURE
Main Article Content
Abstract
This study focuses on the automated process of determining sentiment (emotional coloring) in text messages from Telegram channels to enhance Ukraine’s information security. The principal challenge addressed lies in the need for rapid and accurate detection of negative, positive, or neutral messages within large-scale data streams without additional fine-tuning on local datasets. The essence of the results obtained is the implementation of a zero-shot classification approach, based on the multilingual transformer model XLM-RoBERTa, which in the experiment yielded the following metrics: Accuracy = 0.4718, Precision = 0.7138, Recall = 0.4718, and F1 Score = 0.5044. Owing to the model’s strong ability to generalize lexico-semantic patterns, a stable compromise between Precision and Recall was achieved, thereby increasing the efficiency of message analysis in large data volumes. These results are explained by the architectural features of XLM-RoBERTa, primarily its multilingual nature and deep layer structure, which enable proper handling of multilingual texts without dedicated local training. Conclusions. The proposed approach is advisable when there is a large, diverse corpus of data requiring prompt detection of potential negative informational influences and timely counteraction. Practically, this significantly reduces the time spent on manual monitoring of the information space and eases the burden on analysts, thereby strengthening the ability of organizations or information security units to respond rapidly to destructive content. The research results can also be integrated into decision support systems, serving as a foundation for the development of software solutions aimed at monitoring the information space.
Article Details
References
Anusha, M. and Leelavathi, R. (2021), “Analysis on sentiment analytics using deep learning techniques”, 2021 fifth international conference on I-SMAC (iot in social, mobile, analytics and cloud) (I-SMAC), Palladam, India, 11–13 November 2021, doi: https://doi.org/10.1109/i-smac52330.2021.9640790
Feisheng, L. (2024), “Systematic review of sentiment analysis: insights through CNN-LSTM networks”, 2024 5th international conference on industrial engineering and artificial intelligence (IEAI), Bangkok, Thailand, 24–26 April 2024, pp. 102–109, doi: https://doi.org/10.1109/ieai62569.2024.00026
Kumar, A., Gupta, N. and Shrivastava, A. (2024), “A critical review on sentiment analysis based on deep learning techniques”, International journal for multidisciplinary research, vol. 6, no. 5, doi: https://doi.org/10.36948/ijfmr.2024.v06i05.28572
Petrovska, I., Kuchuk, H. and Mozhaiev, M. (2022), “Features of the distribution of computing resources in cloud systems”, 2022 IEEE 4th KhPI Week on Advanced Technology, KhPI Week 2022 – Conference Proceedings, 03-07 October 2022, Code 183771, doi: https://doi.org/10.1109/KhPIWeek57572.2022.9916459
Ravichandran, B. D. amd Keikhosrokiani, P. (2022), “Classification of Covid-19 misinformation on social media based on neuro-fuzzy and neural network: a systematic review”, Neural computing and applications, doi:
https://doi.org/10.1007/s00521-022-07797-y
Sharma, P. (2024), “Convolutional neural networks for sentiment analysis”, International journal for research in applied science and engineering technology, vol. 12, no. 11, pp. 465–469, doi: https://doi.org/10.22214/ijraset.2024.65091
Alharbi, A. S. M. and de Doncker, E. (2019), “Twitter sentiment analysis with a deep neural network: an enhanced approach using user behavioral information”, Cognitive systems research, vol. 54, pp. 50–61, doi:
https://doi.org/10.1016/j.cogsys.2018.10.001
Xuanyuan, M., Xiao, L. and Duan, M. (2021), “Sentiment classification algorithm based on multi-modal social media text information”, IEEE access, vol. 9, pp. 33410–33418, doi: https://doi.org/10.1109/access.2021.3061450
Luo, S., Gu, Y., Yao, X. and Fan, W. (2021), “Research on text sentiment analysis based on neural network and ensemble learning”, Revue d'Intelligence Artificielle, vol. 35, no. 1, ppP. 63–70, doi: https://doi.org/10.18280/ria.350107
Kande, J. (2024), “Twitter sentiment analysis with LSTM neural networks”, REST journal on data analytics and artificial intelligence, vol. 3, no. 3, pp. 92–98, doi: https://doi.org/10.46632/jdaai/3/3/11
Petrovska, I., Kuchuk, H., Kuchuk, N., Mozhaiev, O., Pochebut, M. and Onishchenko, Yu. (2023), “Sequential Series-Based Prediction Model in Adaptive Cloud Resource Allocation for Data Processing and Security”, 2023 13th International Conference on Dependable Systems, Services and Technologies, DESSERT 2023, 13–15 October, Athens, Greece, code 197136, doi: https://doi.org/10.1109/DESSERT61349.2023.10416496
Stojanovski, D., Strezoski, G., Madjarov, G. and Dimitrovski, I. (2015), “Twitter sentiment analysis using deep convolutional neural network”, Lecture notes in computer science, Cham, pp. 726–737, doi: https://doi.org/10.1007/978-3-319-19644-2_60
Gao, Z., Li, Z., Luo, J. and Li, X. (2022), “Short text aspect-based sentiment analysis based on CNN + BiGRU”, Applied sciences, vol. 12, no. 5, 2707, doi: https://doi.org/10.3390/app12052707
Wang, X. (2024), “Analysis of neural networks in text sentiment”, Applied and computational engineering, vol. 76, no. 1, pp. 240–244, doi: https://doi.org/10.54254/2755-2721/76/20240599
Chen, C., Zhuo, R. and Ren, J. (2019), “Gated recurrent neural network with sentimental relations for sentiment classification”, Information sciences, vol. 502, pp. 268–278, doi: https://doi.org/10.1016/j.ins.2019.06.050
Chen, B., Xu, Z., Wang, X., Xu, L. and Zhang, W. (2020), “Capsule network-based text sentiment classification”, IFAC-PapersOnLine, vol. 53, no. 5, pp. 698–703, doi: https://doi.org/10.1016/j.ifacol.2021.04.160
Liu, G. and Guo, J. (2019), “Bidirectional LSTM with attention mechanism and convolutional layer for text classification”, Neurocomputing, vol. 337, pp. 325–338, doi: https://doi.org/10.1016/j.neucom.2019.01.078[
Nistor, S. C., Moca, M., Moldovan, D., Oprean, D. B. and Nistor, R. L. (2021), “Building a twitter sentiment analysis system with recurrent neural networks”, Sensors, vol. 21, no. 7, 2266, doi: https://doi.org/10.3390/s21072266
Meena, Y. R., Kumar, S. A. and Kharbas V. K. (2024), “Applying deep neural networks to sentiment analysis in social media”, 2024 3rd international conference for innovation in technology (INOCON), Bangalore, India, 1–3 March 2024, doi: https://doi.org/10.1109/inocon60754.2024.10511816
Perikos, I. and Diamantopoulos, A. (2024), “Explainable aspect-based sentiment analysis using transformer models”, Big data and cognitive computing, vol. 8, no. 11, 141, doi: https://doi.org/10.3390/bdcc8110141
Prytula, M. (2024), “Fine-tuning BERT, DistilBERT, XLM-RoBERTa and Ukr-RoBERTa models for sentiment analysis of Ukrainian language reviews”, Artificial intelligence, vol. 29(2), pp. 85–97, doi: https://doi.org/10.15407/jai2024.02.085
Ji, C. and Wu, H. (2020), “Cascade architecture with rhetoric long short-term memory for complex sentence sentiment analysis”, Neurocomputing, vol. 405, pp. 161–172, doi: https://doi.org/10.1016/j.neucom.2020.04.055
Khan, M. R., Rizvi, S. Z., Yasin, A. and Ali, M. (2021), “Depression analysis of social media activists using the gated architecture bi-lstm”, 2021 international conference on cyber warfare and security (ICCWS), Islamabad, Pakistan, 23–25 November 2021, doi: https://doi.org/10.1109/iccws53234.2021.9703014
Liu, S. and Lee, I. (2021), “Sequence encoding incorporated CNN model for Email document sentiment classification”, Applied soft computing, vol. 102, 107104, doi: https://doi.org/10.1016/j.asoc.2021.107104
Mozhaev, O., Kuchuk, H., Kuchuk, N., Mykhailo, M. and Lohvynenko, M. (2017), “Multiservice network security metric”, 2nd International Conference on Advanced Information and Communication Technologies, AICT 2017 – Proceedings, pp. 133–136, doi: https://doi.org/10.1109/AIACT.2017.8020083
İbrahimov B.G., Hasanov A.H. and Hashimov E.G. (2024), “Research and analysis of efficiency indicators of critical infrastructures in the communication system”, Advanced Information Systems, vol. 8, no. 2, pp. 58–64, doi: https://doi.org/10.20998/2522-9052.2024.2.07
Kovalenko, A., Kuchuk, H., Radchenko, V. and Poroshenko, A. (2020), “Predicting of Data Center Cluster Traffic”, 2020 IEEE International Conference on Problems of Infocommunications Science and Technology, PIC S and T 2020 – Proceedings, pp. 437–441, 9468006, doi: https://doi.org/10.1109/PICST51311.2020.9468006
Peng B., Wang J., Zhang X. Adversarial learning of sentiment word representations for sentiment analysis. Information sciences. 2020. Vol. 541. P. 426–441. https://doi.org/10.1016/j.ins.2020.06.044.
Dorkin, A. and Sirts, K. (2024), “TartuNLP @ SIGTYP 2024 Shared Task: Adapting XLM-RoBERTa for Ancient and Historical Languages”, arXiv:2404.12845, 11 p., doi: https://doi.org/10.48550/arXiv.2404.12845