Public Perceptions of HPV Vaccination Through Transformer-Based Social Media Sentiment Analysis
Abstract
Public perception plays a crucial role in the success of vaccination programs, particularly for the human papillomavirus (HPV) vaccine, which aims to prevent cervical cancer. Although vaccination initiatives are expanding, opinions expressed online may influence the acceptance and effectiveness of these programs. This study examines public sentiment toward the HPV vaccine by analyzing discussions on a widely used social media platform. A data mining framework guided the analytical process, which comprised data collection, preprocessing, sentiment classification, and thematic exploration. Transformer-based language models were used to classify the sentiment expressed in social media posts, and topic modeling was then applied to identify the key issues users discussed. The findings show that public discourse is largely supportive of vaccination, reflecting growing awareness of its role in cervical cancer prevention. Nevertheless, concerns about vaccine cost, accessibility, and post-vaccination experiences continue to emerge in online discussions. These results highlight the importance of integrating digital discourse analysis into public health communication strategies to better understand societal perspectives and improve the effectiveness of vaccination programs.
Copyright (c) 2026 Desi Elfrida Silaban, Tuga Mauritsius

This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.