Public Perceptions of HPV Vaccination Through Transformer-Based Social Media Sentiment Analysis

  • Desi Elfrida Silaban Universitas Bina Nusantara, Jakarta, Indonesia
  • Tuga Mauritsius Universitas Bina Nusantara, Jakarta, Indonesia
Keywords: Sentiment Analysis, Human Papillomavirus Vaccine, Social Media Discourse, Transformer Model, Public Health Communication

Abstract

Public perception plays a crucial role in determining the success of vaccination programs, particularly for the human papillomavirus vaccine aimed at preventing cervical cancer. Despite the increasing implementation of vaccination initiatives, public opinions expressed in digital environments may influence the acceptance and effectiveness of such programs. This study aims to examine public sentiment toward the human papillomavirus vaccine by analyzing discussions on a social media platform widely used for public communication. A data mining framework was employed to guide the analytical process, including data collection, preprocessing, sentiment classification, and thematic exploration. Transformer-based language models were utilized to classify public sentiment expressed in social media posts, followed by topic modeling to identify key issues discussed by users. The findings reveal that public discourse is largely characterized by supportive attitudes toward vaccination, reflecting a growing awareness of its role in cervical cancer prevention. Nevertheless, several concerns related to vaccine cost, accessibility, and post-vaccination experiences continue to emerge in online discussions. These results highlight the importance of integrating digital discourse analysis into public health communication strategies in order to better understand societal perspectives and improve the effectiveness of vaccination programs.

Author Biographies

Desi Elfrida Silaban, Universitas Bina Nusantara, Jakarta, Indonesia

Information Systems Management Department, BINUS Graduate Program - Master of Information Systems Management

Tuga Mauritsius, Universitas Bina Nusantara, Jakarta, Indonesia

Information Systems Management Department, BINUS Graduate Program - Master of Information Systems Management

References

Abdulhakim Al-Absi, A., Kang, D.-K., & Abdulhakim Al-Absi, M. (2023). Sentiment Analysis and Classification Using Deep Semantic Information and Contextual Knowledge. Computers, Materials & Continua, 74(1), 671–691. https://doi.org/10.32604/cmc.2023.030262

Acuña-Cid, H. A., Ahumada-Tello, E., Ovalle-Osuna, Ó. O., Evans, R., Hernández-Ríos, J. E., & Zambrano-Soto, M. A. (2025). CRISP-NET: Integration of the CRISP-DM Model with Network Analysis. Machine Learning and Knowledge Extraction, 7(3), 101. https://doi.org/10.3390/make7030101

Aljabar, A., Ali, I., & Karomah, B. M. (2024). Sentiment Analysis Using Transformer Method. Journal of Informatics Information System Software Engineering and Applications (INISTA), 6(2), 90–97. https://doi.org/10.20895/inista.v6i2.1383

Arbyn, M., Rousta, P., Bruni, L., Schollin Ask, L., & Basu, P. (2024). Linkage of individual-patient data confirm protection of prophylactic human papillomavirus vaccination against invasive cervical cancer. JNCI: Journal of the National Cancer Institute, 116(6), 775–778. https://doi.org/10.1093/jnci/djae042

Boyd, A., Showalter, S., Mandt, S., & Smyth, P. (2022). Predictive Querying for Autoregressive Neural Sequence Models (Version 3). arXiv. https://doi.org/10.48550/ARXIV.2210.06464

Das, U. K., Ani, R. S., Datta, N., Fahad, I., Sikder, J., Sara, U., & Chakraborty, A. (2025). Enhancing sentiment analysis accuracy on social media comments using a tuned BERT model. Discover Computing, 28(1), 198. https://doi.org/10.1007/s10791-025-09599-x

Deng, Y., Van Der Meer, J., Tzovara, A., Schmidt, M., Bassetti, C., & Denecke, K. (2025). Analyzing Sleep Behavior Using BERT-BiLSTM and Fine-Tuned GPT-2 Sentiment Classification: Comparison Study. JMIR Medical Informatics, 13, e70753–e70753. https://doi.org/10.2196/70753

Dotan, E., Jaschek, G., Pupko, T., & Belinkov, Y. (2024). Effect of tokenization on transformers for biological sequences. Bioinformatics, 40(4), btae196. https://doi.org/10.1093/bioinformatics/btae196

Ellingson, M. K., Sheikha, H., Nyhan, K., Oliveira, C. R., & Niccolai, L. M. (2023). Human papillomavirus vaccine effectiveness by age at vaccination: A systematic review. Human Vaccines & Immunotherapeutics, 19(2), 2239085. https://doi.org/10.1080/21645515.2023.2239085

Furuno, A., Sukegawa, A., Ohshige, K., Suzuki, Y., Yamaguchi, M., Miyagi, E., Ueda, Y., Sekine, M., & Mizushima, T. (2024). Three‐year questionnaire study on human papillomavirus vaccination targeting new female college school students: Follow‐up to a 2021 report to reveal the impact of a policy change in Japan. Journal of Obstetrics and Gynaecology Research, 50(9), 1640–1648. https://doi.org/10.1111/jog.16049

Heyde, S., Osmani, V., Schauberger, G., Cooney, C., & Klug, S. J. (2024). Global parental acceptance, attitudes, and knowledge regarding human papillomavirus vaccinations for their children: A systematic literature review and meta-analysis. BMC Women’s Health, 24(1), 537. https://doi.org/10.1186/s12905-024-03377-5

Khan, J., Ahmad, K., Jagatheesaperumal, S. K., & Sohn, K.-A. (2025). Textual variations in social media text processing applications: Challenges, solutions, and trends. Artificial Intelligence Review, 58(3), 89. https://doi.org/10.1007/s10462-024-11071-z

Kim, S. J., Schiffelbein, J. E., Imset, I., & Olson, A. L. (2022). Countering Antivax Misinformation via Social Media: Message-Testing Randomized Experiment for Human Papillomavirus Vaccination Uptake. Journal of Medical Internet Research, 24(11), e37559. https://doi.org/10.2196/37559

Li, Z., Yang, C., & Huang, C. (2023). A Comparative Sentiment Analysis of Airline Customer Reviews Using Bidirectional Encoder Representations from Transformers (BERT) and Its Variants. Mathematics, 12(1), 53. https://doi.org/10.3390/math12010053

Liu, J., Niu, Q., Nagai-Tanima, M., & Aoyama, T. (2025). Understanding Human Papillomavirus Vaccination Hesitancy in Japan Using Social Media: Content Analysis. Journal of Medical Internet Research, 27, e68881. https://doi.org/10.2196/68881

Lundén, N., Bekar, E. T., Skoogh, A., & Bokrantz, J. (2023). Domain Knowledge in CRISP-DM: An Application Case in Manufacturing. IFAC-PapersOnLine, 56(2), 7603–7608. https://doi.org/10.1016/j.ifacol.2023.10.1156

Lviv Polytechnic National University, Podolchak, N., Tsygylyk, N., Lviv Polytechnic National University, Petlovanyi, M., & Lviv Polytechnic National University. (2025). Mathematical modeling of multi-label classification of job descriptions using transformer-based neural networks. Mathematical Modeling and Computing, 12(3), 767–778. https://doi.org/10.23939/mmc2025.03.767

Ma, L., Chen, R., Ge, W., Rogers, P., Lyn-Cook, B., Hong, H., Tong, W., Wu, N., & Zou, W. (2025). AI-powered topic modeling: Comparing LDA and BERTopic in analyzing opioid-related cardiovascular risks in women. Experimental Biology and Medicine, 250, 10389. https://doi.org/10.3389/ebm.2025.10389

Mascarenhas, A. K., Kelekar, A., Lucia, V. C., & Afonso, N. M. (2024). The receipt of the human papillomavirus vaccine’s influence on future human papillomavirus vaccine recommendations by medical and dental students. JADA Foundational Science, 3, 100029. https://doi.org/10.1016/j.jfscie.2023.100029

Naoum, P., Athanasakis, K., Zavras, D., Kyriopoulos, J., & Pavi, E. (2022). Knowledge, Perceptions and Attitudes Toward HPV Vaccination: A Survey on Parents of Girls Aged 11–18 Years Old in Greece. Frontiers in Global Women’s Health, 3, 871090. https://doi.org/10.3389/fgwh.2022.871090

Papia, S. K., Khan, M. A., Habib, T., Rahman, M., & Islam, M. N. (2024). DistilRoBiLSTMFuse: An efficient hybrid deep learning approach for sentiment analysis. PeerJ Computer Science, 10, e2349. https://doi.org/10.7717/peerj-cs.2349

Pencheva, D. (2025). Profiling Noisy Social Media Data for Sentiment Applications: A Visual and Analytical Framework. SAR Journal - Science and Research, 213–224. https://doi.org/10.18421/SAR83-01

Saputra, A. N. A., Saputro, R. E., & Saputra, D. I. S. (2025). Enhancing Sentiment Analysis Accuracy Using SVM and Slang Word Normalization on YouTube Comments. Sinkron, 9(2), 687–699. https://doi.org/10.33395/sinkron.v9i2.14613

Sari, Y., & Handayani, M. (2021). ANALYSIS OF THE “SOME” MODEL (SHARE, OPTIMIZE, MANAGE, ENGAGE) INSTAGRAM ACCOUNT @tnlkep Kepulauanseribu IN THE FRAMEWORK OF DIGITAL PROMOTION OF A THOUSAND ISLANDS MARINE PARK AS AN ECO-TOURISM DESTINATION FOR THE MILLENIAL GENERATION. Moestopo International Review on Social, Humanities, and Sciences, 1(1), 7–15. https://doi.org/10.32509/mirshus.v1i1.5

Sendekie, A. K., Abate, B. B., Adamu, B. A., Tefera, A. M., Mekonnen, K. T., Ashagrie, M. A., Tadesse, Y. B., Dagnaw, A. D., Melaku, M. S., & Bizuneh, G. K. (2025). Human papillomavirus vaccination hesitancy among young girls in Ethiopia: Factors and barriers to uptake. Frontiers in Public Health, 13, 1507832. https://doi.org/10.3389/fpubh.2025.1507832

Singh, G., Dash, N. R., Shaju, A., & Chakkalakkunnan, S. S. (2025). Perceptions and sentiments associated with HPV vaccine uptake among Indian Reddit users: A qualitative social media analysis. BMC Public Health, 25(1), 4037. https://doi.org/10.1186/s12889-025-25418-w

Surya, J., Kashyap, H., Nadig, R. R., & Raman, R. (2023). Developing a Risk Stratification Model Based on Machine Learning for Targeted Screening of Diabetic Retinopathy in the Indian Population. Cureus. https://doi.org/10.7759/cureus.45853

Talaat, A. S. (2023). Sentiment analysis classification system using hybrid BERT models. Journal of Big Data, 10(1), 110. https://doi.org/10.1186/s40537-023-00781-w

Xue, J., Zhang, B., Zhang, Q., Hu, R., Jiang, J., Liu, N., Peng, Y., Li, Z., & Logan, J. (2023). Using Twitter-Based Data for Sexual Violence Research: Scoping Review. Journal of Medical Internet Research, 25, e46084. https://doi.org/10.2196/46084

Yoon, S., Kim, H., An, J., & Jin, S. W. (2024). Exploring human papillomavirus vaccine hesitancy among college students and the potential of virtual reality technology to increase vaccine acceptance: A mixed-methods study. Frontiers in Public Health, 12, 1331379. https://doi.org/10.3389/fpubh.2024.1331379

Zouhar, V., Meister, C., Gastaldi, J. L., Du, L., Sachan, M., & Cotterell, R. (2023). Tokenization and the Noiseless Channel (Version 1). arXiv. https://doi.org/10.48550/ARXIV.2306.16842

Published
2026-04-14
How to Cite
Silaban, D. E., & Mauritsius, T. (2026). Public Perceptions of HPV Vaccination Through Transformer-Based Social Media Sentiment Analysis. Moestopo International Review on Social, Humanities, and Sciences, 6(1), 114-131. https://doi.org/10.32509/mirshus.v6i1.175
Section
Articles