Authors :
Zainab Muhammad Nadada; Prema Kirubakaran; Muhammad Suleiman
Volume/Issue :
Volume 11 - 2026, Issue 3 - March
Google Scholar :
https://tinyurl.com/sbf3rtw8
Scribd :
https://tinyurl.com/36hdpa7x
DOI :
https://doi.org/10.38124/ijisrt/26mar1713
Note : A published paper may take 4-5 working days from the publication date to appear in PlumX Metrics, Semantic Scholar, and ResearchGate.
Abstract :
The rapid growth of the use of social media is posing a significant rate of social, political and economic threats in
Nigeria. Communication done online has the nature of being informal, especially in platforms like Instagram, thus
complicating the verification of information. It is against this background that this study developed and evaluated a machine
learning–based framework for fake news detection on the Instagram social media platform within the Nigerian context. The
objectives of the study were to: (i) design a machine learning–based framework for fake news detection on Instagram; (ii)
implement the designed framework using Term Frequency–Inverse Document Frequency (TF-IDF) feature extraction and
Logistic Regression classification techniques; and (iii) evaluate the performance of the developed model using accuracy,
precision, recall, and F1-score metrics. This work adopted an experimental research design. FakeNewsNet repository was
utilized to get publicly available benchmark datasets, which contains political and entertainment news - PolitiFact and
GossipCop, where data was labeled as fake or real. Nigerian Pidgin English dataset was incorporated into the training
process so as to improve contextual relevance and show transfer learning. Under data preprocessing, applied in this research
were techniques such as text cleaning, label normalization, and stratified data splitting. TF-IDF, and a Logistic Regression
model with class weight balancing was executed under the feature extraction process, where the model was trained and
evaluated using an 80:20 train-test split. To simulate Instagram message input and provide instant fake or real news
predictions that had confidence scores, a chat model for news verification was implemented. The results showed that the
model achieved an overall accuracy of approximately 82%, with satisfactory precision, recall, and F1-score values,
indicating effective classification performance. Pidgin English inputs were successfully classified, a key indicator that the
model is adaptable to local linguistics patterns. This study concluded that machine learning techniques, when combined with
appropriate feature extraction and contextual data, can effectively support Nigerian fake news detection on social media
platforms. Recommended in this study is the involvement of larger Nigerian-language datasets, the exploration of advanced
deep learning models, and full integration with social media APIs to enhance real-time deployment and enhance the rate of
accuracy of detection.
Keywords :
Term Frequency–Inverse Document Frequency (TF-IDF).
References :
- Abikoye, O. C., & Abdulsalam, S. O. (2024). A Bi-LSTM-2-ML transfer-learning framework for fake news detection using hybrid embeddings. International Journal of Computer Applications, 186(4), 12–22.
- Adewole, T., Balogun, A., & Salami, M. (2023). Machine learning-based fake news detection on Nigerian Twitter data using TF-IDF and SVM. Nigerian Journal of Computing and Applied Informatics, 5(2), 45–55.
- Ahmed, H., Traore, I., & Saad, S. (2021). Detecting opinion spam and fake news using text classification: A comparative analysis. Expert Systems with Applications, 168, 114 371.
- Esan, A., Adebimpe, O., & Ojo, K. (2023). Long-short-term memory model for fake news detection in Nigeria. Ianna Journal of Interdisciplinary Studies, 5(1), 71–82.
- Hossain, M., Rahman, M., & Islam, M. (2022). Fake news detection for Bangla using Bi-LSTM with word embeddings. Journal of Information Technology and Digital Services, 4(3), 23–33.
- Kaliyar, R. K., Goswami, A., & Narang, P. (2021). FakeBERT: Fake news detection in social media with a BERT-based deep learning approach. Multimedia Tools and Applications, 80, 134 67–134 83.
- Kumar, S., & Singh, R. (2020). Deep learning models for fake news detection: A comparative study. International Journal of Information Systems and Management, 10(3), 15–27.
- Muhammad, S., Adelani, D., Ruder, S., Ahmad, I., & Bello, I. (2022). NaijaSenti: A Nigerian Twitter sentiment corpus for four major languages. In Proceedings of the 13th Language Resources and Evaluation Conference (pp. 5805–5815). European Language Resources Association.
- Olowononi, F., Ayodele, T., & Eke, C. (2022). Hybrid NLP approach for misinformation detection in Nigerian WhatsApp messages. African Journal of Computing & ICT, 15(2), 59–68.
- Oyewusi, T., Adebayo, J., & Akintoye, B. (2020). Semantic enrichment and resource creation for Nigerian Pidgin language processing. Nigerian Journal of Language Technologies, 3(1), 44–55.
- Pandey, A., Singh, V., & Srivastava, A. (2022). Performance evaluation of machine learning classifiers for fake news detection. International Journal of Information Engineering, 9(4), 33–41.
- Patel, K., & Patel, D. (2021). Deep learning architectures and applications in NLP: A review. Journal of Artificial Intelligence Research, 12(2), 1–15.
- Sangamnerkar, M., Patil, A., & Deshmukh, R. (2020). An ensemble machine learning approach for fabricated news detection. Procedia Computer Science, 167, 2344–2353.
- Shu, K., Sliva, A., Wang, S., Tang, J., & Liu, H. (2019). Fake news detection on social media: A data mining perspective. ACM SIGKDD Explorations, 19(1), 22–36.
- Singhal, S., Shah, R., Chakraborty, T., & Kumaraguru, P. (2019). SpotFake: A multimodal framework for fake news detection. In Proceedings of the IEEE International Conference on Big Data (pp. 2141–2149). IEEE.
- Song, C., Lee, S., & Han, J. (2021). Multimodal fake news detection via cross-modal attention networks. Information Processing & Management, 58(4), 102 551.
- Varshini, M., Krishnan, G., & Raj, S. (2024). RDGT-GAN: Robust distribution-generalized transformer for fake news detection. Expert Systems with Applications, 241, 122 564.
- Villagracia-Octaviano, M. (2021). A comparative analysis of machine learning algorithms for fake news detection. Journal of Data Science and Analytics, 3(2), 49–60.
- Wang, Y., Chen, L., & Zhang, P. (2023). Detecting AI-generated and fake news using fine-tuned transformer models. Artificial Intelligence Review, 56, 7023–7039.
The rapid growth of the use of social media is posing a significant rate of social, political and economic threats in
Nigeria. Communication done online has the nature of being informal, especially in platforms like Instagram, thus
complicating the verification of information. It is against this background that this study developed and evaluated a machine
learning–based framework for fake news detection on the Instagram social media platform within the Nigerian context. The
objectives of the study were to: (i) design a machine learning–based framework for fake news detection on Instagram; (ii)
implement the designed framework using Term Frequency–Inverse Document Frequency (TF-IDF) feature extraction and
Logistic Regression classification techniques; and (iii) evaluate the performance of the developed model using accuracy,
precision, recall, and F1-score metrics. This work adopted an experimental research design. FakeNewsNet repository was
utilized to get publicly available benchmark datasets, which contains political and entertainment news - PolitiFact and
GossipCop, where data was labeled as fake or real. Nigerian Pidgin English dataset was incorporated into the training
process so as to improve contextual relevance and show transfer learning. Under data preprocessing, applied in this research
were techniques such as text cleaning, label normalization, and stratified data splitting. TF-IDF, and a Logistic Regression
model with class weight balancing was executed under the feature extraction process, where the model was trained and
evaluated using an 80:20 train-test split. To simulate Instagram message input and provide instant fake or real news
predictions that had confidence scores, a chat model for news verification was implemented. The results showed that the
model achieved an overall accuracy of approximately 82%, with satisfactory precision, recall, and F1-score values,
indicating effective classification performance. Pidgin English inputs were successfully classified, a key indicator that the
model is adaptable to local linguistics patterns. This study concluded that machine learning techniques, when combined with
appropriate feature extraction and contextual data, can effectively support Nigerian fake news detection on social media
platforms. Recommended in this study is the involvement of larger Nigerian-language datasets, the exploration of advanced
deep learning models, and full integration with social media APIs to enhance real-time deployment and enhance the rate of
accuracy of detection.
Keywords :
Term Frequency–Inverse Document Frequency (TF-IDF).