Authors :
Dr. T. Prakash; S. Kausalya
Volume/Issue :
Volume 10 - 2025, Issue 8 - August
Google Scholar :
https://tinyurl.com/3xnmcvuy
Scribd :
https://tinyurl.com/4nhwcccs
DOI :
https://doi.org/10.38124/ijisrt/25aug1065
Note : A published paper may take 4-5 working days from the publication date to appear in PlumX Metrics, Semantic Scholar, and ResearchGate.
Note : Google Scholar may take 30 to 40 days to display the article.
Abstract :
English as Second Language (ESL) learners often struggle with pronunciation, which can hinder academic success
and social integration. This study investigates the effectiveness of a speech-to-text artificial intelligence (AI) system in
improving pronunciation accuracy among ESL learners. Using a phoneme-matching approach, the system provided real-
time corrective feedback to students in semi-urban learning environments. Data were collected through pre- and post-tests
measuring accuracy, precision, recall, and F1-score. Results revealed a 15% improvement in pronunciation accuracy,
supported by consistent gains across all performance metrics. Learners also demonstrated increased confidence and
sustained engagement, highlighting the motivational value of instant AI-based feedback. These findings suggest that speech-
to-text AI can complement traditional instruction by offering personalized and continuous pronunciation training. Future
research should explore long-term retention and integration with immersive technologies such as virtual and augmented
reality.
Keywords :
Speech-to-Text, ESL Pronunciation, Artificial Intelligence, Natural Language Processing (NLP), Phoneme Feedback.
References :
- Brown, J., & Miller, S. (2019). Video-based communication tasks in ESL pronunciation. International Journal of Education, 10(2), 23–35.
- Garcia, L., & Patel, R. (2021). Adaptive AI systems for ESL learning. Journal of Applied Linguistics and AI, 8(1), 50–67.
- Kumar, P. (2021). Virtual reality in second language education. Language Technology Review, 5(3), 77–90.
- Lee, A. (2018). Long-term effects of pronunciation training in ESL learners. TESOL Quarterly, 52(4), 1020–1035.
- Rahman, F., & Devi, K. (2022). Barriers to ESL learning in semi-urban contexts. Asian Journal of English Studies, 14(2), 88–101.
- Smith, M., & Johnson, R. (2020). AI for ESL pronunciation. Journal of Language Research, 15(2), 50–65.
- Thomas, J., & Lee, S. (2020). Retention of pronunciation gains in AI-assisted learning. Educational Technology & Society, 23(4), 115–128.
- Wang, H., & Chen, Y. (2020). Neural networks for speech recognition in ESL learners. Computers & Education, 149, 103809.
English as Second Language (ESL) learners often struggle with pronunciation, which can hinder academic success
and social integration. This study investigates the effectiveness of a speech-to-text artificial intelligence (AI) system in
improving pronunciation accuracy among ESL learners. Using a phoneme-matching approach, the system provided real-
time corrective feedback to students in semi-urban learning environments. Data were collected through pre- and post-tests
measuring accuracy, precision, recall, and F1-score. Results revealed a 15% improvement in pronunciation accuracy,
supported by consistent gains across all performance metrics. Learners also demonstrated increased confidence and
sustained engagement, highlighting the motivational value of instant AI-based feedback. These findings suggest that speech-
to-text AI can complement traditional instruction by offering personalized and continuous pronunciation training. Future
research should explore long-term retention and integration with immersive technologies such as virtual and augmented
reality.
Keywords :
Speech-to-Text, ESL Pronunciation, Artificial Intelligence, Natural Language Processing (NLP), Phoneme Feedback.