Authors :
Vasu Kapil; Dheeraj; Ritik Chauhan; Amardeep Singh
Volume/Issue :
Volume 10 - 2025, Issue 2 - February
Google Scholar :
https://tinyurl.com/47wp29yb
Scribd :
https://tinyurl.com/ms3s4bpm
DOI :
https://doi.org/10.5281/zenodo.14921228
Abstract :
This research explores the development of a real-time language translation system integrating speech
recognition, text-based translation, and sign language recognition. The system employs Google Translate API for
multilingual translation, MediaPipe Hands for sign language recognition, and SpeechRecognition for real-time voice input.
The study aims to bridge communication gaps between spoken, written, and signed languages. The paper presents
implementation details, experimental results, and future scope for improvement. Findings indicate promising accuracy in
sign recognition and speech translation, highlighting the potential for real-world application in accessibility and
communication enhancement.
References :
- “Direct Speech to Speech Translation Using Machine Learning”, December 2020
- S. Venkateswarlu, D. B. K. Kamesh , J. K. R. Sastry and Radhika Rani, “ Text to Speech Conversion”, 23 September 2020
- Chris Piech, Sami Abu-El-Haija, “Auto-Translation for Localized Instruction”, Sep 2019
- Sagar Patil, Mayuri Phonde, Siddharth Prajapati , “Multilingual Speech and Text Recognition and Translation using Image”, April-2020
- Bapna, A. 2019. Googleblog. Accessed 06.02.2022 https://ai.googleblog.com/2019/10/exploring-massively-multilingual.html
- Belval 2022. Github. Accessed 05.02.2022 https://github.com/Belval/pdf2image
- Bradski, G. & Kaehler, A. 2008. Learning OpenCV, Computer Vision with the OpenCV Library. Sebastopol. O'Reilly Media, Inc.
- Riverbank Computing. Accessed 27.03.2022 https://riverbank computing.com/s oftware/pyqt/intro
- Glyph & Cog LLC 2011. Mankier. Accessed 05.02.2022 https://www.mankier.com/1/pdftoppm
- Google 2006. Announcing Tesseract OCR. Accessed 05.02.2022 https://web.archive.org/web/2006102 6075310/http://google-code-updates.blogspot.com/2006/08/announcing-tesseract-ocr.html
- Han, S. 2020. Googletrans Documentation. Accessed 06.02.2022 https://py-googletrans.readthedocs.io/en/latest/
- Harwani, B. 2018. Qt5 Python GUI Programming Cookbook: Building responsive and powerful cross-platform applications with PyQt. Birmingham, UK: Packt Publishing Ltd.
- Konica Minolta 2018. Accessed 05.02.2022 https://www.konicaminolta.com.au/news-insight/blog/how-optical-character-recognition-works
- Lee, M. 2022. Pypi. Accessed 05.02.2022 https://pypi.org/project/pytesseract/
- Bowen, L. & Caswell, I. 2020. Googleblog. Accessed 06.02.2022 https://ai.googleblog.com/2020/06/recent-advances-in-google-translate.html
- Lutz, M. 2001. Programming Python, 2nd edition. Sebastopol. O'Reilly media.
- Och, F. 2006. Googleblog. Accessed 05.02.2022 https://ai.googleblog.com/2006/04/statistic al-machine-translation-live.html
- OpenCV, 2022. Accessed 06.02.2022 https://docs.opencv.org/4.x/d7/d4d/tutorial_py_thresholding.ht ml
This research explores the development of a real-time language translation system integrating speech
recognition, text-based translation, and sign language recognition. The system employs Google Translate API for
multilingual translation, MediaPipe Hands for sign language recognition, and SpeechRecognition for real-time voice input.
The study aims to bridge communication gaps between spoken, written, and signed languages. The paper presents
implementation details, experimental results, and future scope for improvement. Findings indicate promising accuracy in
sign recognition and speech translation, highlighting the potential for real-world application in accessibility and
communication enhancement.