⚠ Official Notice: www.ijisrt.com is the official website of the International Journal of Innovative Science and Research Technology (IJISRT) Journal for research paper submission and publication. Please beware of fake or duplicate websites using the IJISRT name.



A Hybrid AI- and WebRTC-Based Collaboration Platform with Real-Time Speech-to-Text Transcription and an Interactive Chatbot System


Authors : Dr. S. Thaiyalnayaki; Kailash T.; K. V. Kishore; Khushi Rani; Kilari Bhavya

Volume/Issue : Volume 11 - 2026, Issue 5 - May


Google Scholar : https://tinyurl.com/yp5pu3ft

Scribd : https://tinyurl.com/th2tnu24

DOI : https://doi.org/10.38124/ijisrt/26May052



Abstract : Online meeting and collaboration tools have become vital for digital communication, but many existing platforms are limited to basic audio and video functionality and lack intelligent features that support accessibility and user engagement. To address these limitations, we propose a real-time collaborative online meeting application that integrates AI-based chat assistance and live speech transcription, with the aim of improving the efficiency, inclusiveness, and effectiveness of virtual meetings. The system is implemented with a React.js frontend and a Node.js backend that uses Express and Socket.IO for real-time interaction. Firebase Authentication provides secure access control, and Firestore serves as the cloud-based store for chat conversations and meeting transcripts. Real-time audio and video communication is carried over WebRTC, with Socket.IO handling signaling between participants. A chatbot powered by the Groq API offers contextual support during meetings, and a live transcription module built on the browser's Web Speech API performs automatic speech recognition, converting spoken content into text in real time and storing it with speaker identification. By combining real-time communication, cloud technologies, and applied artificial intelligence, the system delivers a more interactive, accessible, and productive virtual collaboration experience.

Keywords : WebRTC, AI-Powered Chatbot, Natural Language Processing (NLP), Speech-to-Text Conversion, Automatic Speech Recognition (ASR), Quality of Service (QoS).
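
The abstract states that Socket.IO handles signaling between WebRTC participants. The signaling protocol itself is not given there, so the TypeScript sketch below is only an illustration of the relay logic such a server typically needs: tracking which peers are in a meeting room and forwarding offers, answers, and ICE candidates to the addressed peer. The names `SignalMessage`, `SignalingRoom`, and `Deliver` are hypothetical and do not come from the paper. The `Deliver` callback stands in for whatever transport call the server uses to reach one client, and the sketch has no network dependency.

```typescript
// Hypothetical message shape: the paper does not specify its signaling format.
type SignalKind = "offer" | "answer" | "ice-candidate";

interface SignalMessage {
  kind: SignalKind;
  from: string;    // sender's peer id
  to: string;      // intended recipient's peer id
  payload: string; // SDP or ICE candidate, serialized by the client
}

// Delivers a message to one connected peer. In a real server this would
// wrap the transport's send call; here it is left abstract (assumption).
type Deliver = (message: SignalMessage) => void;

class SignalingRoom {
  private peers = new Map<string, Deliver>();

  // Register a peer and return the ids already present,
  // so the newcomer knows which peers to send offers to.
  join(peerId: string, deliver: Deliver): string[] {
    const existing = Array.from(this.peers.keys());
    this.peers.set(peerId, deliver);
    return existing;
  }

  // Remove a peer; later messages addressed to it are rejected.
  leave(peerId: string): void {
    this.peers.delete(peerId);
  }

  // Forward a message to its addressee only. Returns false when the
  // sender or the recipient is not in the room, so the caller can report it.
  relay(message: SignalMessage): boolean {
    if (!this.peers.has(message.from)) return false;
    const deliver = this.peers.get(message.to);
    if (deliver === undefined) return false;
    deliver(message);
    return true;
  }
}
```

In this sketch a newly joined participant receives the list of existing peers from `join` and would then create one offer per peer. Media never passes through the relay, which is consistent with WebRTC's peer-to-peer model, where the server only exchanges session descriptions and candidates.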


