Authors :
Cynthia S; Durga N; Jayavarshini JK; Merlin Mahima A; Bala Abirami B
Volume/Issue :
Volume 10 - 2025, Issue 5 - May
Google Scholar :
https://tinyurl.com/38au9xv6
DOI :
https://doi.org/10.38124/ijisrt/25may574
Abstract :
The increasing demand for video content on digital platforms necessitates rapid, scalable, and cost-effective
production methods. This paper introduces an innovative, automated video generation platform that integrates advanced
Natural Language Processing (NLP), Text-to-Speech (TTS), and AI-driven image generation. Designed to democratize content
creation, the proposed system leverages a modular, cloud-based architecture to produce high-quality videos with minimal
human intervention. We discuss the system architecture, implementation details, workflow processes, performance evaluations,
and the challenges encountered. Future enhancements are also proposed to further expand the platform’s capabilities.