Authors :
Chaitenya Chand; Prashant
Volume/Issue :
Volume 10 - 2025, Issue 6 - June
Google Scholar :
https://tinyurl.com/aywcy8y5
DOI :
https://doi.org/10.38124/ijisrt/25jun934
Note : A published paper may take 4-5 working days from the publication date to appear in PlumX Metrics, Semantic Scholar, and ResearchGate.
Abstract :
This research paper explores the development of an AI-powered text-to-image generation system leveraging
Stable Diffusion and Flask. The project aims to provide an accessible interface for users to create high-quality images
from textual descriptions while integrating multilingual support via Google Translate. The paper discusses the
methodologies employed, including deep learning techniques, API integration, and optimization strategies. Challenges
such as API rate limits, ambiguous text processing, and performance enhancements are examined. The study further
evaluates the impact of AI in creative industries and suggests future improvements for enhanced customization and mobile
deployment.
Keywords :
AI-Generated Images, Stable Diffusion, Flask, Hugging Face API, Text-to-Image, Multilingual AI, Deep Learning, Generative Models.
References :
- Goodfellow, I., Pouget-Abadie, J., Mirza, M., Xu, B., Warde-Farley, D., Ozair, S., Courville, A., & Bengio, Y. (2014). Generative Adversarial Networks. arXiv preprint arXiv:1406.2661. https://arxiv.org/abs/1406.2661
- Ho, J., Jain, A., & Abbeel, P. (2020). Denoising Diffusion Probabilistic Models. Advances in Neural Information Processing Systems, 33, 6840-6851. https://arxiv.org/abs/2006.11239
- Vaswani, A., Shazeer, N., Parmar, N., Uszkoreit, J., Jones, L., Gomez, A. N., & Kaiser, L. (2017). Attention Is All You Need. NeurIPS. https://arxiv.org/abs/1706.03762
- OpenAI. (2021). DALL-E: Creating Images from Text. OpenAI Blog. https://openai.com/dall-e/
- Stability AI. (2023). Stable Diffusion Model Documentation. https://stability.ai/
- Hugging Face. (2023). Stable Diffusion API for AI Image Generation. https://huggingface.co/
- Google Cloud. (2023). Google Translate API Documentation. https://cloud.google.com/translate/
- Flask Official Documentation. (2023). Flask Web Framework for Python. https://flask.palletsprojects.com/
- Rombach, R., Blattmann, A., Lorenz, D., Esser, P., & Ommer, B. (2022). High-Resolution Image Synthesis with Latent Diffusion Models. https://arxiv.org/abs/2112.10752
- Kingma, D. P., & Welling, M. (2013). Auto-Encoding Variational Bayes. arXiv preprint. https://arxiv.org/abs/1312.6114
This research paper explores the development of an AI-powered text-to-image generation system leveraging
Stable Diffusion and Flask. The project aims to provide an accessible interface for users to create high-quality images
from textual descriptions while integrating multilingual support via Google Translate. The paper discusses the
methodologies employed, including deep learning techniques, API integration, and optimization strategies. Challenges
such as API rate limits, ambiguous text processing, and performance enhancements are examined. The study further
evaluates the impact of AI in creative industries and suggests future improvements for enhanced customization and mobile
deployment.
Keywords :
AI-Generated Images, Stable Diffusion, Flask, Hugging Face API, Text-to-Image, Multilingual AI, Deep Learning, Generative Models.