AI-Powered Text-to-Image Generation Using Stable Diffusion and Flask


Authors : Chaitenya Chand; Prashant

Volume/Issue : Volume 10 - 2025, Issue 6 - June


Google Scholar : https://tinyurl.com/aywcy8y5

DOI : https://doi.org/10.38124/ijisrt/25jun934


Abstract : This paper describes the development of an AI-powered text-to-image generation system built on Stable Diffusion and Flask. The project aims to provide an accessible interface through which users can create high-quality images from textual descriptions, with multilingual support via Google Translate. The paper discusses the methodologies employed, including deep learning techniques, API integration, and optimization strategies. It examines challenges such as API rate limits and the processing of ambiguous text, along with performance enhancements. The study further evaluates the impact of AI on creative industries and suggests future improvements for enhanced customization and mobile deployment.

Keywords : AI-Generated Images, Stable Diffusion, Flask, Hugging Face API, Text-to-Image, Multilingual AI, Deep Learning, Generative Models.
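
The pipeline outlined in the abstract (translate the prompt, call a hosted text-to-image model, cope with API rate limits) might be sketched as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: the names `generate_image`, `RateLimited`, `translate`, and `text_to_image` are hypothetical, and the translation and image backends are passed in as plain callables so that no specific Google Translate or Hugging Face client call is assumed. The retry policy (exponential backoff) is likewise an assumption; the abstract does not say how rate limits are handled.

```python
import time
from typing import Callable


class RateLimited(Exception):
    """Hypothetical error a backend wrapper raises when the API reports a
    rate limit (for example, an HTTP 429 response)."""


def generate_image(
    prompt: str,
    translate: Callable[[str], str],
    text_to_image: Callable[[str], bytes],
    max_retries: int = 3,
    base_delay: float = 1.0,
    sleep: Callable[[float], None] = time.sleep,
) -> bytes:
    """Translate a prompt to English, then request an image with retries.

    `translate` and `text_to_image` are stand-ins (assumptions) for the
    translation service and the hosted Stable Diffusion endpoint.
    """
    cleaned = prompt.strip()
    if not cleaned:
        raise ValueError("prompt must not be empty")

    # Step 1: normalise the prompt to English before generation.
    english = translate(cleaned)

    # Step 2: call the image backend, backing off when rate limited.
    for attempt in range(max_retries + 1):
        try:
            return text_to_image(english)
        except RateLimited:
            if attempt == max_retries:
                raise  # retries exhausted; let the caller report the error
            # Exponential backoff: base_delay, 2 * base_delay, 4 * base_delay, ...
            sleep(base_delay * (2 ** attempt))

    raise RuntimeError("unreachable")  # the loop always returns or raises
```

In a Flask application, a route handler could call `generate_image` with real client wrappers and return the resulting bytes as an image response. Injecting the backends as arguments also lets the retry logic be exercised with stub functions, without network access.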
