Authors :
N. Kavya Sri; G. R. Yagna Chaitanya; S. Abhiram; Dr. Shaikshavali Shaik
Volume/Issue :
Volume 11 - 2026, Issue 3 - March
Google Scholar :
https://tinyurl.com/3mcvzeae
Scribd :
https://tinyurl.com/5t4nfzv6
DOI :
https://doi.org/10.38124/ijisrt/26mar807
Note : A published paper may take 4-5 working days from the publication date to appear in PlumX Metrics, Semantic Scholar, and ResearchGate.
Abstract :
The explosive growth of financial data in modern institutions has created significant challenges in data
management, scalability, and analytical processing. Traditional on-premises data processing systems often suffer from high
infrastructure costs, complex maintenance, limited scalability, and slower analytical performance, making them inefficient
for handling large and continuously growing financial datasets. To address these challenges, this paper presents a serverless
data lake architecture for financial analysis using cloud computing technologies. The proposed system leverages cloud
services such as Amazon S3, AWS Glue, Amazon Athena, and Amazon QuickSight to build a scalable and efficient analytics
platform. Financial datasets collected from open-source sources such as Kaggle are ingested into Amazon S3, forming the
storage layer of the data lake. The architecture follows a structured Extract–Transform–Load (ETL) pipeline, where AWS
Glue performs automated data extraction, transformation, and conversion of raw datasets into optimized Parquet columnar
format, while also creating a metadata catalog for efficient data discovery. Analytical queries are executed using Amazon
Athena, which enables serverless SQL-based querying directly on the stored datasets. The resulting insights are visualized
through interactive dashboards using Amazon QuickSight, allowing users to explore financial patterns and trends
effectively.
The proposed architecture eliminates the need for server management while providing high scalability, improved query
performance, and cost-efficient data processing. Compared with traditional data processing systems, the serverless
approach offers greater flexibility, reduced operational overhead, and faster analytical capabilities. These advantages make
the proposed solution a highly effective and practical framework for large-scale financial data analytics in modern cloud
environments.
Keywords :
Serverless Computing , Scalability, ETL Pipeline , Data Lake Architecture.
References :
- Serverless Computing for Big Data Analytics: Performance and Cost Analysis of AWS Lambda and Google Cloud Functions. M. A. Ben Ali, Journal of Data Mining, Knowledge Discovery, and Decision Support Systems, 2023. https://theneurolabs.com/index.php/JDMKD/article/view/2023-02-04.
- Efficient Serverless Architectures: Leveraging AWS Lambda and SageMaker for Scalable Workflow Solutions R. Chandra Thota, Journal of Science & Technology, vol. 5, no. 3, June 2024. https://doi.org/10.55662/JST.2024.5302.
- Serverless Architectures and Their Influence on WebDevelopment M. S. S. Lingolu & M. K. Dobbala, Journal of Artificial Intelligence & Cloud Computing, 2024. https://doi.org/10.47363/JAICC/2024(3)297.
- Toward Security Quantification of Serverless Computing Journal of Cloud Computing, Springer, 2024. https://doi.org/10.1186/s13677-024-00703-y.
- Data Lakes: A Survey of Concepts and Architectures Computers, MDPI, 2024 — covers data lake architectures relevant to cloud analytics systems. https://www.mdpi.com/2073-431X/13/7/183.
- Event-Driven Machine Learning Infrastructure: Performance Benchmarking of Cloud Serverless Functions I. Bansal, International Journal of Intelligent Systems and Applications in Engineering, 2024. (Benchmarking AWS Lambda vs container compute for ML tasks) https://www.ijisae.org/index.php/IJISAE/article/view/7624.
- Serverless AI-Powered Recommendation Engine with AWS Lambda and SageMaker J. Banerjee & S. Barman, International Journal of Computer Trends and Technology, 2024. https://doi.org/10.14445/22312803/IJCTT-V72I12P119.
- Federated Serverless Cloud Approaches: A Comprehensive Review Computers and Electrical Engineering, Elsevier, 2025. https://doi.org/10.1016/j.compeleceng.2025.110372.
- Performance Analysis of Serverless Computing in Hybrid Cloud Environments Simran Lamba, International Journal of Engineering & Extended Technologies Research (IJEETR), 2024. https://www.ijeetr.com/index.php/ijeetr/article/view/82.
- Performance Impact on Databases Using Serverless Architectures: An Empirical Study A. R. Toorpu, International Journal of Global Innovations and Solutions, 2025. https://ijgis.org/home/article/view/24.
- The Future of Serverless Architectures in Data Engineering H. K. Pedarla, International Journal of AI, BigData, Computational and Management Studies, 2026. https://ijaibdcms.org/index.php/ijaibdcms/article/view/360.
The explosive growth of financial data in modern institutions has created significant challenges in data
management, scalability, and analytical processing. Traditional on-premises data processing systems often suffer from high
infrastructure costs, complex maintenance, limited scalability, and slower analytical performance, making them inefficient
for handling large and continuously growing financial datasets. To address these challenges, this paper presents a serverless
data lake architecture for financial analysis using cloud computing technologies. The proposed system leverages cloud
services such as Amazon S3, AWS Glue, Amazon Athena, and Amazon QuickSight to build a scalable and efficient analytics
platform. Financial datasets collected from open-source sources such as Kaggle are ingested into Amazon S3, forming the
storage layer of the data lake. The architecture follows a structured Extract–Transform–Load (ETL) pipeline, where AWS
Glue performs automated data extraction, transformation, and conversion of raw datasets into optimized Parquet columnar
format, while also creating a metadata catalog for efficient data discovery. Analytical queries are executed using Amazon
Athena, which enables serverless SQL-based querying directly on the stored datasets. The resulting insights are visualized
through interactive dashboards using Amazon QuickSight, allowing users to explore financial patterns and trends
effectively.
The proposed architecture eliminates the need for server management while providing high scalability, improved query
performance, and cost-efficient data processing. Compared with traditional data processing systems, the serverless
approach offers greater flexibility, reduced operational overhead, and faster analytical capabilities. These advantages make
the proposed solution a highly effective and practical framework for large-scale financial data analytics in modern cloud
environments.
Keywords :
Serverless Computing , Scalability, ETL Pipeline , Data Lake Architecture.