⚠ Official Notice: www.ijisrt.com is the official website of the International Journal of Innovative Science and Research Technology (IJISRT) Journal for research paper submission and publication. Please beware of fake or duplicate websites using the IJISRT name.



Serverless Datalake Architecture for Finacial Analysis Using Cloud Computing


Authors : N. Kavya Sri; G. R. Yagna Chaitanya; S. Abhiram; Dr. Shaikshavali Shaik

Volume/Issue : Volume 11 - 2026, Issue 3 - March


Google Scholar : https://tinyurl.com/3mcvzeae

Scribd : https://tinyurl.com/5t4nfzv6

DOI : https://doi.org/10.38124/ijisrt/26mar807

Note : A published paper may take 4-5 working days from the publication date to appear in PlumX Metrics, Semantic Scholar, and ResearchGate.


Abstract : The explosive growth of financial data in modern institutions has created significant challenges in data management, scalability, and analytical processing. Traditional on-premises data processing systems often suffer from high infrastructure costs, complex maintenance, limited scalability, and slower analytical performance, making them inefficient for handling large and continuously growing financial datasets. To address these challenges, this paper presents a serverless data lake architecture for financial analysis using cloud computing technologies. The proposed system leverages cloud services such as Amazon S3, AWS Glue, Amazon Athena, and Amazon QuickSight to build a scalable and efficient analytics platform. Financial datasets collected from open-source sources such as Kaggle are ingested into Amazon S3, forming the storage layer of the data lake. The architecture follows a structured Extract–Transform–Load (ETL) pipeline, where AWS Glue performs automated data extraction, transformation, and conversion of raw datasets into optimized Parquet columnar format, while also creating a metadata catalog for efficient data discovery. Analytical queries are executed using Amazon Athena, which enables serverless SQL-based querying directly on the stored datasets. The resulting insights are visualized through interactive dashboards using Amazon QuickSight, allowing users to explore financial patterns and trends effectively. The proposed architecture eliminates the need for server management while providing high scalability, improved query performance, and cost-efficient data processing. Compared with traditional data processing systems, the serverless approach offers greater flexibility, reduced operational overhead, and faster analytical capabilities. These advantages make the proposed solution a highly effective and practical framework for large-scale financial data analytics in modern cloud environments.

Keywords : Serverless Computing , Scalability, ETL Pipeline , Data Lake Architecture.

References :

  1. Serverless Computing for Big Data Analytics: Performance and Cost Analysis of AWS Lambda and Google Cloud Functions. M. A. Ben Ali, Journal of Data Mining, Knowledge Discovery, and Decision Support Systems, 2023.  https://theneurolabs.com/index.php/JDMKD/article/view/2023-02-04.
  2. Efficient Serverless Architectures: Leveraging AWS Lambda and SageMaker for Scalable Workflow Solutions R. Chandra Thota, Journal of Science & Technology, vol. 5, no. 3, June 2024. https://doi.org/10.55662/JST.2024.5302.
  3. Serverless Architectures and Their Influence on WebDevelopment M. S. S. Lingolu & M. K. Dobbala, Journal of Artificial Intelligence & Cloud Computing, 2024. https://doi.org/10.47363/JAICC/2024(3)297.
  4. Toward Security Quantification of Serverless Computing Journal of Cloud Computing, Springer, 2024. https://doi.org/10.1186/s13677-024-00703-y.
  5. Data Lakes: A Survey of Concepts and Architectures Computers, MDPI, 2024 — covers data lake architectures relevant to cloud analytics systems. https://www.mdpi.com/2073-431X/13/7/183.
  6. Event-Driven Machine Learning Infrastructure: Performance Benchmarking of Cloud Serverless Functions I. Bansal, International Journal of Intelligent Systems and Applications in Engineering, 2024. (Benchmarking AWS Lambda vs container compute for ML tasks) https://www.ijisae.org/index.php/IJISAE/article/view/7624.
  7. Serverless AI-Powered Recommendation Engine with AWS Lambda and SageMaker J. Banerjee & S. Barman, International Journal of Computer Trends and Technology, 2024. https://doi.org/10.14445/22312803/IJCTT-V72I12P119.
  8. Federated Serverless Cloud Approaches: A Comprehensive Review Computers and Electrical Engineering, Elsevier, 2025. https://doi.org/10.1016/j.compeleceng.2025.110372.
  9. Performance Analysis of Serverless Computing in Hybrid Cloud Environments Simran Lamba, International Journal of Engineering & Extended Technologies Research (IJEETR), 2024. https://www.ijeetr.com/index.php/ijeetr/article/view/82.
  10. Performance Impact on Databases Using Serverless Architectures: An Empirical Study A. R. Toorpu, International Journal of Global Innovations and Solutions, 2025. https://ijgis.org/home/article/view/24.
  11. The Future of Serverless Architectures in Data Engineering H. K. Pedarla, International Journal of AI, BigData, Computational and Management Studies, 2026. https://ijaibdcms.org/index.php/ijaibdcms/article/view/360.

The explosive growth of financial data in modern institutions has created significant challenges in data management, scalability, and analytical processing. Traditional on-premises data processing systems often suffer from high infrastructure costs, complex maintenance, limited scalability, and slower analytical performance, making them inefficient for handling large and continuously growing financial datasets. To address these challenges, this paper presents a serverless data lake architecture for financial analysis using cloud computing technologies. The proposed system leverages cloud services such as Amazon S3, AWS Glue, Amazon Athena, and Amazon QuickSight to build a scalable and efficient analytics platform. Financial datasets collected from open-source sources such as Kaggle are ingested into Amazon S3, forming the storage layer of the data lake. The architecture follows a structured Extract–Transform–Load (ETL) pipeline, where AWS Glue performs automated data extraction, transformation, and conversion of raw datasets into optimized Parquet columnar format, while also creating a metadata catalog for efficient data discovery. Analytical queries are executed using Amazon Athena, which enables serverless SQL-based querying directly on the stored datasets. The resulting insights are visualized through interactive dashboards using Amazon QuickSight, allowing users to explore financial patterns and trends effectively. The proposed architecture eliminates the need for server management while providing high scalability, improved query performance, and cost-efficient data processing. Compared with traditional data processing systems, the serverless approach offers greater flexibility, reduced operational overhead, and faster analytical capabilities. These advantages make the proposed solution a highly effective and practical framework for large-scale financial data analytics in modern cloud environments.

Keywords : Serverless Computing , Scalability, ETL Pipeline , Data Lake Architecture.

Paper Submission Last Date
31 - March - 2026

SUBMIT YOUR PAPER CALL FOR PAPERS
Video Explanation for Published paper

Never miss an update from Papermashup

Get notified about the latest tutorials and downloads.

Subscribe by Email

Get alerts directly into your inbox after each post and stay updated.
Subscribe
OR

Subscribe by RSS

Add our RSS to your feedreader to get regular updates from us.
Subscribe