Authors :
Shankar Dheeraj Konidena
Volume/Issue :
Volume 9 - 2024, Issue 10 - October
Google Scholar :
https://tinyurl.com/48xnp57y
Scribd :
https://tinyurl.com/yckbef2b
DOI :
https://doi.org/10.38124/ijisrt/IJISRT24OCT092
Note : A published paper may take 4-5 working days from the publication date to appear in PlumX Metrics, Semantic Scholar, and ResearchGate.
Abstract :
The current paper studies the impact of
Autoscaling on application performance in Cloud
computing environments. Cloud computing is one of the
most encouraging innovations due to its vast
applications. Predictive autoscaling is an advanced
technique that aims to address the challenges in the
autoscaling trends for large-scale systems. To account
for the Quality of Service to the customer, features like
load balancing according to the workload demand,
understanding resource allocation and utilization, and
dynamic decision-making are vital to any cloud
computing application. This paper interprets these
challenges and reviews a meta-reinforcement learning
approach for predictive autoscaling in cloud
environments. A novel RL-based predictive autoscaling
approach on a popular large-scale digital payment
platform system, Alipay, is compared with the existing
models such as Autopilot and FIRM. The aim is to
conduct a detailed analysis of performance metrics
before and after autoscaling actions, aiming to identify
optimal scaling strategies that minimize response time
and maximize resource utilization without over-
provisioning.
Keywords :
Predictive Autoscaling, Application Performance, Cloud Applications, Machine Learning, Reinforcement Learning.
References :
- Xue, S., Qu, C., Shi, X., Liao , C., Zhu, S., Tan, X., Ma, L., Wang, S., Hu, Y., Lei, L., Zheng, Y., Li, J., & Zhang, J. (2022). A Meta Reinforcement Learning Approach for Predictive Autoscaling in the Cloud. In Proceedings of the 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD ’22). https://doi.org/10.1145/3534678.3539063
- Roy, Nilabja & Dubey, Abhishek & Gokhale, Aniruddha. (2011). Efficient Autoscaling in the Cloud Using Predictive Models for Workload Forecasting. Proceedings - 2011 IEEE 4th International Conference on Cloud Computing, CLOUD 2011. 500-507. 10.1109/CLOUD.2011.42.
- Alipour, Hanieh & Hamou-Lhadj, Abdelwahab & Liu, Yan. (2014). Analysing Auto-scaling Issues in Cloud Environments.
- Shahin, A. A. (2017). Automatic Cloud Resource Scaling Algorithm based on Long Short-Term Memory Recurrent Neural Network. ArXiv. https://doi.org/10.14569/IJACSA.2016.071236
- Amir Fazli, Amin Sayedi, Jeffrey D. Shulman (2018) The Effects of Autoscaling in Cloud Computing. Management Science 64(11):5149-5163.
- J. H. Novak, S. K. Kasera and R. Stutsman, "Cloud Functions for Fast and Robust Resource Auto-Scaling," 2019 11th International Conference on Communication Systems & Networks (COMSNETS), Bengaluru, India, 2019, pp. 133-140, doi: 10.1109/COMSNETS.2019.8711058. keywords: {Delays; Cloud computing; Runtime; Current measurement; Load modeling; Servers},
- Arvindhan, M and Anand, Abhineet, Scheming a Proficient Auto Scaling Technique for Minimizing Response Time in Load Balancing on Amazon AWS Cloud (March 15, 2019). International Conference on Advances in Engineering Science Management & Technology (ICAESMT) - 2019, Uttaranchal University, Dehradun, India, Available at SSRN: https://ssrn.com/abstract=3390801 or http://dx.doi.org/10.2139/ssrn.3390801
- Hung, Che-Lun & Hu, Yu-Chen & Li, Kuan-Ching. (2012). Auto-Scaling Model for Cloud Computing System. International Journal of Hybrid Information Technology. 5. 181-186.
- Jacob, S., & G. (2021, October 5). 10 ways Google Cloud IaaS stands out. Google Cloud. https://cloud.google.com/blog/products/compute/google-clouds-iaas-platform-is-a-powerful-choice
The current paper studies the impact of
Autoscaling on application performance in Cloud
computing environments. Cloud computing is one of the
most encouraging innovations due to its vast
applications. Predictive autoscaling is an advanced
technique that aims to address the challenges in the
autoscaling trends for large-scale systems. To account
for the Quality of Service to the customer, features like
load balancing according to the workload demand,
understanding resource allocation and utilization, and
dynamic decision-making are vital to any cloud
computing application. This paper interprets these
challenges and reviews a meta-reinforcement learning
approach for predictive autoscaling in cloud
environments. A novel RL-based predictive autoscaling
approach on a popular large-scale digital payment
platform system, Alipay, is compared with the existing
models such as Autopilot and FIRM. The aim is to
conduct a detailed analysis of performance metrics
before and after autoscaling actions, aiming to identify
optimal scaling strategies that minimize response time
and maximize resource utilization without over-
provisioning.
Keywords :
Predictive Autoscaling, Application Performance, Cloud Applications, Machine Learning, Reinforcement Learning.