AI-Driven Predictive Autoscaling in Kubernetes: Reinforcement Learning for Proactive Resource Optimization in Cloud-Native Environments

Shiva Kumar Chinnam; Ravindra Karanam

doi:10.32628/CSEIT22548

Authors

Shiva Kumar Chinnam, Clemson University, South Carolina, USA
Ravindra Karanam, Fairleigh Dickinson University, Teaneck, NJ

Keywords:

Kubernetes autoscaling, reinforcement learning, cloud cost optimization, predictive analytics, AWS EKS, Karpenter

Abstract

We propose a reinforcement learning-based autoscaling algorithm integrated with Karpenter on AWS EKS. Unlike threshold-based scaling, which reacts only after a metric crosses a limit, our method anticipates workload surges by analyzing historical usage patterns, thereby reducing cloud spend and improving service availability. Simulations and real deployment benchmarks from Rialtic Inc. validate the cost efficiency and reliability of this method. The proposed system achieves a 34% reduction in cloud infrastructure costs while maintaining 99.7% service availability and reducing cold-start latency by 67%. By combining Q-learning optimization with temporal pattern recognition, the system outperforms the traditional Horizontal Pod Autoscaler (HPA) and Vertical Pod Autoscaler (VPA) mechanisms.
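
To make the Q-learning idea concrete, here is a minimal tabular sketch in Python. It is not the authors' implementation and does not use Karpenter or EKS. The toy environment (a fixed daily demand curve), the reward weights, the state encoding of (hour, replica count), and all function names are assumptions chosen for illustration only. The reward is computed against the next hour's demand, so the agent is rewarded for scaling before a surge rather than after it.

```python
import random

ACTIONS = (-1, 0, 1)   # remove a replica, hold, add a replica
MAX_REPLICAS = 5       # hypothetical upper bound on replicas
HOURS = 24             # one episode is one simulated day


def demand(hour):
    """Hypothetical daily load: replicas needed at a given hour."""
    return 4 if 9 <= hour < 18 else 1


def step(hour, replicas, action):
    """Apply an action, advance one hour, and return (next_state, reward)."""
    replicas = min(MAX_REPLICAS, max(1, replicas + action))
    next_hour = (hour + 1) % HOURS
    shortfall = max(0, demand(next_hour) - replicas)
    # Assumed reward: pay 1 per running replica, 10 per replica of unmet demand.
    reward = -1.0 * replicas - 10.0 * shortfall
    return (next_hour, replicas), reward


def q_update(q, state, action, reward, next_state, alpha=0.1, gamma=0.9):
    """Standard Q-learning update:
    Q(s,a) += alpha * (r + gamma * max_a' Q(s',a') - Q(s,a))."""
    best_next = max(q[(next_state, a)] for a in ACTIONS)
    q[(state, action)] += alpha * (reward + gamma * best_next - q[(state, action)])


def train(episodes=2000, epsilon=0.1, seed=0):
    """Train a Q-table with an epsilon-greedy policy on the toy environment."""
    rng = random.Random(seed)
    q = {((h, r), a): 0.0
         for h in range(HOURS)
         for r in range(1, MAX_REPLICAS + 1)
         for a in ACTIONS}
    for _ in range(episodes):
        state = (0, 1)  # start each day at midnight with one replica
        for _ in range(HOURS):
            if rng.random() < epsilon:
                action = rng.choice(ACTIONS)          # explore
            else:
                action = max(ACTIONS, key=lambda a: q[(state, a)])  # exploit
            next_state, reward = step(state[0], state[1], action)
            q_update(q, state, action, reward, next_state)
            state = next_state
    return q


def greedy_action(q, state):
    """Return the action with the highest learned value in a state."""
    return max(ACTIONS, key=lambda a: q[(state, a)])
```

After training, `greedy_action(train(), (8, 1))` looks up the learned action for 08:00 with one replica running. In a real cluster the state would come from observed metrics and the action would be translated into a scaling request; both of those steps are outside this sketch.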

