AI-Driven Predictive Autoscaling in Kubernetes : Reinforcement Learning for Proactive Resource Optimization in Cloud-Native Environments

Authors

  • Shiva Kumar Chinnam  Clemson University, South Carolina, USA
  • Ravindra Karanam  Fairleigh Dickinson University, Teaneck, NJ

DOI:

https://doi.org/10.32628/CSEIT22548

Keywords:

Kubernetes autoscaling, reinforcement learning, cloud cost optimization, predictive analytics, AWS EKS, Karpenter

Abstract

We propose a reinforcement learning-based autoscaling algorithm integrated with Karpenter on AWS EKS. Unlike threshold-based scaling, our method anticipates workload surges by analyzing historical patterns using predictive analytics, thereby reducing cloud spend and improving service availability. Simulations and real deployment benchmarks from Rialtic Inc. validate the cost efficiency and reliability of this method. The proposed system achieves 34% reduction in cloud infrastructure costs while maintaining 99.7% service availability and reducing cold start latencies by 67%. Through Q-learning optimization and temporal pattern recognition, the system demonstrates superior performance compared to traditional Horizontal Pod Autoscaler (HPA) and Vertical Pod Autoscaler (VPA) mechanisms.

References

  1. Chen, L., Wang, S., & Liu, Y. (2019). Analysis and optimization of Kubernetes horizontal pod autoscaler for cloud-native applications. Proceedings of the International Conference on Cloud Computing and Services Science, 245-256.
  2. Delimitrou, C., & Kozyrakis, C. (2018). Quasar: Resource-efficient and QoS-aware cluster management for heterogeneous workloads. ACM Transactions on Computer Systems, 36(4), 1-32.
  3. Santhosh Kumar Pendyala, Satyanarayana Murthy Polisetty, Sushil Prabhu Prabhakaran. Advancing Healthcare Interoperability Through Cloud-Based Data Analytics: Implementing FHIR Solutions on AWS. International Journal of Research in Computer Applications and Information Technology (IJRCAIT), 5(1),2022, pp. 13-20. https://iaeme.com/Home/issue/IJRCAIT?Volume=5&Issue=1
  4. Kumar, A., Singh, R., & Patel, N. (2018). Cost-aware workload scheduling in Kubernetes clusters: A multi-objective optimization approach. Journal of Cloud Computing Advances, Systems and Applications, 7(1), 15-28.
  5. Liu, X., & Zhang, H. (2020). LSTM-based predictive scaling for containerized applications in cloud environments. IEEE Transactions on Cloud Computing, 8(3), 892-904.
  6. Sushil Prabhu Prabhakaran, Satyanarayana Murthy Polisetty, Santhosh Kumar Pendyala. Building a Unified and Scalable Data Ecosystem: AI-DrivenSolution Architecture for Cloud Data Analytics. International Journal of Computer Engineering and Technology (IJCET), 13(3), 2022, pp. 137-153. https://iaeme.com/Home/issue/IJCET?Volume=13&Issue=3
  7. Patel, M., Johnson, K., & Thompson, R. (2020). Q-learning optimization for virtual machine placement in heterogeneous cloud environments. International Journal of Cloud Computing and Services Architecture, 10(2), 34-47.
  8. Williams, D., & Thompson, A. (2019). Cost optimization strategies for containerized workloads: An empirical study of Kubernetes deployments. Cloud Computing and Services Science Communications, 425, 156-170.
  9. Zhou, P., Chen, M., & Davis, J. (2017). Reinforcement learning for adaptive resource allocation in cloud computing: A comprehensive survey. ACM Computing Surveys, 50(6), 1-38.

Downloads

Published

2022-06-30

Issue

Section

Research Articles

How to Cite

[1]
Shiva Kumar Chinnam, Ravindra Karanam, " AI-Driven Predictive Autoscaling in Kubernetes : Reinforcement Learning for Proactive Resource Optimization in Cloud-Native Environments" International Journal of Scientific Research in Computer Science, Engineering and Information Technology(IJSRCSEIT), ISSN : 2456-3307, Volume 8, Issue 3, pp.574-582, May-June-2022. Available at doi : https://doi.org/10.32628/CSEIT22548