2024 Mdpi Sensors
2024 Mdpi Sensors
Article
Auto-Scaling Techniques in Cloud Computing: Issues and
Research Directions
Saleha Alharthi, Afra Alshamsi, Anoud Alseiari and Abdulmalik Alwarafy *
Abstract: In the dynamic world of cloud computing, auto-scaling stands as a beacon of efficiency,
dynamically aligning resources with fluctuating demands. This paper presents a comprehensive
review of auto-scaling techniques, highlighting significant advancements and persisting challenges in
the field. First, we overview the fundamental principles and mechanisms of auto-scaling, including
its role in improving cost efficiency, performance, and energy consumption in cloud services. We then
discuss various strategies employed in auto-scaling, ranging from threshold-based rules and queuing
theory to sophisticated machine learning and time series analysis approaches. After that, we explore
the critical issues in auto-scaling practices and review several studies that demonstrate how these
challenges can be addressed. We then conclude by offering insights into several promising research
directions, emphasizing the development of predictive scaling mechanisms and the integration of
advanced machine learning techniques to achieve more effective and efficient auto-scaling solutions.
1. Introduction
Cloud computing enables users to access and utilize a range of computing resources,
including networks, servers, storage, and applications through the internet [1]. According
Citation: Alharthi, S.; Alshamsi, A.; to the National Institute of Standards and Technology, it is a system where you can easily
Alseiari, A.; Alwarafy, A. Auto-Scaling access and use these resources whenever you need them without much hassle or needing
Techniques in Cloud Computing: to interact a lot with the service provider [2]. Cloud computing has emerged as a critical
Issues and Research Directions. Sensors tool and is being used across various sectors, such as e-learning [3], healthcare [4], finance
2024, 24, 5551. https://doi.org/ and banking [5], manufacturing [6], and telecommunications [7].
10.3390/s24175551 In settings where data-intensive tasks are prevalent, crucial cloud computing features
Academic Editor: Jemal Abawajy like elasticity, resource allocation, and pooling, play significant roles. Scalability, a key
component of cloud computing, empowers organizations to modify their resource use to
Received: 3 July 2024 align with their needs, offering not only cost benefits but also crucial enhancements in
Revised: 26 July 2024
the performance of cloud-based applications [2]. The critical nature of scalability becomes
Accepted: 5 August 2024
particularly apparent in the realm of large-scale graph processing, which finds application
Published: 28 August 2024
in media, online gaming [8], and the internet of things (IoT) [9]. Conventional tools
for graph processing often falter with large-scale graphs, facing difficulties in balancing
scalability and cost-efficiency. [10]. Researchers in [11], discussed issues related to large-
Copyright: © 2024 by the authors.
scale optimization in elastic cloud computing. Auto-scaling, which involves dynamically
Licensee MDPI, Basel, Switzerland. increasing or decreasing resources based on specific needs and strategies, addresses this
This article is an open access article challenge by providing the flexibility to adapt.
distributed under the terms and In this review paper, we explore the concept of auto-scaling in cloud computing, a crit-
conditions of the Creative Commons ical mechanism that adjusts resources dynamically based on current demands. Section 2
Attribution (CC BY) license (https:// covers some existing works related to auto-scaling techniques. Section 3 lays the foundation
creativecommons.org/licenses/by/ by introducing the fundamentals of auto-scaling, particularly emphasizing its relevance
4.0/). in the cloud computing context. Following that, Section 4 delves into the algorithms
Acronym Definition
RL Reinforcement learning
SSCAS Self-Adaptive cloud auto-scaling system
IaaS Infrastructure as a Service
AWS Amazon Web Services
QoS Quality of service
MCS Mission Critical Services
MDP Markov decision process
ML Machine learning
DNN Deep neural network
CNFs Containerized network functions
VNFs Virtualized network functions
VMs Virtual machines
SLAs Service level agreements
HDFS Hadoop’s Distributed File System
Telco Telecommunications
CEAS Cost-Effective Auto-Scaling
SVM Support Vector Machine
OPCR On-demand provisioning of computing resource
IRS Intelligent resource scaling
IoT Internet of things
DT Digital twin
2. Related Work
Existing review papers have been published [12–21], and they have provided valuable
insights. However, with recent updates in the cloud computing environment, new research
areas have emerged. It is necessary to explore the current key challenges to propose future
research directions in this field. Therefore, this review covers auto-scaling techniques and
discusses the current challenges. Furthermore, we discuss the current applications and
domains of auto-scaling techniques. Table 2 illustrates the existing surveys and compares
them with our work.
To the best of our knowledge, no previous review paper has focused on all types of
auto-scaling techniques used in cloud environments. In this paper, we highlight different
types of auto-scaling techniques such as reactive methods (threshold rules, queuing theory)
and proactive methods (reinforcement learning, fuzzy learning, machine learning, and time
series analysis).
Sensors 2024, 24, 5551 3 of 21
Table 2. Cont.
3. Fundamentals of Auto-Scaling
This section explores the fundamental concepts of cloud computing, auto-scaling,
horizontal scaling, and vertical scaling. These concepts lay the foundation for the remainder
of the review paper, it ensures that readers have a clear insight of fundamental concepts
before delving into detailed analyses.
Cloud computing has swiftly become an essential part of daily life and is increasingly
favored by Internet users [22]. Users of cloud services pay only for the resources they use,
following a pay-per-use model. However, achieving a balance between cost-effectiveness
and performance requires precision. As shown in Figure 1 the architecture of cloud com-
puting consists of various components. The aim is to deliver top-notch cloud services at
minimal costs while ensuring prompt response to requests and system stability. Scala-
bility, a pivotal characteristic of cloud computing, addresses this challenge by enabling
flexibility and efficiency for users. To clarify the meaning of auto-scaling, auto-scaling
refers to the ability to scale in and out by adding or removing resources depending on the
requirements [7].
Auto-Scaling
Cloud Providers Auto-Scaling Feature
Supported (Yes/No)
Automatically scales number of EC2 instances for
AMAZON Yes
different applications.
Provides auto-scaling feature manually based on
WINDOWS AZURE Yes
the applications.
GOOGLE APP Owns auto-scaling technology. Yes
ENGINE Google applications. Yes
Supports auto-scaling technique in programmatic
GOGRID Yes/No
way and does not implement it.
Provides auto-scaling mechanism with high
FLEXISCALE Yes
performance and availability.
Application management service through cloud
ANEKA Yes
peer service.
Open-source cloud provided by resource
NIMBUS Yes
manager and Python modules.
Open-source cloud which provides wrapper
EUCALYPTUS Yes
service for various applications.
Open-source cloud which provides OpenNebula
OPEN NEBULA Yes
Service Management Project.
Sensors 2024, 24, 5551 7 of 21
4. Auto-Scaling Techniques
This section presents a detailed discussion of the reactive and proactive auto-scaling
methods. Auto-scaling is either reactive or proactive. Reactive methods in auto-scaling
refer to the allocation and deallocation of resources only when necessary. Furthermore,
reactive auto-scaling is the most commonly used in the market. However, proactive auto-
scaling attempts to scale resources while considering future workload patterns. A proactive
mechanism can handle sudden workload fluctuations by estimating the required number of
resources, whereas a reactive mechanism cannot handle such fluctuations [28,30]. A proac-
tive auto-scaling mechanism [31], can use past data on workload patterns. Hence, by using
a predictive technique on these data, they can estimate the future resources that are needed
for the future workload. In addition, the performance of proactive auto-scaling depends
on the accuracy of the workload prediction method. The integration of both methods can
improve a system’s throughput as discussed in [32], they introduced a hybrid auto-scaling
approach that utilizes reactive and proactive methods.
scaling decisions. Furthermore, it can improve energy efficiency by identifying the most
energy-efficient migration trends and server utilization scenarios.
demonstrates the process of collecting metrics from various microservices, such as AWS
EC2 instances. However, the model forecasts the workload, and based on these predictions,
the system decides to scale resources up or down automatically. To address the issue of
estimating the right threshold value for microservices, the authors of [39] proposed an
intelligent autonomous auto-scaling system. They combined existing machine learning
techniques with RL, and the results indicated that the microservice response time was
improved by up to 20% compared to the default auto-scaling model. Another proposal
introduced in [44] also addressed the challenges of auto-scaling in microservices in terms of
performance prediction and resource allocation. They applied particle swarm optimization
to a deep neural network (DNN) to realize optimal resource allocation. The results indi-
cated that the proposed technique improved performance while minimizing the total cost
of CPU by 30% to 50% compared to the baseline. Prior work [45] discussed the importance
of incorporating ML into AL-driven DevOps, which can lead to significant improvements
in cloud computing infrastructure. In [46], the authors developed an ML approach to
improve the efficiency and performance of cloud computing resources. In [47], the authors
described ML solutions in large-scale systems. In [48], the authors proposed a solution for
proactive mission-critical services (MCS) and described how the metrics provided by MCS
can predict future requirements to realize service scalability actions.
5. Real-World Applications
Auto-scaling techniques have become indispensable in various sectors, showcasing
their adaptability and critical role in resource optimization and performance enhancement
across diverse computing environments. This section reviews real-world applications of
auto-scaling, illustrating its effectiveness in ensuring efficient operations and resource uti-
lization in smart city infrastructure [23], social media analytics [24], telecommunications [7],
and healthcare system [53].
7. Research Directions
This section thoroughly investigates crucial future directions for auto-scaling in cloud
computing infrastructure. Based on our survey, we outlined some potential research directions
for future improvements, as discussed below.
of the existing work tried to minimize energy utilization. In [35], researchers proposed
an energy-efficient threshold to reduce energy consumption. Also, in [2], this approach
enhanced energy efficiency by shutting down unnecessary compute nodes during low
workload periods. Another important research direction that needs further exploration is
applying auto-scaling techniques to reduce energy consumption. As previously mentioned
in [37], the queuing theory-based model helps increase resource utilization and improves
energy efficiency by determining the most energy-efficient migration trends and server
utilization scenarios. Few studies have incorporated renewable energy sources, such as
solar panels, to provide power to data centers. These solutions are critical future research
directions that require additional investigation. The objective of adopting renewable energy
is to boost environmental sustainability by minimizing the dependence of data centers
on non-renewable energy sources during auto-scaling decisions. For sustainable energy
solutions, the authors of [55] encouraged researchers to explore more the combination
of hydropower dams with floating photovoltaic energy. In [56], the authors developed a
multi-agent deep reinforcement learning framework to address the challenges of renewable
energy in data centers. They successfully combined green and brown energy sources.
However, the framework was demonstrated through simulations; thus, adopting such a
system in practical cloud computing applications requires further investigation. In [57],
researchers investigated the best option in terms of power saving in both cloud computing
and edge computing.
the differences between Knative hybrid auto-scaling and default horizontal auto-scaling.
Results indicated that their system outperformed the existing default Knative horizon-
tal auto-scaling. In [60], they conducted a comparison of two auto-scaling approaches
under Kubernetes. While in [61], researchers proposed a hybrid auto-scaling framework
specifically for containerized in Kubernetes to enhance resource utilization. Furthermore,
for future improvements, the authors suggest determining a mechanism for selecting an
appropriate monitoring interval length.
8. Conclusions
In this paper, a literature survey of auto-scaling techniques was conducted, and we
outlined the main challenges of auto-scaling. Then, we presented several promising future
research directions based on the challenges. The different types of auto-scaling techniques
were presented and discussed, including reactive methods (threshold rules, queuing theory)
and proactive methods (RL, fuzzy learning, ML, and time series analysis). We discussed
existing real-world applications in cloud computing and the key challenges related to these
use cases. Additionally, we explained the fundamental concepts of cloud computing, auto-
scaling, horizontal scaling, and vertical scaling. In future work, we will explore specific
areas in auto-scaling. For example, we plan to focus more on resource allocation algorithms.
Author Contributions: Conceptualization, All authors; methodology, All authors; formal analysis,
All authors; investigation, All authors; resources, S.A., A.A. (Afra Alshamsi), and A.A. (Anoud
Alseiari); data curation, S.A., A.A. (Afra Alshamsi), and A.A. (Anoud Alseiari); writing—original
draft preparation, S.A., A.A. (Afra Alshamsi), and A.A. (Anoud Alseiari); writing—review and
editing, All authors; visualization, S.A., A.A. (Afra Alshamsi), and A.A. (Anoud Alseiari); supervi-
sion, A.A. (Adbulmalik Alwarafy); project administration, A.A. (Adbulmalik Alwarafy); funding
acquisition, A.A. (Adbulmalik Alwarafy). All authors have read and agreed to the published version
of the manuscript.
Funding: This work was supported by Startup Grant # 12T047 from the United Arab Emirates
University (UAEU).
Data Availability Statement: Data are contained within the article.
Conflicts of Interest: The authors declare no conflict of interest. The funders had no role in the design
of the study; in the collection, analyses, or interpretation of data; in the writing of the manuscript, or
in the decision to publish the results.
References
1. Goel, P.K.; Gulati, S.; Singh, A.; Tyagi, A.; Komal, K.; Mahur, L.S. Energy-Efficient Block-Chain Solutions for Edge and Cloud
Computing Infrastructures. In Proceedings of the 2024 2nd International Conference on Disruptive Technologies (ICDT), Greater
Noida, India, 15–16 March 2024; IEEE: Piscataway, NJ, USA, 2024; pp. 852–856.
2. Ismahene, N.W.; Souheila, B.; Nacereddine, Z. An Auto Scaling Energy Efficient Approach in Apache Hadoop. In Proceedings
of the 2020 International Conference on Advanced Aspects of Software Engineering (ICAASE), Constantine, Algeria, 28–30
November 2020; IEEE: Piscataway, NJ, USA, 2020; pp. 1–6.
3. Eljak, H.; Ibrahim, A.O.; Saeed, F.; Hashem, I.A.T.; Abdelmaboud, A.; Syed, H.J.; Abulfaraj, A.W.; Ismail, M.A.; Elsafi, A. E-learning
based Cloud Computing Environment: A Systematic Review, Challenges, and Opportunities. IEEE Access 2023, 12, 7329–7355.
[CrossRef]
4. Fathullah, M.A.; Subbarao, A.; Muthaiyah, S.; Taralunga, D. Risk Classes of Cloud Computing Project in Healthcare: A Review of
Technical Report and Standards. J. Adv. Res. Appl. Sci. Eng. Technol. 2024, 44, 169–180. [CrossRef]
5. Bodemer, O. Revolutionizing Finance: The Impact of AI and Cloud Computing in the Banking Sector. TechRxiv 2024. [CrossRef]
6. Kiatipis, A.; Xanthopoulos, A. Cloud Usage for Manufacturing: Challenges and Opportunities. Procedia Comput. Sci. 2024,
232, 1412–1419. [CrossRef]
7. Sao Cao, D.; Nguyen, D.T.; Nguyen, X.C.; Nguyen, H.B.; Lang, K.T.; Dao, N.L.; Pham, T.T.; Cao, N.S.; Chu, D.H.; Nguyen, P.H.;
et al. Elastic auto-scaling architecture in telco cloud. In Proceedings of the 2023 25th International Conference on Advanced
Communication Technology (ICACT), Pyeongchang, Republic of Korea, 19–22 February 2023; IEEE: Piscataway, NJ, USA, 2023;
pp. 401–406.
8. Boujelben, Y.; Fourati, H. A distributed auction-based algorithm for virtual machine placement in multiplayer cloud gaming
infrastructures. Int. J. Cloud Comput. 2024, 13, 80–98. [CrossRef]
9. Singh, P.D.; Singh, K.D. Interdisciplinary Approaches: Fog/Cloud Computing and IoT for AI and Robotics Integration. EAI
Endorsed Trans. AI Robot. 2024 , 3. [CrossRef]
10. Heidari, S.; Buyya, R. A cost-efficient auto-scaling algorithm for large-scale graph processing in cloud environments with
heterogeneous resources. IEEE Trans. Softw. Eng. 2019, 47, 1729–1741. [CrossRef]
11. Simic, V.; Stojanovic, B.; Ivanovic, M. Optimizing the performance of optimization in the cloud environment–An intelligent
auto-scaling approach. Future Gener. Comput. Syst. 2019, 101, 909–920. [CrossRef]
12. Fourati, M.H.; Marzouk, S.; Jmaiel, M. Towards Microservices-Aware Autoscaling: A Review. In Proceedings of the 2023 IEEE
Symposium on Computers and Communications (ISCC), Gammarth, Tunisia, 9–12 July 2023; IEEE: Piscataway, NJ, USA, 2023;
pp. 1080–1083.
Sensors 2024, 24, 5551 19 of 21
13. Tran, M.N.; Vu, D.D.; Kim, Y. A survey of autoscaling in kubernetes. In Proceedings of the 2022 Thirteenth International
Conference on Ubiquitous and Future Networks (ICUFN), Barcelona, Spain, 5–8 July 2022; IEEE: Piscataway, NJ, USA, 2022;
pp. 263–265.
14. Singh, P.; Gupta, P.; Jyoti, K.; Nayyar, A. Research on auto-scaling of web applications in cloud: Survey, trends and future
directions. Scalable Comput. Pract. Exp. 2019, 20, 399–432. [CrossRef]
15. Qu, C.; Calheiros, R.N.; Buyya, R. Auto-scaling web applications in clouds: A taxonomy and survey. ACM Comput. Surv. (CSUR)
2018, 51, 1–33. [CrossRef]
16. Garí, Y.; Monge, D.A.; Pacini, E.; Mateos, C.; Garino, C.G. Reinforcement learning-based application autoscaling in the cloud: A
survey. Eng. Appl. Artif. Intell. 2021, 102, 104288. [CrossRef]
17. Gari, Y.; Monge, D.A.; Pacini, E.; Mateos, C.; Garino, C.G. Reinforcement learning-based autoscaling of workflows in the cloud: A
survey. arXiv 2020, arXiv:2001.09957.
18. Dogani, J.; Namvar, R.; Khunjush, F. Auto-scaling techniques in container-based cloud and edge/fog computing: Taxonomy and
survey. Comput. Commun. 2023, 209, 120–150. [CrossRef]
19. Chen, T.; Bahsoon, R.; Yao, X. A survey and taxonomy of self-aware and self-adaptive cloud autoscaling systems. ACM Comput.
Surv. (CSUR) 2018, 51, 1–40. [CrossRef]
20. Rabiu, S.; Yong, C.H.; Mohamad, S.M.S. A cloud-based container microservices: A review on load-balancing and auto-scaling
issues. Int. J. Data Sci. 2022, 3, 80–92. [CrossRef]
21. Khan, A.A.; Vidhyadhari, C.H.; Kumar, S. A review on fixed threshold based and adaptive threshold based auto-scaling
techniques in cloud computing. MATEC Web Conf. 2024, 392, 01115. [CrossRef]
22. Hung, C.L.; Hu, Y.C.; Li, K.C. Auto-scaling model for cloud computing system. Int. J. Hybrid Inf. Technol. 2012, 5, 181–186.
23. Solino, A.; Batista, T.; Cavalcante, E. Decision-Making Support to Auto-scale Smart City Platform Infrastructures. In Proceedings
of the 2023 18th Iberian Conference on Information Systems and Technologies (CISTI), Aveiro, Portugal, 20–23 June 2023; IEEE:
Piscataway, NJ, USA, 2023; pp. 1–6.
24. Jannapureddy, R.; Vien, Q.T.; Shah, P.; Trestian, R. An auto-scaling framework for analyzing big data in the cloud environment.
Appl. Sci. 2019, 9, 1417. [CrossRef]
25. Alipour, H.; Liu, Y.; Hamou-Lhadj, A. Analyzing auto-scaling issues in cloud environments. CASCON 2014, 14, 75–89.
26. Atadoga, A.; Umoga, U.J.; Lottu, O.A.; Sodiya, E.O. Evaluating the impact of cloud computing on accounting firms: A review of
efficiency, scalability, and data security. Glob. J. Eng. Technol. Adv. 2024, 18, 065–074. [CrossRef]
27. Mullapudi, M.; Munjala, M.B.; Kulkarni, C. Designing a Resilient Parallel Distributed Task Infrastructure for Large-Scale Data
Processing. Available online: https://www.researchgate.net/profile/Mahidhar-Mullapudi/publication/378438900_Designing_
a_Resilient_Parallel_Distributed_Task_Infrastructure_for_Large-scale_Data_Processing/links/65d96d94adc608480ae7fa04/
Designing-a-Resilient-Parallel-Distributed-Task-Infrastructure-for-Large-scale-Data-Processing.pdf (accessed on 1 June 2024).
28. Singh, S.T.; Tiwari, M.; Dhar, A.S. Machine Learning based Workload Prediction for Auto-scaling Cloud Applications. In
Proceedings of the 2022 OPJU International Technology Conference on Emerging Technologies for Sustainable Development
(OTCON), Raigarh, India, 8–10 February 2022; IEEE: Piscataway, NJ, USA, 2023; pp. 1–6.
29. Ashalatha, R.; Agarkhed, J. Evaluation of auto scaling and load balancing features in cloud. Int. J. Comput. Appl. 2015 , 117.
30. Taha, M.B.; Sanjalawe, Y.; Al-Daraiseh, A.; Fraihat, S.; Al-E’mari, S.R. Proactive Auto-Scaling for Service Function Chains in
Cloud Computing based on Deep Learning. IEEE Access 2024, 12, 38575–38593. [CrossRef]
31. Ahamed, Z.; Khemakhem, M.; Eassa, F.; Alsolami, F.; Al-Ghamdi, A.S.A.M. Technical study of deep learning in cloud computing
for accurate workload prediction. Electronics 2023, 12, 650. [CrossRef]
32. Pereira, P.; Araujo, J.; Maciel, P. A hybrid mechanism of horizontal auto-scaling based on thresholds and time series. In
Proceedings of the 2019 IEEE International Conference on Systems, Man and Cybernetics (SMC), Bari, Italy, 6–9 October 2019;
IEEE: Piscataway, NJ, USA, 2019; pp. 2065–2070.
33. Quattrocchi, G.; Incerto, E.; Pinciroli, R.; Trubiani, C.; Baresi, L. Autoscaling Solutions for Cloud Applications under Dynamic
Workloads. IEEE Trans. Serv. Comput. 2024, 17, 804–820. [CrossRef]
34. Hu, Y.; Deng, B.; Peng, F. Autoscaling prediction models for cloud resource provisioning. In Proceedings of the 2016 2nd IEEE
International Conference on Computer and Communications (ICCC), Chengdu, China, 14–17 October 2016; IEEE: Piscataway, NJ,
USA, 2016; pp. 1364–1369.
35. Malla, P.A.; Sheikh, S.; Shahid, M.; Mushtaq, S.U. Energy-efficient sender-initiated threshold-based load balancing (e-STLB) in
cloud computing environment. Concurr. Comput. Pract. Exp. 2024, 36, e7943. [CrossRef]
36. Tournaire, T.; Castel-Taleb, H.; Hyon, E. Efficient Computation of Optimal Thresholds in Cloud Auto-scaling Systems. ACM
Trans. Model. Perform. Eval. Comput. Syst. 2023, 8, 1–31. [CrossRef]
37. Kushchazli, A.; Safargalieva, A.; Kochetkova, I.; Gorshenin, A. Queuing Model with Customer Class Movement across Server
Groups for Analyzing Virtual Machine Migration in Cloud Computing. Mathematics 2024, 12, 468. [CrossRef]
38. Heimerson, A.; Eker, J.; Årzén, K.E. A Proactive Cloud Application Auto-Scaler using Reinforcement Learning. In Proceedings of
the 2022 IEEE/ACM 15th International Conference on Utility and Cloud Computing (UCC), Vancouver, WA, USA, 6–9 December
2022; IEEE: Piscataway, NJ, USA, 2022; pp. 213–220.
39. Khaleq, A.A.; Ra, I. Intelligent autoscaling of microservices in the cloud for real-time applications. IEEE Access 2021, 9, 35464–35476.
[CrossRef]
Sensors 2024, 24, 5551 20 of 21
40. Joyce, J.E.; Sebastian, S. Reinforcement learning based autoscaling for kafka-centric microservices in kubernetes. In Proceedings
of the 2022 IEEE 4th PhD Colloquium on Emerging Domain Innovation and Technology for Society (PhD EDITS), Bangalore,
India, 4–5 November 2022; IEEE: Piscataway, NJ, USA, 2022; pp. 1–2.
41. Alnawayseh, S.E.; Muhammad, M.H.G.; Hassan, Z.; Fatima, M.; Aslam, M.S.; Ibrahim, A.; Ateeq, K. Resource Provisioning in
Cloud Computing using Fuzzy Logic Control System: An adaptive approach. In Proceedings of the 2023 International Conference
on Business Analytics for Technology and Security (ICBATS), Dubai, United Arab Emirates, 7–8 March 2023; IEEE: Piscataway,
NJ, USA, 2023; pp. 1–9.
42. Mahapatra, A.; Majhi, S.K.; Mishra, K.; Pradhan, R.; Rao, D.C.; Panda, S.K. An energy-aware task offloading and load balancing
for latency-sensitive IoT applications in the Fog-Cloud continuum. IEEE Access 2024, 12, 14334–14349. [CrossRef]
43. Abdulazeez, D.H.; Askar, S.K. A Novel Offloading Mechanism Leveraging Fuzzy Logic and Deep Reinforcement Learning to
Improve IoT Application Performance in a Three-Layer Architecture within the Fog-Cloud Environment. IEEE Access 2024, 12,
39936–39952. [CrossRef]
44. Luan, S.; Shen, H. Minimize Resource Cost for Containerized Microservices Under SLO via ML-Enhanced Layered Queueing
Network Optimization. In Proceedings of the 2024 14th International Conference on Cloud Computing, Data Science &
Engineering (Confluence), Noida, India, 18–19 January 2024; IEEE: Piscataway, NJ, USA, 2024; pp. 631–637.
45. Jensen, A. AI-Driven DevOps: Enhancing Automation with Machine Learning in AWS. Integr. J. Sci. Technol. 2024 , 1.
46. Chhabra, M.; Arora, G. The Impact of Machine Learning (ML) Driven Algorithm Ranking and Visualization on Task Scheduling
in Cloud Computing. In Proceedings of the 2023 3rd International Conference on Advancement in Electronics & Communication
Engineering (AECE), Ghaziabad, India, 23–24 November 2023; IEEE: Piscataway, NJ, USA, 2023; pp. 969–972.
47. Yang, R.; Ouyang, X.; Chen, Y.; Townend, P.; Xu, J. Intelligent resource scheduling at scale: A machine learning perspective. In
Proceedings of the 2018 IEEE Symposium on Service-Oriented System Engineering (SOSE), Bamberg, Germany, 26–29 March
2018; IEEE: Piscataway, NJ, USA, 2018; pp. 132–141.
48. Spantideas, S.; Giannopoulos, A.; Cambeiro, M.A.; Trullols-Cruces, O.; Atxutegi, E.; Trakadas, P. Intelligent Mission Critical
Services over Beyond 5G Networks: Control Loop and Proactive Overload Detection. In Proceedings of the 2023 International
Conference on Smart Applications, Communications and Networking (SmartNets), Istanbul, Turkey, 25–27 July 2023; IEEE:
Piscataway, NJ, USA, 2023; pp. 1–6.
49. Pfister, B.J.; Scheinert, D.; Geldenhuys, M.K.; Kao, O. Daedalus: Self-Adaptive Horizontal Autoscaling for Resource Efficiency of
Distributed Stream Processing Systems. arXiv 2024, arXiv:2403.02093.
50. Gkontzis, A.F.; Kotsiantis, S.; Feretzakis, G.; Verykios, V.S. Temporal Dynamics of Citizen-Reported Urban Challenges: A
Comprehensive Time Series Analysis. Big Data Cogn. Comput. 2024, 8, 27. [CrossRef]
51. Westergaard, G.; Erden, U.; Mateo, O.A.; Lampo, S.M.; Akinci, T.C.; Topsakal, O. Time Series Forecasting Utilizing Automated
Machine Learning (AutoML): A Comparative Analysis Study on Diverse Datasets. Information 2024, 15, 39. [CrossRef]
52. Muccini, H.; Vaidhyanathan, K. A machine learning-driven approach for proactive decision making in adaptive architectures. In
Proceedings of the 2019 IEEE international conference on software architecture companion (ICSA-C), Hamburg, Germany, 25–26
March 2019; IEEE: Piscataway, NJ, USA, 2019; pp. 242–245.
53. Ahn, Y.W.; Cheng, A.M.; Baek, J.; Jo, M.; Chen, H.H. An auto-scaling mechanism for virtual resources to support mobile,
pervasive, real-time healthcare applications in cloud computing. IEEE Netw. 2013, 27, 62–68. [CrossRef]
54. Dougherty, B.; White, J.; Schmidt, D.C. Model-driven auto-scaling of green cloud computing infrastructure. Future Gener. Comput.
Syst. 2012, 28, 371–378. [CrossRef]
55. Grimson, M.; Almeida, R.; Shi, Q.; Bai, Y.; Angarita, H.; Pacheco, F.S.; Schmitt, R.; Flecker, A.; Gomes, C.P. Scaling Up Pareto
Optimization for Tree Structures with Affine Transformations: Evaluating Hybrid Floating Solar-Hydropower Systems in the
Amazon. In Proceedings of the AAAI Conference on Artificial Intelligence, Vancouver, BC, Canada, 20–28 February 2024;
Volume 38, pp. 22067–22075.
56. Jayanetti, A.; Halgamuge, S.; Buyya, R. Multi-Agent Deep Reinforcement Learning Framework for Renewable Energy-Aware
Workflow Scheduling on Distributed Cloud Data Centers. IEEE Trans. Parallel Distrib. Syst. 2024, 35, 604–615. [CrossRef]
57. Baynum, A.; Hao, W. Exploring the Impact of Cloud Computing and Edge Computing on Resource Consumption for Mobile
Devices with Generative Artificial Intelligence APIs. In Proceedings of the 2024 IEEE 14th Annual Computing and Communication
Workshop and Conference (CCWC), Las Vegas, NV, USA, 8–10 January 2024; IEEE: Piscataway, NJ, USA, 2024; pp. 0603–0609.
58. Shi, T.; Ma, H.; Chen, G.; Hartmann, S. Auto-Scaling Containerized Applications in Geo-Distributed Clouds. IEEE Trans. Serv.
Comput. 2023, 16, 4261–4274. [CrossRef]
59. Meng, C.; Tong, H.; Wu, T.; Pan, M.; Yu, Y. Multi-Level ML Based Burst-Aware Autoscaling for SLO Assurance and Cost Efficiency.
arXiv 2024, arXiv:2402.12962.
60. Choonhaklai, P.; Chantrapornchai, C. Two Autoscaling Approaches on Kubernetes Clusters Against Data Streaming Applications.
In Proceedings of the 2023 International Technical Conference on Circuits/Systems, Computers, and Communications (ITC-CSCC),
Jeju, Republic of Korea, 25–28 June 2023; IEEE: Piscataway, NJ, USA, 2023; pp. 1–6.
61. Vu, D.D.; Tran, M.N.; Kim, Y. Predictive hybrid autoscaling for containerized applications. IEEE Access 2022, 10, 109768–109778.
[CrossRef]
62. Ju, L.; Singh, P.; Toor, S. Proactive autoscaling for edge computing systems with kubernetes. In Proceedings of the IEEE/ACM
International Conference on Utility and Cloud Computing Companion, Leicester, UK, 6–9 December 2021; pp. 1–8.
Sensors 2024, 24, 5551 21 of 21
63. Wang, Y.; Li, Y.; Guo, J.; Fan, Y.; Chen, L.; Zhang, B.; Wang, W.; Zhao, Y.; Zhang, J. On-demand provisioning of computing
resources in computing power network with mixed CPU and GPU. In Proceedings of the 2023 21st International Conference on
Optical Communications and Networks (ICOCN), Qufu, China, 31 July–3 August 2023; IEEE: Piscataway, NJ, USA, 2023; pp. 1–3.
64. Jeon, J.; Jeong, B.; Jeong, Y.S. Intelligent resource scaling for container based digital twin simulation of consumer electronics.
IEEE Trans. Consum. Electron. 2023, 70, 3131–3140. [CrossRef]
65. Gu, R.; Chen, Y.; Liu, S.; Dai, H.; Chen, G.; Zhang, K.; Che, Y.; Huang, Y. Liquid: Intelligent resource estimation and network-
efficient scheduling for deep learning jobs on distributed GPU clusters. IEEE Trans. Parallel Distrib. Syst. 2021, 33, 2808–2820.
[CrossRef]
66. Hwang, R.H.; Peng, M.C.; Huang, C.W.; Lin, P.C.; Nguyen, V.L. An unsupervised deep learning model for early network traffic
anomaly detection. IEEE Access 2020, 8, 30387–30399. [CrossRef]
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual
author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to
people or property resulting from any ideas, methods, instructions or products referred to in the content.