Coal Ai
Coal Ai
To cite this article: C. Bisset, P. V. Z. Venter & R. Coetzer (2023) A systematic literature
review on machine learning applications at coal-fired thermal power plants for improved
energy efficiency, International Journal of Sustainable Energy, 42:1, 845-872, DOI:
10.1080/14786451.2023.2244618
1. Introduction
The global power industry is under pressure to develop more efficient ways to operate and to reduce
the impact of variable operations on market supply and the environment (Sleiti, Kapat, and Vesely
2022). Minimal attention was given to energy management during the early 1970s (Petrecca 2014).
However, energy security was a big concern after the oil crisis, leading to more energy-efficient tech-
nologies and operations (Kaya and Keyes IV 1980). Studies published over the last few decades
focused predominantly on energy management, emphasising the need for more effective ways to
generate and use energy resources globally (Bunse et al. 2011; Narisco and Martins 2020; Schulze
et al. 2016). This need originates from the challenges that emerge by balancing the use of valuable
resources, the human population and providing proper living conditions (Narisco and Martins
2020).
Energy efficiency is the attempt to decrease the recourses used while considering resource
depletion, monetary savings, environmental impacts, and resource security (Lovins 2017; Patterson
1996). Energy efficiency is considered one of the primary contributors to a sustainable society and is
considered a critical aspect in all industries and countries. In South Africa, 2003 data indicate that
the mining sector consumes more than 175 PJ of energy annually. Thus, it is identified as the high-
est consumer of electricity (Oladiran and Meyer 2007). These energy-intensive operations could be
more sustainable, provided the government’s regulations on carbon emissions (Awuah-Offei 2016).
Numerous initiatives have been launched globally over the last few years to highlight the impor-
tance of energy efficiency (Narisco and Martins 2020), including, but not limited to, climate and
energy initiatives by the European environment agency (Climate & Energy Package 2020), and
the energy star programme (Energy Star Overview 2016). Given this high electricity demand, the
Chinese government and various other countries, including India (Mishra 2004) and Zimbabwe
(Murehwa et al. 2012), focus explicitly on thermal power plants for electricity generation. A thermal
plant consists of various boilers that feed into common steam receivers (Murehwa et al. 2012).
These feed sources are mainly coal, oil, nuclear or gas. Most power plants use coal to generate elec-
tricity (Bi et al. 2014; Murehwa et al. 2012).
A coal-fired boiler converts water into steam (Jagarnath 2019). Two boilers exist in coal-fired
thermal plants: the pulverised coal-fired boiler and the fluidised bed combustion boiler. In India,
70% of electricity is produced by coal-fired power plants, and in China, thermal power generation
contributes 73.4% to the total capacity (Bi et al. 2014; Mishra 2004). Coal generally has a low calo-
rific value and a high ash content (Mishra 2004). The problem is that the electricity generated from
these thermal power plants produces greenhouse gases, including nitrogen oxides (NOx ) and car-
bon monoxide (CO) (Bi et al. 2014). Therefore, the impact of power generation globally comprises
high environmental and ecological risks.
From an engineering viewpoint, Haddadin et al. (2023) indicated that a balance could be reached
by maintaining the current structure while increasing the use of renewable energy within power
plants. This prospect will provide a dual benefit for a cleaner environment amid a few changes
to the power system’s components and framework (Haddadin et al. 2023). Despite the need to
shift from coal to other renewable energy sources, the generation and production of electricity in
coal-fired thermal power plants will continue for at least another twelve years (Zima et al. 2022).
Coal-fired power plants generate 38% of electrical power worldwide (Kitto 1996). For this reason,
the focus should be on technological improvements, behavioural changes and the price or policy
tools to induce these changes towards optimal efficiency (Lovins 2017).
One operational objective of the coal-fired boiler is to reach optimal operating efficiency while
considering reliability and cost (Rambalee et al. 2011). Various technologies have been
implemented to optimise and stabilise these thermal plants. These technologies include model pre-
dictive control (MPC) and advanced process control (APC), which have been introduced to ensure
stable control of the boiler and to balance the burner management system (Rambalee et al. 2011).
These technologies contribute to reducing harmful emissions (i.e., NOx and CO).
With the rapid development of these technologies, a large quantity of data has been captured
(Sleiti, Kapat, and Vesely 2022). The volume of data is increasing in all sectors, which results in
more opportunities for improved efficiency (Narisco and Martins 2020). However, the structure
and volume of the datasets are becoming more complex and newer methods are required to deal
with these complexities (Sleiti, Kapat, and Vesely 2022). Most enterprises focus on machine learn-
ing (ML) to explore and operate these volumes of data (Narisco and Martins 2020). Machine learn-
ing is a subset of artificial intelligence (AI) (Shinde and Shah 2018). Ayodele (2010) stated that
analysing machine learning algorithms is classified as computational learning theory. Subsequently,
the learning is not considered consciousness learning but finding patterns in the data instead.
Machine learning algorithms provide valuable insight into the difficulty of learning relationships
in various environments (Ayodele 2010).
Machine learning techniques have been developed since the 1950s, and many advancements
have been made within the last few decades. Prominent machine learning applications include
the following industries: manufacturing (Wuest et al. 2016), retail (Jia, Zhao, and Tong 2013),
finance (Goodell et al. 2021), and the energy sector (Nabavi et al. 2020). There has been an increase
INTERNATIONAL JOURNAL OF SUSTAINABLE ENERGY 847
in academic papers on energy management and various new developments focusing specifically on
machine learning and sustainability. However, Prashar (2017) stated that there is still a lack of
knowledge on implementing these concepts for energy efficiency.
For this reason, this research study aims to investigate machine learning applications at coal-
fired thermal power plants for improved energy efficiency by conducting a systematic literature
review. To achieve the aim, the following objectives have been formulated:
The research paper is outlined as follows. First, a detailed literature study on machine learning
algorithms is provided in Section 2, specifically on supervised, unsupervised, semi-supervised and
reinforcement learning. Section 3 outlines the search protocol developed for the systematic litera-
ture review and discusses the results in Section 4. Conclusions and future recommendations based
on the results obtained in this study are provided in Sections 5 and 6, respectively.
difference between the two variables, which is the error signal, is sent back to the learning system.
The system evaluates the difference and adjusts the parameters accordingly to minimise the error
signal, the difference between the (Yi ) and(Ỹ i )variables. The primary purpose of this learning pro-
cess is to minimise the error signal of the training set, and to obtain the optimal set of learning sys-
tem parameters (Liu 2011).
The objective of training the artificial system is to estimate the parameters based on known data,
whereas testing aims to assess the predictions on unseen data. The data used in the training process
are not used in the testing of the system. For this reason, it is not guaranteed that the system will
perform the same in both instances. In addition, the training process is complex and lacks general-
izability. Generalizability is defined when the system is trained to minimise training error and com-
plexity (Liu 2011).
Supervised learning algorithms include, but are not limited to, linear classifiers such as the sup-
port vector machine, decision tree, and Bayesian networks (Ayodele 2010). Supervised learning can
be used for discriminative pattern classification since the classes can be manipulated. However, with
supervised learning, collecting and labelling unlabelled data is challenging. Various mistakes can
occur, and it is time-consuming (Liu 2011). In the following sections, various machine learning
algorithms are explained.
The hyperplanes in N-dimensional space separate two classes from each other. The primary goal
is to select the hyperplane with the maximum margin. A maximum margin is defined as the maxi-
mum distance between the two classes.
Gandhi (2018) stated that future data points could be classified with more confidence by max-
imising the distance. A hyperplane depends on the number of features. The hyperplane is a line if
the support vector machine has two features. However, if the number of features is three, the hyper-
plane is two-dimensional. A support vector is a data point closest to the hyperplane. Subsequently,
this vector influences the position and orientation of the hyperplane (Lovins 2017).
The average response of the training data of the group is considered the prediction for that group.
The function is applied to all the data, and the cost is calculated accordingly. For classification, the
following cost function is formulated:
K
Classification:G = ( pk (1 − pk )) (2)
k=1
The variable (pk ) is the proportion within each class k, k = 1, 2, . . . , K. A perfect class purity is
defined when the (pk ) value is either 1 or 0, meaning the group contains all inputs from a similar
class. A minimum number of training inputs on each leaf can be defined for a decision tree. This
will ensure that the tree stops splitting after reaching a specific number of splits. Subsequently, the
parameter known as the maximum depth can also be set, which is defined as the length of the long-
est path from the tree’s root to the leaf (Gupta 2017).
Other supervised machine learning techniques include the Bayesian network, Naive Bayes and
the Gaussian Process (GP). The Bayesian network is a probabilistic model that mainly uses Bayesian
inference to compute probabilities. The model consists of nodes and directed edges (Brownlee
2019). These models consider conditionally dependent and independent relationships between ran-
dom data points. Like the rest of the algorithms discussed in this section, these models are also
trained before predicting the probabilities of events.
The Naïve Bayes is defined as a probabilistic model based on the Bayes Theorem. The Naïve
Bayes algorithm is mainly applied to solve classification problems. Bayes Theorem is defined as fol-
lows (Chauhan 2022):
P(B|A)P(A)
P(A|B) = (3)
P(B)
where P(A|B) is the probability of A occurring, given B has already occurred. P(B|A) is the prob-
ability of B occurring given A has already occurred. P(A) is defined as the probability of A occur-
ring, and P(B) is the probability of B occurring.
850 C. BISSET ET AL.
The Gaussian Process (GP) is designed specifically to solve regression and probabilistic classifi-
cation problems. These processes are versatile, so various kernels can be specified, and the predic-
tion interpolates the observations. However, these processes lose efficiency in high-dimensional
spaces (Yi 2022).
Unlike supervised learning, unsupervised learning algorithms do not use labelled data (Ayodele
2010). Unsupervised learning is complex since the hidden patterns and relationships between the
data points are unknown. Unsupervised learning algorithms primarily aim to identify hidden pat-
terns and group similarities (Javatpoint 2022).
The unsupervised learning algorithm mainly addresses three problems (IBM Cloud Education
2020b): clustering, association, and dimensionality reduction. Clustering is defined as a method
that clusters or groups objects in one specific group that has similarities (Javatpoint 2022). An
association rule is a method that identifies the relationships between various variables in a dataset
(Javatpoint 2022). The size of the datasets used influences the performance. However, overfitting is
a common challenge when working with large data sets. Dimension reduction is applied to reduce
the number of features. This algorithm also preserves the integrity of the data while reducing the
variables to a reasonable size (IBM Cloud Education 2020b).
Unsupervised learning algorithms can solve complex problems in a considerable amount of
time. However, the output might not be as accurate as supervised learning since the data is not
labelled, and no result is known in advance (javatpoint 2022). One of the most popular unsuper-
vised learning algorithms is neural networks (NN). This algorithm is discussed in the following
section.
a powerful machine learning tool, specifically for image and speech recognition. A node is similar to
a linear regression model, consisting mainly of data, weights, a threshold value, and an output. A
node’s equation is defined as follows (IBM Cloud Education 2020a):
m
wi xi + bias = w1 x1 + w2 x2 + w3 x3 + . . . + wm xm + bias (4)
i=1
Weights (wi ) are assigned to the node when the input layer is determined. The weight depends
on the importance of the provided variable (IBM Cloud Education 2020a). An activation function is
used to determine the output. An activation function is defined as follows (IBM Cloud Education
2020a):
⎧
⎨ 1 if wi xi + b ≥ 0
Output = f (x) = i
(5)
⎩ 0 if wi xi + b , 0
i
If the predicted output of a specific node is above the threshold value, the node is activated, and data
is sent to the following layer of the entire network. Therefore, the output of one node becomes the
input of another. A feedforward neural network is defined when data is passed from one node to
another in the network (IBM Cloud Education 2020a).
There exist various types of artificial neural networks. Only the most common types are dis-
cussed in this paper. Figure 4 depicts the most basic neural network as demonstrated by Frank
Rosenblatt in 1958 (Rosenblatt 1958):
Feedforward neural networks are also referred to as multi-layer perceptions (MLPs). They are
comprised of sigmoid neurons and not perceptions since most real-world problems are non-linear.
Convolutional neural networks (CNN) typically use principles from linear algebra, specifically
matrix multiplication, to identify specific relationships in the data. Recurrent neural networks
(RNN) consist of various feedback loops. Time-series data are used to make precise predictions
(Rosenblatt 1958).
Johnson (2022) stated that one of the most common ways to train neural networks is to use the
backpropagation method. This method is applied by tuning the neural network weights by consid-
ering the error rate obtained from the previous epoch. As explained in Section 2.1, the primary pur-
pose of training any machine learning algorithm is to reduce the error rate to deliver a more reliable
model for accurate predictions. This approach is mostly followed in training neural networks since
it is fast and easy to programme. There are no parameters to tune except the number of inputs, and
it is considered flexible (Johnson 2022).
A study conducted by Smrekar et al. (2009) employed machine learning by developing an artifi-
cial neural network (ANN) model that predicts steam properties from a coal-fired boiler. The
authors considered three input parameters: steam pressure, temperature, and flow. The models
developed in this study yielded high accuracy on both the training and testing data. Furthermore,
the study demonstrated the usefulness of ANN modelling for coal-fired boilers. The authors stated
that some boiler processes, such as pulverises and combustion, are challenging to develop physical
models since the relations between these variables are not always known. For this reason, ANN
models are applicable since these types of models can consider most things known and unknown
(Smrekar et al. 2009).
Other unsupervised machine learning techniques include the K-Nearest Neighbor and the K-
means clustering. The K-Nearest Neighbor is applied to solve both classification and regression pro-
blems. The K-Nearest Neighbor assumes that similar characteristics exist between data points near
each other. The euclidean distance is used to calculate the distance between the data points (Har-
rison 2018).
The K-means clustering technique divides data points into K groups, where K is defined as the
number of clusters (IBM Cloud Education 2020b). The number of clusters is based on the distance
from each group’s centroid. All points allocated near the centroid are clustered into one category. A
small K-value indicates large groupings with less granularity, and a large K-value means smaller
groupings with more granularity.
Semi-supervised learning is defined when the data is only half labelled (IBM Cloud Education
2020b). Subsequently, unsupervised and semi-supervised learning is mainly applied to prevent
data from being incorrectly labelled.
exploration and exploitation are prevalent definitions. A decision must be made where both are
balanced (Bhatt 2019).
i=1 yi − y
i=1 i
n
Nagelkerke (1991) indicated that the coefficient of determination estimates the percentage of
variance explained in a given dataset. The root-mean-square error (RMSE) and the normalised
RMSE are estimated as illustrated in Equations (7–8)
1 n
RMSE = (yi − ŷi )2 (7)
n i=1
RMSE
NRMSE = × 100% (8)
ymax − ymin
The root-mean-square error (RMSE) estimates the standard deviation of the residuals between
the predicted values and the actual values of an observation (Otto 2019). Furthermore, the normal-
ised RMSE (NRMSE) enables comparing various models with different scales (Otto 2019). The
mean absolute error (MAE) and mean absolute percentage error (MAPE) are illustrated in
Equations (9–10)
1 n
MAE = |yi − ŷi | (9)
n i=1
1 n
yi − ŷi
MAPE = × 100% (10)
n i=1 yi
The mean absolute error (MAE) presents the error obtained between the predicted values and the
real values of the specific dataset (Qi et al. 2020). Furthermore, the mean absolute percentage error
(MAPE) estimates the average magnitude of the error between the predicted value and the actual
values of a specific dataset (De Myttenaere et al. 2016).
Baressi Šegota et al. (2020) stated that the K-fold cross-validation is part of the training pro-
cess of a machine learning model, specifically when the data sets used are small. First, the data is
split into k splits, where (k−1) subsets of the dataset are used for training, and the remaining
subset is used for testing. This process is repeated k times until all the splits have been used
as the testing split precisely once. Subsequently, this allows the entire dataset to be used as a test-
ing set systematically and provides more information on the model’s performance (Baressi
Šegota et al. 2020).
854 C. BISSET ET AL.
important to note that these variables have been selected since they represent energy efficiency and
cleaner production. The Mean Square Error was used as a performance metric to evaluate the
models’ effectiveness and performance (Haddadin et al. 2023).
From these case studies, it is evident that the application of machine learning is extended to
solve a wide range of problems in various areas, including renewable energy predictions, fault
detection and optimal power generation in coal-fired thermal power plants. Furthermore,
machine learning provides the ability to make more accurate decisions in real-time and deliver
sustainable solutions (Ghosh 2017). To further extend the machine learning analysis, an in-
depth investigation is conducted on the type of machine learning algorithms and how these algor-
ithms were applied in coal-fired thermal power plants for improved energy efficiency from 2012 -
2022. First, the research method of the systematic literature review is outlined, followed by a dis-
cussion of the results.
I. Research Question 1: Which machine learning algorithms were applied at coal-fired thermal
power plants for improved energy efficiency?
II. Research Question 2: How were these machine learning algorithms applied at coal-fired ther-
mal power plants for improved energy efficiency?
856 C. BISSET ET AL.
1. Machine learning
2. Energy efficiency
3. Coal-fired thermal power plant
A manual search was conducted, and only published scientific journal articles were selected. The
following databases were used to identify the relevant studies:
1. Science Direct
2. Scopus
3. Web of Science
4. IEEE Explore
5. Emerald Insight
1. Non-English articles
Eligibility All duplicates were removed after screening each study to ensure the studies met the inclusion and exclusion
criteria. No duplicates were identified.
1. Science Direct = 2
2. Scopus = 11
3. Web of Science = 0
4. IEEE Explore = 1
5. Emerald Insight = 0
Total = 14
Inclusion A full-text assessment was done on all 14 studies to investigate machine learning applications at coal-fired
thermal power plants for improved energy efficiency. The results are discussed in Section 4.
One researcher extracted the data, and the other checked the extraction. All three researchers
evaluated the selected studies to ensure that the studies adhered to the quality assessment
process.
4. Discussion of results
In this section, the results are discussed.
(Continued )
INTERNATIONAL JOURNAL OF SUSTAINABLE ENERGY 859
Table 2. Continued.
Type of machine learning model
# Title of the study applied Year Database Reference
integration of AI and IoT, specifically
in the smart city and energy sectors.
5 The Implementation of K-Means K-Means clustering 2022 Scopus Wahyuningtyas and
Clustering in Coal-fired Power Plant Asrol (2022)
Performance.
6 A systematic review of big data in Review paper: Energy computing 2022 Scopus Dhanalakshmi and
energy analytics using energy techniques in the energy-orientated Ayyanathan
computing techniques domain were investigated in this (2022)
study. The authors indicated an
increase in energy analytics due to
the rapid increase in data. In
conclusion, this study highlighted
that the thermal power plant, smart
grid, and power systems datasets
were extensively utilised, and the
artificial neural network (ANN) was
mainly applied.
7 Repowering Industrial Combined Other – This study is classified as a 2022 Scopus Variny and
Heat and Power Units: a gas turbine-based repowering Kšiňanová (2022)
contribution to Cleaner Energy study. Repowering industrial units
Production results in a reduction of greenhouse
gas emissions. This study specifically
focused on fuel and carbon
monoxide reduction. In conclusion,
two conservative repowering
options were identified, and both
indicated feasible economics with a
simple payback period.
8 Improvement of marine steam An Artificial neural network – A multi- 2020 Scopus Baressi Šegota et al.
turbine conventional exergy layered perceptron (MLP) (2020)
analysis by neural network
application
9 Optimisation of a 660 MWE The following machine learning 2020 Scopus Muhammad Ashraf
supercritical power plant methods were developed in this et al. (2020)
performance: A case of industry 4.0 study: 1. Artificial neural network
in the data-driven operational (ANN): 2. Multiple linear regression
management. Part 2: power (MLR): 3. The least-square vector
generation machine (LSSVM)
10 Artificial neural network and its Review paper: This study aimed to 2020 Scopus Babatunde (2020)
applications in the energy sector – summarise artificial intelligence
An overview methods’ significance in the energy
sector. The authors indicated that
artificial neural networks are
effective methods designed to solve
complex problems within
modelling, control, and
optimisation.
11 Data-driven framework for boiler Other – This study aimed to propose 2016 Scopus Nikula, Ruusunen,
performance monitoring a framework for monitoring a steam and Leiviskä
boiler’s real-time operations. The (2016)
boiler’s actual efficiency was
compared and monitored with the
expected efficiency. The
performance of the boiler is
measured using a statistical process
control chart. The framework was
verified using fluidised bed boiler
and corner-fired boiler historical
data. In conclusion, the strongest
correlations between the variables
and boiler efficiency were consistent
in both cases.
(Continued )
860 C. BISSET ET AL.
Table 2. Continued.
Type of machine learning model
# Title of the study applied Year Database Reference
12 Neuro-genetic model for crude oil An artificial neural network model 2016 Scopus Chiroma and
price prediction while considering combined with the genetic Abdulkareem
the impact of uncertainties algorithm (neuro-genetic) (2016)
13 A methodology for energy savings An Artificial neural network (ANN) 2015 Scopus Rossi and Velázquez
verification in the industry with the (2015)
application for a CHP (combined
heat and power) plant
14 Control Strategy for Denitrification A deep reinforcement learning (DLR) 2020 IEEE Fu et al. (2020)
Efficiency of a Coal-Fired Power model Explore
Plant Based on Deep Reinforcement
Learning
authors of these studies referred to other studies that contained the keywords used in this systematic
literature review. Therefore, these studies were included in the search. All reviews and other studies
are explained. The studies that applied machine learning models are further discussed in Table 3.
For this reason, only an overview of the model type is provided in Table 2.
Table 2 provides an overview of each study identified in the search by referring to the following:
Title, type of machine learning model applied in each study, published year, database, and the refer-
ence. According to the search protocol developed, the studies were further analysed by addressing
the two research questions.
4.1.1. Research question 1: which machine learning algorithms were applied in coal-fired
thermal power plants for improved energy efficiency?
Fourteen studies contained all three keywords: Machine learning, energy efficiency and coal-fired
thermal power plant. Two studies were identified in Science Direct, 11 in Scopus and one in
IEEE Xplore. Seven of these studies were published in 2022, four in 2020, two in 2016 and one
in 2015. These results are summarised in Figure 6.
Figure 6 demonstrates that most studies were identified in Scopus, and zero studies were
identified in Web of Science and Emerald Insight. Furthermore, most studies were published
recently, in 2022, and fewer studies were published before 2015. These results indicate the
increased attention machine learning applications have gained over the last decade at coal-
fired thermal power plants for improved energy efficiency. Various machine learning algorithms
were developed and applied in the 14 studies identified. Figure 7 provides an outline of these
types of algorithms.
The types of machine learning algorithms developed in these studies were identified as follows:
. K-Means clustering
. The artificial neural networks (ANN)
. Multiple linear regression (MLR)
. The least-square vector machine (LSSVM)
. Deep reinforcement learning (DRL) and deep learning.
Muhammad Ashraf et al. (2020) applied three different machine learning algorithms. For this
reason, 16 algorithms are identified in Figure 7. These results showed that artificial neural networks
were the most popular algorithm to solve the identified problems. The multi-layered perceptron
(MLP) was mainly developed and applied within these.
Four studies conducted a review: Yu et al. (2022), Wu et al. (2022), Dhanalakshmi and Ayya-
nathan (2022), and Babatunde (2020). These reviews did not particularly apply or develop machine
learning models but provided an overview of these algorithms. The ‘other’ studies identified in
Figure 7 did not apply or develop machine learning models: Zima et al. (2022), Variny and Kšiňa-
nová (2022), and (Nikula, Ruusunen, and Leiviskä 2016). However, the authors of these studies
referred to other studies that contained the keywords used in this systematic literature review.
Therefore, these studies were included in the search. From the studies identified, it is also important
to note that no review, like the one conducted in this study, was identified. Research question two is
addressed in the next section.
4.1.2. Research question 2: how were these machine learning algorithms applied in coal-
fired thermal power plants for energy efficiency?
A further review of the seven studies provided insight into how these algorithms were applied in
industry to address specific problems by focusing on the datasets, input parameters, training
method and results. Furthermore, to provide a more comprehensive overview of the current litera-
ture, a detailed overview is provided of each study’s contribution and potential future recommen-
dations. A description of each is provided in Table 3.
Table 3 provides an overview of how the different machine learning algorithms were applied in
each study, followed by an overview of each study’s contribution and future recommendations. The
seven studies developed and implemented machine learning models in various forms. As discussed
in Section 2, a specific process is followed in training supervised, unsupervised and reinforcement
learning algorithms.
Table 3 shows that four studies developed unsupervised learning algorithms, including artificial
neural networks and the K-means clustering algorithm. One study applied one supervised algor-
ithm (the least square vector machine), and two focused on deep learning or deep reinforcement
learning.
862
Table 3. Describes how machine learning models were applied in each study.
# Title of the study An overview of each machine learning application Contribution(s) and Future Research
1 Digital twin in energy industry: Proposed a robust This study did not specifically apply to machine learning. The contributions of this study include the following:
C. BISSET ET AL.
digital twin for power plants and other complex However, deep learning is used to assist the authors in
capital-intensive large engineering systems developing a database model. To achieve high reliability while 1. A comprehensive overview of digital twins subject to the
considering cost, a digital twin (DT) architecture is required for application, frameworks, and architectures for power plants.
future power plants. The paper consists of three sections: 2. Research related to digital twins and how these can
contribute towards energy savings applications.
. The authors provide an overview of digital twins and research 3. The authors also proposed a robust digital twin for power
regarding energy applications. plants.
. The design requirements for the digital twin are established
by considering physics-based formulation, sensor data Future research includes the following:
analysis, statistical analysis, and the system Genome.
. Valuable future recommendations are provided. 1. Any digital twin requires actual data and a description
representing a holistic view of the system, focusing explicitly
The authors highlighted that a data-driven approach alone is not on each part and connection.
sufficient for achieving improved reliability. They recommended 2. The authors suggested that data-driven algorithms capable of
that a low-order physics model should operate with the updated predicting dynamic system behaviour are still lacking in the
system parameters to enhance the results of the data-driven literature.
approach. 3. A data-driven model alone is not sufficient. For this reason,
The database model is designed to use ADL to detect and identify the authors proposed that a physics-based model must be
the differences between the dynamic systems model (DSM) and integrated with system parameters to improve the
the digital twin (DT). The sensor failures are identified using deep interpretation of the results from the data-driven processes.
learning and are sent back to the sensor database. The sensor 4. The authors noted that research related to cyber security with
database is updated accordingly. digital twins had not been reported in the literature, which
In conclusion, this study demonstrated the discrepancies between can be a future research study.
the dynamic system models (DSM), anomaly detection, and deep
learning (ADL). Using operational data, the vector autoregressive
model was applied for anomaly detection in utility gas turbines.
2 The Implementation of K-Means Clustering in Coal- The k-Means Clustering algorithm is developed for coal-fired The contributions of this study include the following:
fired Power Plant Performance power plant performance. Two attributes are required for
power plant performance: Equivalent Availability Factor (EAF) 1. The authors demonstrated the ability of machine learning to
and Equivalent Outage Rate (EFOR) The data set used in this analyse current power plant conditions successfully.
study is from 2020, and the data performance indices in these
investigations are the abovementioned attributes. The authors Future research includes the following:
obtained the data set from the Generation Availability
Information System (GAIS) database. The authors pre-processed 1. A real dataset is required to control current conditions to
the data to ensure data integrity. A number of 576 numerical improve the decision-making process.
datasets of the two attributes were pre-processed. The 2. Furthermore, the authors illustrated that a decision-support
relationship between the EAF and the EFOR was investigated to system may facilitate the decision-maker in making faster and
determine the characteristics of the power plant. The analysis more reliability decisions.
(Continued)
concluded a negative correlation between the two attributes,
and the authors did a further investigation. The features of the
input data determined the number of clusters, and the Elbow
method was applied to analyse the total number of clusters
used explicitly for power plant clustering. After an in-depth
investigation, 4 clusters were selected for optimal power plant
performance. This number of clusters produced each cluster’s
highest EAF value and the lowest EFOR value In conclusion, the
higher the EAF value and the lower the EFOR value, the better
the plant performs. For this reason, the authors evaluated these
values and made conclusions accordingly. Applying this
machine learning technique, the results indicated which plant
needs to be prioritised for maintenance.
3 Improvement of marine steam turbine conventional Neural networks were applied to improve the steam turbine’s The contributions of this study include the following:
exergy analysis by neural network application. conventional exergy. Subsequently, a multi-layered perceptron
(MLP) model is developed in Python programming language 1. Developing a neural network to improve the marine steam
using the Scikit-Learn library. The data set comprised 125 points turbine conventional exergy analysis.
for the ambient temperature, pressure, steam mass flow rate, 2. This study demonstrated the usefulness of Artificial
and exergy destruction and efficiency values for the high- intelligence and that these algorithms could be applied to
pressure cylinder, low-pressure cylinder, and the entire turbine. calculate the exergy destruction and exergy efficiency of a
The input variables were the steam temperature, pressure, and marine steam turbine.
863
performance. The K-fold cross-validation method was applied
864
Table 3. Continued.
# Title of the study An overview of each machine learning application Contribution(s) and Future Research
with ten folds, and the data set was randomly split into training
and testing data.
C. BISSET ET AL.
4 Optimisation of a 660 MWE supercritical power plant This study considers 24 thermo-electric parameters of a specific The contributions of this study include the following:
performance: A case of industry 4.0 in the data- power plant to model the generator power under different
driven operational management. Part 2: power power generation scenarios. These operating parameters were 1. The ability of a least-square vector machine (LSSVM) model is
generation. obtained from the power plant’s turbine, boiler, and generator demonstrated in this study. This model outperformed the
sides. The data set consisted of 1900 data points. The following multiple linear regression (MLR) models and the artificial
machine learning methods were developed in this study: neural network (ANN). This model’s reliable performance in
modelling generator power has been highlighted in this
1. Artificial neural network (ANN): The feedforward study.
backpropagation algorithm is developed in this study. The 2. In this study, the power plant’s operation control has been
authors used the gradient descent with momentum as a improved by employing data-driven optimisation strategies.
training function. The tangent hyperbolic and purelin were
employed as transfer functions in both the hidden and output
layer of the ANN model.
2. Multiple linear regression (MLR): The least square vector
machine (LSSVM) was used to develop this model. First, the
difference (referred to as residuals) between the actual and
observed values was calculated. Second, the sum of the
square of residuals is minimised to obtain the best fit i.e. least
squares minimisation was employed.
3. The least-square vector machine (LSSVM): This model was
trained based on the structure risk minimisation (SRM)
principle. The relationship between the input and output
variables is determined by using a Gaussian kernel function.
All variables were standardised for model training. The
authors used the Bayesian optimiser and the expected
improvement per second in conjunction with the acquisition
function to optimise the model under 30 epochs.
After training the MLR, ANN and LSSVM models, the models were
validated using new (unseen) data. The unseen data were defined
as generator power data and were not included in the training
dataset. The coefficient of determination, mean absolute
percentage error (MAE), root-mean-square error (RMSE),
normalised RMSE (NRMSE), and the mean absolute percentage
error (MAPE) were estimated to analyse the model’s robustness
and effectiveness. The results demonstrated that the LSSVM
outperformed the other two models in predicting the plant’s
(Continued)
generator power. This model was further used to predict the
generator power of the plant while considering numerous
operating strategies.
5 Neuro-genetic model for crude oil price prediction The authors proposed a neural network model combined with the The contributions of this study include the following:
while considering the impact of uncertainties genetic algorithm (neuro-genetic) to predict the crude oil price
while considering uncertainty. The parameters optimised are 1. The authors proposed an artificial intelligence approach to
the neural network’s bias, weights, and hidden layer nodes. The predict the crude oil price while considering uncertainties. The
values for the input neurons, the output neurons, the activation neuro-genetic model developed outperformed the neural
function, the weights, and the hidden layers were defined prior network (NN) and the support vector machine (SVM).
to training the model. The model was trained by splitting the 2. Considering uncertainty in decision-making provides a more
dataset into 80% training and 20% testing. The optimal realistic and practical application of crude oil prediction.
parameter values were obtained through trial and error, and the
sensitivity of the neuro-genetic model was analysed. The Future research includes the following:
models were trained on two datasets obtained from the Energy
Information Administration of the US Department of Energy. 1. Advancing and updating the current model to consider the
The first dataset used in this study was obtained from May 1987 prediction of crude oil in different prediction horizons.
to July 1991, excluding uncertainties. The second data set, 2. The framework may be modified to be applied to optimise
including uncertainties, was obtained from August 1990 to nuclear data libraries and nitride fuel.
February 1991. The proposed method was implemented using
Matlab 2012b.
The performance of the neuro-genetic model was investigated
865
Table 3. Continued.
866
# Title of the study An overview of each machine learning application Contribution(s) and Future Research
7 Control Strategy for Denitrification Efficiency of a Coal- In this study, a 1000MW unit is used as a case study. A deep The contributions of this study include the following:
Fired Power Plant Based on Deep Reinforcement reinforcement learning (DLR) model is developed with a long
C. BISSET ET AL.
Learning short-term memory model, the Asynchronous Advantage Actor- 1. One of the primary contributions is that the approach
Critic algorithm (A3C). The data used in this study was obtained proposed in this study demonstrates the ability to be feasible
from the plant’s Distributed Control System (DCS). The authors and universally applicable in industry.
used 10000 sets of data from a 1000MW coal-fired boiler.
Variables, including, but not limited to, spray ammonia mass Future research includes the following:
flow, Inlet mass concentration, boiler load, inlet mass and inlet
flue gas temperature, were identified. The data was pre- 1. The authors indicated that the following should be further
processed, followed by using this data as input for the Long investigated: the control method of the combustion process,
short-term memory neural network (LSTM). Subsequently, this the system cost of coal-fired power plants and the
LSTM model was used for the DRL’s environment. The DLR denitrification process. These elements should be investigated
method used the state value output by the environment as an to improve the stability and reliability of control strategies.
input to produce a new updated action value. By conducting
several iterations, a new control strategy of SCR was obtained.
The Back Propagation algorithm was used to train the LSTM. For
the training, six main variables were defined as input variables;
the model consisted of 3 LSTM layers and 64 nodes per layer.
The model was optimised using the Adam optimiser algorithm,
and the data were split into 20% testing and 80% training data.
Two performance metrics were used: the root mean square
error (RMSE) and the R-squared. The accuracy of the efficiency
calculated was 91.7%. In conclusion, it was shown that the
approach is feasible and universally applicable.
INTERNATIONAL JOURNAL OF SUSTAINABLE ENERGY 867
There are similarities between the studies that applied unsupervised learning algorithms.
Muhammad Ashraf et al. (2020), Baressi Šegota et al. (2020) and Rossi and Velázquez (2015)
used the backpropagation algorithm to train the artificial neural networks. Typical performance
metrics used to determine the performance of the ANN included the following: The coefficient
of determination, the root mean square error (RMSE), and the mean absolute error (MAE).
Most other algorithms followed a similar approach, as explained in Section 2.
Multiple researchers emphasised the value of artificial neural networks. Artificial neural network
models are flexible and can be applied to problems even if much of the information is unknown
(Smrekar et al. 2009). These algorithms are designed to solve complex problems related explicitly
to modelling, control and optimisation (Babatunde 2020). Furthermore, these algorithms can
model complex non-linear relationships between inputs and outputs, which is much more challen-
ging than traditional models (Chiroma and Abdulkareem 2016).
Wu et al. (2022) highlighted that the articles identified in their review demonstrated the impor-
tance and the applicability of various machine learning algorithms, including decision trees, sup-
port vector machines (SVM), and artificial neural networks (ANN), specifically feedforward deep
neural networks and recurrent neural networks. Another systematic review conducted by Dhana-
lakshmi and Ayyanathan (2022) revealed that the artificial neural network (ANN) was mostly
applied in the identified studies.
The type of machine learning model employed depends on the volume of data and the pro-
blem identified. However, as these different studies are evaluated, it is evident that there is a
common thread across all these research studies identified from conducting the review. Specifi-
cally, most researchers suggested that artificial neural networks be used when working with
large datasets and complex problems, especially those related to industrial processes and
power plants.
Evidently, each study contributed to academia and industry by incorporating realistic and prac-
tical approaches. The studies by Sleiti, Kapat, and Vesely (2022) and Wahyuningtyas and Asrol
(2022) indicated the importance of a decision support system for optimal decision-making.
These studies illustrated that a data-driven approach alone is insufficient, and a tool is required
to integrate it with system parameters for optimal decision-making. Furthermore, the importance
of actual data is highlighted in Table 3 by Sleiti, Kapat, and Vesely (2022) and Wahyuningtyas and
Asrol (2022). Real data will improve decision-making and provide an accurate view of the system
and its components, specifically when applying machine learning.
From the review, three studies mentioned machine learning: Zima et al. (2022), Variny and Kši-
ňanová (2022), and Nikula, Ruusunen, and Leiviskä (2016), four conducted a review: Yu et al.
(2022), Wu et al. (2022), Dhanalakshmi and Ayyanathan (2022), and Babatunde (2020), and
seven applied machine learning at a coal-fired thermal power plant for energy efficiency (Baressi
Šegota et al. 2020; Chiroma and Abdulkareem 2016; Fu et al. 2020; Rossi and Velázquez 2015; Sleiti,
Kapat, and Vesely 2022; Wahyuningtyas and Asrol 2022). The seven studies applied different
machine learning algorithms, including K-means clustering, multiple linear regression (MLR),
the least-square vector machine (LSSVM), deep reinforcement learning (RL) and artificial neural
networks (ANN). Typical performance metrics used to determine the performance of the ANN
included the following: The coefficient of determination, the root mean square error (RMSE),
and the mean absolute error (MAE).
Various researchers emphasised the value of artificial neural networks. Artificial neural network
models are flexible and can be applied to problems even if all the information is unknown (Smrekar
et al. 2009). The models can solve complex problems related explicitly to process modelling, control
and optimisation. Furthermore, these algorithms can model complex non-linear relationships
between inputs and outputs, which is much more challenging than traditional models (Chiroma
and Abdulkareem 2016). The studies by Sleiti, Kapat, and Vesely (2022) and Wahyuningtyas and
Asrol (2022) indicated the importance of a decision support system for optimal decision-making.
These studies illustrate that a data-driven approach alone is insufficient, and a tool is required to
integrate with system parameters for optimal decision-making. The results obtained from this
study underline the significance of some applications of machine learning algorithms. For instance,
the artificial neural network is famous for detecting non-linear patterns and predicting the behav-
iour of most industrial processes and power plants (Rossi and Velázquez 2015). The potential future
recommendations are identified and explained in Section 6.
Another limitation identified in the literature is applying the long short-term memory network
(LSTM) technique. The LSTM has been developed and applied in various industries because it can
process time series data (Pan et al. 2021). Furthermore, the LSTM can remember previous infor-
mation and use this memory to calculate the current output (Pan et al. 2021).
For future research, it is recommended that this modelling technique is applied for optimal oper-
ational control of various units at coal-fired thermal power plants. Applying the LSTM may also
contribute towards improving the energy efficiency of coal-fired boilers.
Furthermore, neural network predictive control (NNPC) is a model integrated with predictive
control (MPC). One of the primary contributors of this model is its ability to be employed in
real-time for simple controls and disturbance reduction (Megahed, Abdelkader, and Zakaria
2019). No such model has been identified in this review. For this reason, it may be valuable to inves-
tigate the application of such a model to enhance a power plant’s automation and operational
decision-making. It is recommended that the readers refer to Table 3 for a more comprehensive
overview of each study’s potential future research opportunities. These opportunities highlight
the limitations of current literature and may be investigated for future research.
Disclosure statement
No potential conflict of interest was reported by the author(s).
ORCID
C. Bisset http://orcid.org/0000-0001-5296-5945
P. V. Z. Venter http://orcid.org/0000-0003-4963-6248
R. Coetzer http://orcid.org/0000-0002-7505-3911
References
Albliwi, S., J. Antony, S. A. H. Lim, and T. van der Wiele. 2014. “Critical Failure Factors of Lean Six Sigma: A
Systematic Literature Review.” International Journal of Quality & Reliability Management 31 (9): 1012–1030.
Alvarez Quiñones, L. I., C. A. Lozano-Moncada, and D. A. Bravo Montenegro. 2023. “Machine Learning for
Predictive Maintenance Scheduling of Distribution Transformers.” Journal of Quality in Maintenance
Engineering 29 (1): 188–202. https://doi.org/10.1108/JQME-06-2021-0052
Awuah-Offei, K. 2016. “Energy Efficiency in Mining: A Review with Emphasis on the Role of Operators in Loading
and Hauling Operations.” Journal of Cleaner Production 117: 89–97. https://doi.org/10.1016/j.jclepro.2016.01.035
Ayodele, T. O. 2010. Types of Machine Learning Algorithms. In www.intechopen.com. IntechOpen. https://www.
intechopen.com/chapters/10694.
Babatunde, D. E. 2020. Artificial Neural Network and Its Applications in the Energy Sector: An Overview. 670216917.
Baressi Šegota, S., I. Lorencin, N. Anđelić, V. Mrzljak, and Z. Car. 2020. “Improvement of Marine Steam Turbine
Conventional Exergy Analysis by Neural Network Application.” Journal of Marine Science and Engineering 8
(11): 884. https://doi.org/10.3390/jmse8110884
Bhatt, S. 2019. Reinforcement Learning 101. [online] Medium. https://towardsdatascience.com/reinforcement-
learning-101-e24b50e1d292.
Bi, G. B., W. Song, P. Zhou, and L. Liang. 2014. “Does Environmental Regulation Affect Energy Efficiency in China’s
Thermal Power Generation? Empirical Evidence from a Slacks-Based DEA Model.” Energy Policy 66: 537–546.
https://doi.org/10.1016/j.enpol.2013.10.056
Boser, B. E., I. M. Guyon, and V. N. Vapnik. 1992. “A Training Algorithm for Optimal Margin Classifiers.” In
Proceedings of the Fifth Annual Workshop on Computational Learning Theory, 144–152, July.
Brownlee, J. 2019. A Gentle Introduction to Bayesian Belief Networks. [online] Machine Learning Mastery. https://
machinelearningmastery.com/introduction-to-bayesian-belief-networks/.
Bunse, K., M. Vodicka, P. Schönsleben, M. Brülhart, and F. O. Ernst. 2011. “Integrating Energy Efficiency
Performance in Production Management–gap Analysis Between Industrial Needs and Scientific Literature.”
Journal of Cleaner Production 19 (6–7): 667–679. https://doi.org/10.1016/j.jclepro.2010.11.011
Chauhan, N. 2022. Naïve Bayes Algorithm: Everything You Need to Know. [online] KDnuggets. https://www.
kdnuggets.com/2020/06/naive-bayes-algorithm-everything.html.
870 C. BISSET ET AL.
Chiroma, H., and S. Abdulkareem. 2016. “Neuro-genetic Model for Crude oil Price Prediction While Considering the
Impact of Uncertainties.” International Journal of Oil, Gas and Coal Technology 12 (3): 302–333.
Climate & energy package. 2020. Climate.ec.europa.eu. https://climate.ec.europa.eu/eu-action/climate-strategies-
targets/2020-climate-energy-package_en.
De Myttenaere, A., B. Golden, B. Le Grand, and F. Rossi. 2016. “Mean Absolute Percentage Error for Regression
Models.” Neurocomputing 192: 38–48. https://doi.org/10.1016/j.neucom.2015.12.114
Dhanalakshmi, J., and N. Ayyanathan. 2022. “A Systematic Review of Big Data in Energy Analytics Using Energy
Computing Techniques.” Concurrency and Computation: Practice and Experience 34 (4): e6647. https://doi.org/
10.1002/cpe.6647
Energy Star Overview. 2016. Energystar.gov. https://www.energystar.gov/about.
Fu, J., H. Xiao, H. Wang, and J. Zhou. 2020. “Control Strategy for Denitrification Efficiency of Coal-Fired Power Plant
Based on Deep Reinforcement Learning.” IEEE Access 8: 65127–65136. https://doi.org/10.1109/ACCESS.2020.
2985233
Gandhi, R. 2018. Support Vector Machine — Introduction to Machine Learning Algorithms. Towards Data Science;
Towards Data Science, June 7. https://towardsdatascience.com/support-vector-machine-introduction-to-
machine-learning-algorithms-934a444fca47.
Ghosh, P. 2017. The Value of Machine Learning: Benefits and Best Practices - Dataversity. Los Angeles, CA:
Dataversity, June 8. https://www.dataversity.net/value-machine-learning-benefits-best-practices/
Goodell, J. W., S. Kumar, W. M. Lim, and D. Pattnaik. 2021. “Artificial Intelligence and Machine Learning in Finance:
Identifying Foundations, Themes, and Research Clusters from Bibliometric Analysis.” Journal of Behavioral and
Experimental Finance 32: 100577. https://doi.org/10.1016/j.jbef.2021.100577.
Gupta, P. 2017. Decision Trees in Machine Learning. [online] Towards Data Science. Available at: https://
towardsdatascience.com/decision-trees-in-machine-learning-641b9c4e8052.
Haddad, A., and O. Mohamed. 2022. “Qualitative and Quantitative Comparison of Three Modeling Approaches for a
Supercritical Once-Through Generation Unit.” International Journal of Energy Research 46 (14): 20780–20800.
https://doi.org/10.1002/er.8330
Haddadin, M., O. Mohamed, W. Abu Elhaija, and M. Matar. 2023. “Performance Prediction of a Clean Coal Power
Plant via Machine Learning and Deep Learning Techniques.” Energy & Environment. 0958305X231160590.
Harrison, O. 2018. Machine Learning Basics with the K-Nearest Neighbors Algorithm. [online] Medium. https://
towardsdatascience.com/machine-learning-basics-with-the-k-nearest-neighbors-algorithm-6a6e71d01761.
IBM Cloud Education. 2020a. What are Neural Networks? [online] www.ibm.com. Accessed 05 August 05, 2023.
Available at: https://www.ibm.com/topics/neural-networks.
IBM Cloud Education. 2020b. What is Unsupervised Learning? Www.ibm.com; IBM Cloud Education, September 21.
https://www.ibm.com/cloud/learn/unsupervised-learning#:~:text=Unsupervised%20learning%2C%20also%
20known%20as.
Jagarnath Mahato. 2019. Jagarnath Mahato. [online] Coal Handling Plants. https://www.coalhandlingplants.com/
boiler-in-thermal-power-plant/.
javatpoint. 2022. “Supervised Machine Learning - Javatpoint.” Www.javatpoint.com, 2022, www.javatpoint.com/
supervised-machine-learning.
Jia, L., Q. Zhao, and L. Tong. 2013. Retail pricing for stochastic demand with unknown parameters: An online
machine learning approach. In 2013 51st Annual Allerton Conference on Communication, Control, and
Computing (Allerton) (pp. 1353-1358). IEEE. 2013, October.
Johnson, D. 2022. Back Propagation Neural Network: Explained With Simple Example. [online] www.guru99.com.
https://www.guru99.com/backpropogation-neural-network.html.
Kaya, A., and M. A. Keyes IV. 1980. “Energy Management Technology in Pulp, Paper and Allied Industries.” IFAC
Proceedings Volumes 13 (12): 609–622. https://doi.org/10.1016/S1474-6670(17)69499-0
Khalid, S., H. Hwang, and H. S. Kim. 2021. “Real-world Data-Driven Machine-Learning-Based Optimal Sensor
Selection Approach for Equipment Fault Detection in a Thermal Power Plant.” Mathematics 9 (21): 2814.
https://doi.org/10.3390/math9212814
Khalid, S., W. Lim, H. S. Kim, Y. T. Oh, B. D. Youn, H. S. Kim, and Y. C. Bae. 2020. “Intelligent Steam Power Plant
Boiler Waterwall Tube Leakage Detection via Machine Learning-Based Optimal Sensor Selection.” Sensors 20 (21):
6356. https://doi.org/10.3390/s20216356
Khan, R. H., and J. Y. Khan. 2013. “A Comprehensive Review of the Application Characteristics and Traffic
Requirements of a Smart Grid Communications Network.” Computer Networks 57 (3): 825–845. https://doi.
org/10.1016/j.comnet.2012.11.002
Kitchenham, B., O. P. Brereton, D. Budgen, M. Turner, J. Bailey, and S. Linkman. 2009. “Systematic Literature
Reviews in Software Engineering–a Systematic Literature Review.” Information and Software Technology 51 (1):
7–15. https://doi.org/10.1016/j.infsof.2008.09.009
Kitto, J. B. 1996. “Developments in Pulverized Coal-Fired Boiler Technology.” In Missouri Valley Electric Association
Engineering Conference, 1–11, April.
INTERNATIONAL JOURNAL OF SUSTAINABLE ENERGY 871
Kumar, A., and G. Garg. 2020. “Systematic Literature Review on Context-Based Sentiment Analysis in Social
Multimedia.” Multimedia Tools and Applications 79 (21–22): 15349–15380. https://doi.org/10.1007/s11042-019-
7346-5
Liu, B. 2011. “Supervised Learning.” In Web Data Mining, 63–132. Berlin, Heidelberg: Springer.
Lovins, A. 2017. “Energy Efficiency.” Energy Economics 1: 234–258.
Makala, B., and T. Bakovic. 2020. Artificial Intelligence in the Power Sector. EMCompass No. 81. Washington,
DC: International Finance Corporation. https://openknowledge.worldbank.org/handle/10986/34303.
Mangaroo-Pillay, M., and R. Coetzee. 2022. “Lean Frameworks: A Systematic Literature Review (SLR) Investigating
Methods and Design Elements.” Journal of Industrial Engineering and Management 15 (2): 202–214. https://doi.
org/10.3926/jiem.3677
Megahed, T. F., S. M. Abdelkader, and A. Zakaria. 2019. “Energy Management in Zero-Energy Building Using Neural
Network Predictive Control.” IEEE Internet of Things Journal 6 (3): 5336–5344. https://doi.org/10.1109/JIOT.
2019.2900558
Mishra, U. C. 2004. “Environmental Impact of Coal Industry and Thermal Power Plants in India.” Journal of
Environmental Radioactivity 72 (1-2): 35–40. https://doi.org/10.1016/S0265-931X(03)00183-8
Muhammad Ashraf, W., G. Moeen Uddin, A. Hassan Kamal, M. Haider Khan, A. A. Khan, H. Afroze Ahmad, F.
Ahmed, et al. 2020. “Optimisation of a 660 MWe Supercritical Power Plant Performance—a Case of Industry
4.0 in the Data-Driven Operational Management. Part 2. Power Generation.” Energies 13 (21): 5619. https://
doi.org/10.3390/en13215619
Murehwa, G., D. Zimwara, W. Tumbudzuku, and S. Mhlanga. 2012. “Energy Efficiency Improvement in Thermal
Power Plants.” International Journal of Innovative Technology and Exploring Engineering (IJITEE) 2 (1): 20–25.
Nabavi, S. A., A. Aslani, M. A. Zaidan, M. Zandi, S. Mohammadi, and N. Hossein Motlagh. 2020. “Machine Learning
Modeling for Energy Consumption of Residential and Commercial Sectors.” Energies 13 (19): 5171. https://doi.
org/10.3390/en13195171
Nagelkerke, N. J. 1991. “A Note on a General Definition of the Coefficient of Determination.” Biometrika 78 (3): 691–
692. https://doi.org/10.1093/biomet/78.3.691
Narciso, D. A., and F. G. Martins. 2020. “Application of Machine Learning Tools for Energy Efficiency in Industry: A
Review.” Energy Reports 6: 1181–1199.
Nikula, R. P., M. Ruusunen, and K. Leiviskä. 2016. “Data-driven Framework for Boiler Performance Monitoring.”
Applied Energy 183: 1374–1388. https://doi.org/10.1016/j.apenergy.2016.09.072
Oladiran, M. T., and J. P. Meyer. 2007. “Energy and Exergy Analyses of Energy Consumptions in the Industrial Sector
in South Africa.” Applied Energy 84 (10): 1056–1067. https://doi.org/10.1016/j.apenergy.2007.02.004
Ongsulee, P. 2017. “Artificial Intelligence, Machine Learning and Deep Learning.” In 2017 15th International
Conference on ICT and Knowledge Engineering (ICT&KE), 1–6. IEEE. November.
Otto, S. 2019. How to normalise the RMSE. Www.marinedatascience.co, January 7. https://www.marinedatascience.
co/blog/2019/01/07/normalizing-the-rmse/.
Pan, H., T. Su, X. Huang, and Z. Wang. 2021. “LSTM-based Soft Sensor Design for Oxygen Content of Flue gas in
Coal-Fired Power Plant.” Transactions of the Institute of Measurement and Control 43 (1): 78–87. https://doi.org/
10.1177/0142331220932390
Patterson, M. G. 1996. “What is Energy Efficiency?: Concepts, Indicators and Methodological Issues.” Energy Policy
24 (5): 377–390. https://doi.org/10.1016/0301-4215(96)00017-1
Pejić-Bach, M., and N. Cerpa. 2019. “Planning, Conducting and Communicating Systematic Literature Reviews.”
Journal of Theoretical and Applied Electronic Commerce Research 14 (3): 1–4. https://doi.org/10.4067/S0718-
18762019000300101
Petrecca, G. 2014. “Energy Conversion and Management.” Cham, Switzerland, 101–103.
Prashar, A. 2017. “Energy Efficiency Maturity (EEM) Assessment Framework for Energy-Intensive SMEs: Proposal
and Evaluation.” Journal of Cleaner Production 166: 1187–1201. https://doi.org/10.1016/j.jclepro.2017.08.116
Qi, J., J. Du, S. M. Siniscalchi, X. Ma, and C. H. Lee. 2020. “On Mean Absolute Error for Deep Neural Network Based
Vector-to-Vector Regression.” IEEE Signal Processing Letters 27: 1485–1489. https://doi.org/10.1109/LSP.2020.
3016837
Rambalee, P., G. Gous, P. G. R. De Villiers, N. Mcculloch, and G. Humphries. 2011. Power Control: Advanced Process
Control. Control Engineering, June 20. https://www.controleng.com/articles/power-control-advanced-process-
control/
Rosenblatt, F. 1958. “The Perceptron: A Probabilistic Model for Information Storage and Organisation in the Brain.”
Psychological Review 65 (6): 386. https://doi.org/10.1037/h0042519
Rossi, F., and D. Velázquez. 2015. “A Methodology for Energy Savings Verification in Industry with Application for a
CHP (Combined Heat and Power) Plant.” Energy 89: 528–544. https://doi.org/10.1016/j.energy.2015.06.016
Schulze, M., H. Nehler, M. Ottosson, and P. Thollander. 2016. “Energy Management in Industry–a Systematic Review
of Previous Findings and an Integrative Conceptual Framework.” Journal of Cleaner Production 112: 3692–3708.
https://doi.org/10.1016/j.jclepro.2015.06.060
872 C. BISSET ET AL.
Shinde, P. P., and S. Shah. 2018. “A Review of Machine Learning and Deep Learning Applications.” In 2018 Fourth
international conference on computing communication control and automation (ICCUBEA) (pp. 1–6). IEEE,
August.
Sleiti, A. K., J. S. Kapat, and L. Vesely. 2022. “Digital Twin in Energy Industry: Proposed Robust Digital Twin for
Power Plant and Other Complex Capital-Intensive Large Engineering Systems.” Energy Reports 8: 3704–3726.
https://doi.org/10.1016/j.egyr.2022.02.305
Smrekar, J., M. Assadi, M. Fast, I. Kuštrin, S. De, 2009. “Development of Artificial Neural Network Model for a Coal-
Fired Boiler Using Real Plant Data.” Energy 34 (2): 144–152.
Sogabe, T., D. B. Malla, S. Takayama, S. Shin, K. Sakamoto, K. Yamaguchi, T. P. Singh, M. Sogabe, T. Hirata, and Y.
Okada. 2018. Smart grid optimisation by deep reinforcement learning over discrete and continuous action space.
In 2018 IEEE 7th World Conference on Photovoltaic Energy Conversion (WCPEC)(A Joint Conference of 45th IEEE
PVSC, 28th PVSEC & 34th EU PVSEC) (pp. 3794-3796). IEEE, June.
Song, M. L., L. Yang, J. Wu, and W. D. Lv. 2013. “Energy Saving in China: Analysis on the Energy Efficiency via
Bootstrap-DEA Approach.” Energy Policy 57: 1–6. https://doi.org/10.1016/j.enpol.2012.11.001
Variny, M., and M. Kšiňanová. 2022. “Repowering Industrial Combined Heat and Power Units: A Contribution to
Cleaner Energy Production.” Polish Journal of Environmental Studies 31 (3): 2861–2879. https://doi.org/10.15244/
pjoes/144096
Virtualitics. 2021. Using Machine Learning in the Energy Sector. Blog.virtualitics.com, November 24. https://blog.
virtualitics.com/using-machine-learning-in-the-energy-sector.
Wahyuningtyas, D., and M. Asrol. 2022. “The Implementation of K-Means Clustering in Coal-Fired Power Plant
Performance.” International Journal of Emerging Technology and Advanced Engineering 12 (3): 1–8. https://doi.
org/10.46338/ijetae0322_01
Wu, S. R., G. Shirkey, I. Celik, C. Shao, and J. Chen. 2022. “A Review on the Adoption of AI, BC, and IoT in
Sustainability Research.” Sustainability 14 (13): 7851. https://doi.org/10.3390/su14137851
Wuest, T., D. Weimer, C. Irgens, and K.-D. Thoben. 2016. “Machine Learning in Manufacturing: Advantages,
Challenges, and Applications.” Production & Manufacturing Research, [Online] 4 (1): 23–45. https://doi.org/10.
1080/21693277.2016.1192517.
Yi, W. 2022. Understanding Gaussian Process, the Socratic Way. [online] Medium. https://towardsdatascience.com/
understanding-gaussian-process-the-socratic-way-ba02369d804.
Yu, W., P. Patros, B. Young, E. Klinac, and T. G. Walmsley. 2022. “Energy Digital Twin Technology for Industrial
Energy Management: Classification, Challenges and Future.” Renewable and Sustainable Energy Reviews 161:
112407. https://doi.org/10.1016/j.rser.2022.112407
Zima, W., J. Taler, S. Grądziel, M. Trojan, A. Cebula, P. Ocłoń, P. Dzierwa, et al. 2022. “Thermal Calculations of a
Natural Circulation Power Boiler Operating Under a Wide Range of Loads.” Energy 261: 125357. https://doi.org/
10.1016/j.energy.2022.125357