Deep Learning for Structural Health
Deep Learning for Structural Health
Hung V. Dang , Mohsin Raza , Tung V. Nguyen , T. Bui-Tien & Huan X. Nguyen
To cite this article: Hung V. Dang , Mohsin Raza , Tung V. Nguyen , T. Bui-Tien & Huan X.
Nguyen (2020): Deep learning-based detection of structural damage using time-series data,
Structure and Infrastructure Engineering, DOI: 10.1080/15732479.2020.1815225
Article views: 10
the DL models are applicable for various tasks such as dam- architectures such as Long Short Term Memory (LSTM) and
age detection, damage localization, damage severity, etc. by its simplified version Gated Recurrent Unit were developed by
fine-tuning a few last layers of the architecture or using scientists. Zhao, Wang, Yan, and Mao (2016) developed two
transfer learning techniques. Therefore, researchers in the LSTM-based methods, namely basic LSTMs, and Deep LSTMs,
engineering community have recently applied different for SHM of high-speed CNC machines using sensory data.
neural networks models for SHM, such as multi-layer per- Their results confirmed that LSTM network could perform
ceptron (MLP), Recurrent Neural Networks (RNN), much better than a number of baseline methods. Yuan, Wu,
Convolution Neural Networks (CNN). and Lin (2016) investigated the remaining useful life of aero-
A MLP consists of an input layer, single or multiple hid- engine utilizing LSTM under various operation modes and sev-
den layers, and an output layer. The number of hidden eral degradation scenarios. They found that the standard ver-
layers and the number of neurons per layer are hyperpara- sion of LSTM itself has a strong ability to achieve accurate
meters of the network, usually determined via some search (both long term and short term) prediction during the degrad-
strategies such as grid search or random search. To detect ation process. Lei, Liu, and Jiang (2019) developed LSTM-based
faults in rotary machinery components, Lu, Wang, and method for fault diagnosis of wind turbines based on multi-
Zhou (2017) developed a stacked autoencoder network com- sensor time-series signals. In their study, LSTM achieved best
posed of three hidden layers, providing direct health state performance among neural network architectures, including
classification from raw time series dataset without any prior the vanilla RNN, MLP, and Deep CNN.
signal processing techniques. Being interested in fault diag- The 2DCNN is a well-known architecture in computer
nosis of rotating machinery, Ma, Sun, and Chen (2018) built vision which helps to build several deep learning applica-
a novel coupling deep neural networks which could capture tions surpassing human experts in image classification and
joint information among multimodal sensory data to achieve object recognition. In order to leverage the advances of
high accurate prediction results. Galloway, Catterson, Fay, 2DCNNs, researchers have developed various techniques to
Robb, and Love (2016) studied fault diagnosis of tidal tur- convert one-dimensional sensory data into two-dimensional
bines by implementing a deep network. It was found that images, before feeding to a 2DCNN model, designed for
their deep learning approach achieved better results com- damage detection tasks. Wang and Gao (2017) tackled the
pared to feature-based methods involving k-nearest neigh- fault classification of the gearbox with the help of a
bor classifier, support vector machine, and decision trees. 2DCNN-based method and time-scale images received from
Another deep learning architecture for handling time-ser- wavelet transformations of vibration signals. The proposed
ies data is one-dimensional Convolutional Neural Network method could attain an accuracy of more than 99% in clas-
(1DCNN), which applies the convolution operator on sifying different fault types in the tested gears. Lu et al.
sequences of data points to extract informative features. (2017) investigated bearing fault diagnosis by means of a
Abdeljaber et al. (2018) proposed a 1DCNN model to detect 2DCNN using 2D feature maps of monitoring data. In their
changes in structural properties of a steel frame using meas- study, the 1D time-series signal was rearranged into 2D
ured acceleration signals. J. Li, Dackermann, Xu, and Samali maps via a matrix reconstruction method. The comparison
(2011) published promising results for structural damage experiment results are competitive and promising as classifi-
detection of Euler-Bernoulli beams by combining 1DCNN cation accuracies were more than 90% on average, and vari-
and original waveform signals in lieu of handcrafted fea- ation ranges were less than 2%. Recently, Tang, Chen, Bao,
tures. Avci, Abdeljaber, Kiranyaz, Hussein, and Inman and Li (2019) applied 2DCNN to the anomaly detection
(2018) addressed the loss of connection stiffness of a steel problem of a long-span cable-stayed bridge, utilizing 2D
frame structure via a novel SHM method using 1DCNN representation of accelerometer sensor signals. Their results
and WSNs. Zhang, Miyamori, Mikami, and Saito (2019) suggest that 2DCNN approach fared better than the existing
developed a 1D-CNN method for vibration SHM (VSHM) method in structural anomaly detection, and was scalable to
of bridge structures and successfully tested on both a simpli- include more signal data from multiple measurants.
fied laboratory model and a real steel bridge. Ince (2019) A notable trend towards DL techniques can be seen in
demonstrated that the 1DCNN architecture was highly various application areas, but there are still limited studies
effective in real-time monitoring of motor conditions on the applicability of DL in structural health monitoring,
because their model took only 1.0 ms per classification, and especially for full-scale structures. Moreover, reviewed works
the experimental accuracy was more than 97%. Recently, only focused on one specific DL algorithm, but no extensive
Liu and Zhang (2020) applied the CNN model to investigate study comparing different DL algorithms in the civil engin-
the Maryland and Delaware highway bridges using historical eering field on multi-facets, including detection accuracy,
data and achieves a high accuracy of more than 85%. time complexity, and required data storage. This study is
The recurrent neural network is a special architecture thus motivated by the need for a more insightful and com-
among Deep learning algorithms designed for capturing time- plete picture of the applicability of DL algorithms to the
dependent characteristics. Thus, RNNs are naturally proposed Structural Damage Detection (SDD). The main contribu-
for feature learning of sensor measurements. However, the sen- tions of this work are summarized as follows:
sor data usually consist of long sequential samples, therefore,
the vanilla RNN suffers either gradient exploding or vanishing. Design of a modulated workflow flexible in switching
To cope with this long-range dependencies, some derived RNN different DL algorithms and fusing data from multiple
STRUCTURE AND INFRASTRUCTURE ENGINEERING 3
sensors, thus enabling to 1) compare the practical effect- The time series sensory signal can also be represented in
iveness of various DL algorithms, 2) perform multiple frequency domain via Fourier Transformation (FT). Any
structural damage detection tasks such as damage local- signal can be expressed as a sum of infinite series of sinus-
ization, damage severity, and 3) handle time series data oids. The FT will decompose a signal into its constituent
polluted by noises through the noise injection learn- frequencies together with their associated coefficients. For
ing method. civil structure, the frequency range of interest depends on
Evaluation of strengths and weaknesses of each technique both dynamic characteristics of structures and external
through three case studies of structures ranging from excitations.
relatively small structures (i.e., a reinforced concrete Moreover, it is possible to observe the evolution of fre-
beam) to considerably large structures (i.e. a stayed-cable quency components overtime using the short-time Fourier
bridge). It was found that 2DCNN constantly achieves Transform (STFT) technique, which results in a 2D repre-
the highest accuracy prediction, but it requires additional sentation of the signal. A long signal is divided into shorter
storage for images, while LSTM and 1DCNN demon- segments of equal length; then FT is applied to each seg-
strate a fair balance between accuracy and computa- ment. Mathematically, the STFT is formulated as follows:
tion complexity. ð1
STFTðs, f Þ ¼ xðtÞwðtsÞe2pift dt: (2)
1
The rest of the paper is organized as follows: Section 2
briefs basic vibration signal analysis, including its time- where x(t) is the signal to be transformed, wðtÞ is the win-
domain representation, frequency-domain representation, dow function, i is the imaginary unit, f and s are frequency
and 2D time-frequency representation. Section 3 presents and time index of the STFT, respectively. Applying time dis-
the architectures of four investigated algorithms: MLPs, cretization, the formulation is rewritten as follows:
LSTM, 1DCNN, and 2DCNN. In Section 4, the performance X
nþN1
of these algorithms is compared through three case studies STFTn, f ¼ xk wkn e2pifk , (3)
involving a 1D continuous beam, 2D steel frame, and 3D k¼n
stayed-cable bridge. Finally, the conclusions are drawn in in which n stands for the segment number, N denotes the
Section 5. length of a segment. The 2D representation of STFT ampli-
tude is called the spectrogram of the signal. Figure 1 illus-
trates an example of acceleration signal with its Fourier
2. Deep learning-based structural damage detection
transformation and spectrogram.
Currently, there is a vast number of techniques developed
to improve the performance of DL, yet making these techni-
2.2. Multi-Layer perceptron-based algorithm
ques practical and applicable to real scenarios is not a trivial
task. One of the vital elements in building a successful DL The first investigated neural network is the multi-layer per-
application is to select an appropriate DL architecture, ceptron (MLP) composed of input layer, hidden layer and
understand its impact on the application performance as output layer. Measured signal data are fed into the input
well as its tradeoffs. Thereby, in this part, we investigate the layer, which is then synthesized through one or multiple
applicability of the most widely used architectures across a hidden layers before emerging at the output layer as damage
spectrum of sectors (Jensen, Pedersen, & Thomsen, 2017) to diagnostic predictions. Mathematically, this process can be
SHM, namely, MLP, 1DCNN, LSTM, and 2DCNN after a formulated as (Burkov, 2019):
brief introduction to the structural dynamic response. Y ¼ FðXjWÞ ¼ fL ð:::f2 ðf1 ðXjW1 ÞjW2 ÞWL Þ, (4)
where L is the total number of layers in the network includ-
2.1. Introduction to structural dynamic response ing the input and output layer, W is the matrix of parame-
In this work, acceleration signals serve as input for DL algo- ters, X and Y denote the input and output vectors,
rithms. A set of essential points across the structure is pre- respectively. The connection between two consecutive layers
defined; then, dynamic responses are measured and is performed by combining a linear matrix operation and a
collected. It is common to apply a simple and rapid stand- nonlinear activation function as follows:
ardization processing on measured signals before feeding fl ðXl jWl Þ ¼ hðWl Xl þ bl Þ with l ¼ 1, 2, :::, L (5)
data into the DL model. The technique rescales data to a
where is the element-wise product, Xl is the input of the
range having zero mean and unit variance. The mathemat-
l-layer, bl is bias vector, and h denotes the activation func-
ical formula of the standardization technique is expressed as
tion that is a sigmoid function for hidden layers and the
follows:
softmax function for the output layer. For this study, struc-
xl
xnew ¼ , (1) tural damage detection tasks are posed as multi-classification
r problems, e.g., detect damaged/undamaged state (0/1 class),
where x denotes the original data, xnew is the standardized assess damage severity (healthy, minor, medium, major
data, l and r are the mean value and the standard deviation class), etc.; thus the softmax function is selected at the out-
of the original data, respectively. put layer as a permanent part of the method. Softmax
4 H. V. DANG ET AL.
function normalizes evaluated values into a form of prob- dataset is significantly lower than training accuracy) and
ability falling between [0, 1], probabilities of all classes are requires a high computational cost. In this work, the num-
added up to 100%, and the class with the highest probability ber of output classes is basically under 100 therefore, num-
will be selected as predicted class. The softmax activation ber of perceptrons in hidden layer are set to 1000. Figure 2
function is formulated as (François, 2017): shows the architecture of the implemented MLP with 1000-
2 3 2 1 3 perceptron hidden layer.
y1 ewL XL
6 . 7 6 7
6 . 7
6 . 7 6 ... 7
6 7 1 6 k 7 2.3. Long short term memory-based algorithm
Y ¼ fL ðXL jWL Þ ¼ 6 yk 7 ¼ PK wi X 6 w XL 7
6 e L 7, (6)
6 . 7 e L L 6 .. 7
6 . 7 k¼1
4 . 5 In this section, LSTM network is reviewed and adapted for
4 . 5 K structural damage detection purposes. The fundamental the-
yK ewL XL
ory of LSTM refers to the work of (Hochreiter &
where K is the number of output classes, yk is the predicted Schmidhuber, 1997). The structure of LSTM consists of
probability of the class k, wkL is the weight row vector k of jointly connected repeating cells. Each cell has three gates,
the parameter matrix HL associated with the kth output namely forget gate, input gate, and output gate for control-
class, i.e.: WL ¼ ½w1L . . . wkL . . . wKL T : ling information flow. Outputs of LSTM sequences are fed
Herein, the network parameters are computed in a super- into a fully connected layer with softmax activation func-
vised fashion, which means the structural state is labeled in tion, which further provides the probability for each pre-
advance, next the database are divided into three groups, dicted class (see Figure 3).
namely training, validation, and testing datasets. The value The mathematical formulation of this model is described
of network parameters are randomly initialized, then they as follows. Given xt, the input at time step t, ht1 the output
are iteratively updated via the backpropagation algorithm to of LSTM cell at time step t 1, a linear transformation of
minimize the deviation between prediction values and anno- the combination of xt and ht1 is expressed by:
tated structural states. At a step t of the training process, Lðht1 , xt Þ ¼ W ½ht1 , xt þ b, (9)
the parameter values are updated by:
where W and b are the weight matrix and bias vector of
ij ij j
wl ðtÞ ¼ wl ðt1Þ þ gdil xðl1Þ ðtÞ, (7) the network.
Then, the three gates inside each cell of LSTM are writ-
where l denotes the layer number, dli is the error at node i ten (Olah, 2015):
of layer l, which is calculated backward from the deviation
ff ¼ rðLf ðht1 , xt ÞÞ
at output layer dL and g stands for the learning rate. The
fi ¼ rðLi ðht1 , xt ÞÞ (10)
training process terminates when a tolerable error level is
fo ¼ rðLo ðht1 , xt ÞÞ,
met or the number of iterations reaches to a limit.
The deviation between the prediction and actual struc- in which the subscripts f, i, o indicate the forget gate, input
tural states is computed via the cross-entropy function gate and output gate, respectively. r is the sigmoid activa-
(Janocha & Czarnecki, 2017) as follows: tion function.
X At every time step t, the new candidate of information is
Loss ¼ dL ¼ yk log pð^y k Þ, (8) firstly created by applying the tanh activation function on a
k
linear transformation of the combination ½ht1 ; xt :
where k denotes the class number in output, p is the prob-
ability estimate, y is the actual label and ^y stands for predic- Ct ¼ tanhðLc ðht1 , xt ÞÞ, (11)
tion value. where Lc is a creation linear operator whose formula is
Theoretically, MLP network with one hidden layer can similar to Eq. (9). Then the flow of information from previ-
approximate any functions in the training process on the ous step is updated with the new candidate by the gates of
condition of a sufficiently large hidden layer (Goodfellow LSTM cell:
et al., 2016). However, such a large hidden layer usually
leads to the over-fitting phenomena (accuracy on test st ¼ ðff ht1 Þ丣ðfi Ct Þ, (12)
STRUCTURE AND INFRASTRUCTURE ENGINEERING 5
Figure 4. Architecture of 1DCNN for structural damage detection adapted from (Lin et al., 2017).
fully connected layer before giving the classification results length of 9 m (Figure 7). Its cross-section dimensions are
at the output layer using the softmax activation function. 40 60 cm2, Young’s modulus E is 30 GPa with 95% confi-
dence interval being E95 ¼ ð160:05Þ=E (Noguchi & Nemati,
1995), Poisson’s ratio is 0.3, mass density q is between 2200
3. Performance evaluation
and 2400 kg/m3 (Kosmatka, Kerkhoff, & Panarese, 2019),
In this section, the implementation of the DL algorithms is and the modal ratio is 2%. The beam is modeled in the
investigated through three case studies from a simple 1D Finite Element software Abaqus Standard (Dassault, 2016a),
structure to a full-scale 3D structure. Their performances in using the three dimensional beam element. The element has
terms of accuracy, time complexity, required storage as well six degrees of freedom (dofs) at each node, including three
as the robustness in dealing with noisy data are quantified translations and three rotations around the x, y, and z direc-
and compared based on calculated results. The next subsec- tions. Section properties and material properties are sum-
tion will go through three case studies from a simple one- marized in Table 1. The beam is subjected to a time-varying
dimensional beam structure to a two-dimensional frame uniform load modeled as a Gaussian random process with
structure to a three-dimensional full-scale stayed- amplitude falling between 1 and 20 kN/m, and the coeffi-
cable bridge. cient of variation is 0.1. Figure 7(b) shows a representative
time history of load amplitude at the middle of the beam.
The self-weight is implicitly accounted for in the simulation
3.1. Case study 1: 1D continuous beam excited by
through the given material mass density and gravitational
random force
acceleration.
The first numerical experiment is a continuous concrete Each span of the continuous beam is divided into nine
beam usually occurring in buildings subjected to time-vary- segments of 1 m length. The damage is randomly assigned
ing uniformly distributed loads reflecting machinery excita- to these segments. The damage is modeled as a reduction in
tion, for example. The beam consists of three spans of equal the effective height of the section, leading to element
STRUCTURE AND INFRASTRUCTURE ENGINEERING 7
Figure 7. Three-span continuous beam under time-varying uniformly distributed load. a) Schematic representation of the beam. b) Example of a time-varying
amplitude loading at the middle of the beam.
Table 1. Numerical input parameters for the continuous beam case study.
Number f1 f10 Duration of Time step Mesh size Element Analysis
of mode (Hz) (Hz) simulation (s) (s) (m) type method
10 11.67 169.67 120 1e-3 0.2 B21 Modal analysis
Table 2. Random parameters for Monte Carlo simulations of the continuous beam structure.
E q Loading Damage Damage
Parameter (kN/m2) (kg/m3) (kN/m) level (%) location
Distribution Normal Uniform Uniform Uniform uniform
Value E95 ¼ [28.5 31.5] [2200–2400] [1–20] [0, 5, 15, 25] [s1-s9]
0: No damage.
stiffness loss, thus resulting in changes in the dynamic satisfied accuracy but requires less computational effort than
behavior of the structure. Three damage levels are investi- the direct integration, therefore it is suitable to generate an
gated, corresponding to 5%, 15%, and 25% reduction in the extensive dataset for DL algorithms. The number of modes
effective height. In reality, this type of damage is often included the modeling is selected through a conventional
observed in reinforced concrete elements prone to cracking, convergence analysis, resulting in a value of 10 for the
spalling, and so forth. The monitoring system is simulated present simply supported beams. The simulation duration is
by using 9 virtual accelerometers measuring vertical acceler- T ¼ 120 s, and the sampling frequency is f ¼ 200 Hz.
ation responses of the beam; three vrtual sensors are Figure 8 illustrates an example of time-series sensor data
mounted on each span (at 1/4, 1/2, and 3/4 span). The setup obtained from numerical simulations.
configuration is schematically depicted in Figure 7. It is sup- Intuitively, using time-series from only one sensor hardly
posed that virtual sensors work in ideal conditions, which provides accurate results about damage localization across
means there is no noise included in the obtained acceler- the whole structure, while taking multiple sensor data as
ation time-series. The variation of responses only results input for the neural network model can benefit the model
from uncertainties in input parameters and various damage performance. There are two ways to handle multiple time
scenarios introduced. The inputs for the Monte Carlo simu- series: i) combining these time series into a 3D time-ampli-
lation are summarized in Table 2, other parameters remain tude-sensor tensor or even 4D time-frequency-amplitude-
identical between calculations. The setup configuration is sensor tensor before feeding to a neural network ii) each
schematically depicted in Figure 7. time series enters to a separate model of the same architec-
Due to the uncertainty of external load, material proper- ture. The former way needs to increase the dimension of
ties’ variation, and various damage scenarios, we use the input data, while the latter one requires to repeat training
Monte Carlo method to generate an input database of 10000 process by the number of networks. To cope with these
samples and run extensive finite element simulations. The issues, one revisits the second approach by concatenating
finite element simulation is realized by the use of the mode- the output of each neural network into a long vector before
based dynamic method in Abaqus Standard, which provides passing through a common output layer with the softmax
the time-dependent response of the structure via the prin- activation function to provide damage classification output,
ciple of superposition. At first, the eigenvalue analysis pro- as schematically illustrated in Figure 9. In this way, the
cedure (subspace projection method) is conducted to extract resulting unified model has the capability of using the ori-
modal characteristics, the number of extracted modes ginal multiple time series as input without increasing input
should be sufficient to reflect the real structural behavior dimensions or repeating the training process.
closely. After that, the amplitude coefficient associated with The first structural damage detection task is to estimate
each mode in response to external excitations is determined; damage severity. The data are labeled by the four introduced
the response of the structure is obtained by superposing damage levels, namely, healthy, minor, medium, and major,
modal responses. More details of the mode-based dynamic as listed in Table 3. The damage classes are evenly distrib-
analysis can be found in (Dassault, 2016c). In general, uted, i.e., there are 2500 cases per damage level. Such uni-
modal dynamic procedure could provide results with form distribution simplifies the training process by avoiding
8 H. V. DANG ET AL.
Figure 10. Evolution of training loss and validation accuracy in function of the number of epochs in the training process.
Figure 12. Confusion matrix for damage localization obtained by different DL algorithms.
Table 4. Damage detection results obtained with different Deep learning algorithms.
Damage severity Damage localization
Task
Model MLP 1DCNN LSTM 2DCNN MLP 1DCNN LSTM 2DCNN
Number of epoch 300 300 300 100 300 300 300 100
Total traing time (s) 900 1188 2250 3650 712 807 1918 2100
Testing accuracy 78.8% 87.5% 93.0% 98.9% 83.9% 86.0% 95.0 % 99.2%
Figure 13. Three-story frame structure experiment (Los Alamos National Laboratory, 2002).
Figure 14. Acceleration time series measured at the second floor for 11 structural states.
Table 6. Damage detection results on experimental database of a 3-story frame. using vibration signal measured during and shortly after
Task MLP 1DCNN LSTM 2DCNN excitation is an appealing alternative because obtained sig-
of trucks (m/s) loss (%) location nals are rich in information in terms of amplitude, variabil-
Detection accuracy (%) 97 97 99 99 ity, and frequency contents. Since resulting models can
Localization accuracy (%) 87 93 93.5 95
Severity accuracy (%) 85 91 92.6 94.6 depend on specific excitation, e.g., earthquake records, it is
Training time (s) 350 470 930 1250 beneficial to consider different earthquake records, enriching
Number of trainable 389806 342670 326606 540550 the diversity of database and robustness of DL methods.
parameters
Storage (Mb) 1010 1010 1010 1945 Specifically, for this case study, the frame is subjected to
three simulated ground motion with characteristics of the
1940 El Centro earthquake, the 1999 Chi-Chi earthquake,
In fact, it is convenient to train a model able to detect and the 2001 El Salvador earthquake. The ground acceler-
underlying damages from free or ambient vibration meas- ation time histories of these earthquakes are highlighted in
urements because of their accessibility. On the other hand, Figure 16(c). Virtual accelerometer sensors were placed at
12 H. V. DANG ET AL.
the frame connections to capture frame movements during will be advantageous for classifiers, especially those having
ground motions. the capability of exploiting time-varying dependence such as
In general, damages do not take place before earthquake LSTM and 2DCNN. A procedure using only a portion of
events, but during or shortly after. Therefore simulations signals recorded after the occurrence of damage is also feas-
are performed in two phases: The first phase corresponding ible, but it will require corresponding undamaged data to
to the first half 30 s of earthquake records where the struc- identify property shifts, especially when working with differ-
ture is supposedly still intact, then in the second phase, con- ent seismic spectrums triggering different eigenmodes of
nection stiffness reductions of different levels, i.e., 0% the structure.
(healthy), 30%, 60%, and 90%, are arbitrarily introduced to Dynamic analysis is performed with the help of the
two connections (multiple damage scenarios) by reducing mode-based dynamic analysis (Awkar & Lui, 1999) and
rotational stiffness c via Eq. (18), other connections are kept modeling transfer technique in which the outputs of the first
unchanged. The selection of 30 s adopted here comes from stage are considered as inputs of the second stage. More
the observation that most significantly strong part of EQ details about seismic analyses of a frame structure via
(Figure 16) records falls between [0–30 s], specifically the mode-based dynamic analysis could be found in (Dassault,
root mean square (RMS) value of the first 30 s is greater 2016b). In engineering practice, using linear dynamic ana-
than 8 times that of the 30 s second part for all three con- lysis method still a preferable choice compared to nonlinear
sidered EQ records, thereby it is supposed that the second counterpart thanks to its practicality, accuracy of results can
part does not incur further damage. The recorded 60 s- be adjusted further via a safety factor. Tables 7 and 8 enclo-
length acceleration is used as input data to train the deep ses numerical parameters of the finite element model.
learning models further. As changes in connection stiffness The same procedure presented for the first example is
could lead to changes in global stiffness of the structure, applied for this 2D frame to build deep learning models:
using complete signals recorded from before to after damage performing Monte Carlo simulation, involving parameters’
event, could store property shift in a single data. Thus, it variation, realization of extensive finite element analysis,
labelling data by structural states, training deep learning
models for each sensors, combining output of all sensors
into a final classification result and evaluation of the mod-
els’ performance. As steel is a homogeneous material and
usually manufactured with a highly controlled process, then
their properties are more reliable than those of concrete,
which is heterogeneous material and can be manually cast
in situ. Therefore, one excludes the variation of steel proper-
ties from the Monte Carlo simulation in this case study.
The main uncertainty sources for Monte Carlo simulations
Table 7. Numerical input parameters for the 2D steel frame case study.
Time Mesh
Number f1 f20 Duration of step size Element Analysis
of mode (Hz) (Hz) simulation (s) (s) (m) type method
20 3.36 93.43 120 1e-3 0.2 B21 Modal
analysis
Figure 15. Damage detection results in terms of accuracy and efficiency.
Figure 16. a) Schematic representation of the two-bay five-story steel frame subjected to earthquake ground motion. b) 2D model of the steel frame in ABAQUS.
c) Ground accelerations of three investigated earthquake: Elcentro, Elsalvado and Chichi, respectively (Haddadi et al., 2008).
STRUCTURE AND INFRASTRUCTURE ENGINEERING 13
Table 8. Random parameters for Monte Carlo simulations of the 2D steel frame structure.
Damage location Stiffness
Parameter Earthquake Vertical attribute Horizontal attribute reduction (%)
Distribution Uniform Uniform Uniform uniform
Value [El Salvador,] 2-combination 2-combination [0, 30,60, 90]
ElCentro, Chi-Chi] of [r, c, l] of [1–5]
0: No damage.
of this example are degrees of stiffness reduction, damage points of different damage scenarios mix together or data
locations, and earthquake records drawn from uniform dis- points belonging to the same damage class but are not prop-
tributions as listed in Table 8. erly grouped, the utilized algorithm needs to be improved.
For this case study, ones investigate the double damage Figure 17 demonstrates the results of the stiffness reduc-
localization tasks. The multiple damage detection is much tion identification for steel frame under earthquake ground
more challenging than single damage detection due to the motion. The rows of the figure correspond to MLP, LSTM,
increasing number of scenarios, especially for complex struc- 1DCNN, and 2DCNN algorithms, respectively. The left side
tures. There are 15 frame connections in total; then, the num- of each row presents the results of detecting the building
ber of scenarios is only 15 for single damage, but rises to 105 storey of damaged connection pairs, including the percent-
cases for double damage, will be 455 cases for triple damage age accuracy and their tSNE representation. The right-hand
scenarios, and so forth. With a given database, when the num- side highlights results related to the vertical attribute. For
ber of classes increases, the training sub-dataset for each class vertical attributes, all four deep learning algorithms show
will be fewer, thus the overall accuracy decreases. To over- high accuracies of more than 97%. The tSNE indicates that
come this limitation, the damage scenarios are divided into obtained results for different groups are well separated, but
different subcategories of a smaller number of classes by MLP is not as good as the three other algorithms in cluster-
incorporating more complementary attributes. ing data, for example, the data of the pair (r,r) are subdi-
For the considered scenario, further information, i.e., the vided into three locations. For horizontal attributes of
story number (horizontal attribute), and the related frame col- damage localization, as the number of output classes
umns (vertical attribute), are integrated into the frame con- increases to 15, the superiority of LSTM and 2DCNN is well
nection information apart from node number (see Table 9). demonstrated. Their average accuracies are 97% and 99%
There are 5 stories; thus, the number of double damage scen- respectively, whereas, for MLP and 1DCNN algorithms,
arios with respect to the horizontal attribute reduces to 15 some pairs of damaged connections obtain under 60%,
cases. The label for each case is denoted by corresponding sto- (such as the pair (2, 5) for MLP and pair (3,4) for 1DCNN).
ries; for example, label (3, 5) indicates damages occurring in The tSNE also highlights this observation. The tSNE of
the beams of the third and fifth stories. Whereas with 3 MLP’s results shows an area where data points of several
groups of columns: left, center, and right column (denoted by classes overlap together at the bottom left corner. In the
l, c, r), there are only 9 possible cases which are annotated by tSNE of 1DCNN, the pair (2, 5) and (3, 4) are not well split.
a pair of numbers, for instance, (l, c) means damage happens In contrast, the tSNE of LSTM displays good clustering
in left and center columns. Then, a two-head deep neural net- results, but there are some data points far apart from the
work is trained, one head for detecting the story of damaged others. The tSNE of 2DCNN has the best clustering image.
connections, another for their vertical attribute. By combining Remark 3: The proposed DL workflow has the ability to
these two results, the damaged connections are accurately diagnose the multi-damage scenarios across the structure with
localized. This multi-head strategy is more amenable in prac- high accuracy. 2DCNN consistently provides the best detection
tice than the direct multi-classification approach when deal- results, followed by LSTM and 1DCNN. In contrast, the per-
ing with a large number of output classes and unchanged formance of MLP algorithm needs to be improved further by
database volumes. modifying the network’s hyperparameters. Theoretically, MLP
To visualize the classification results, t-Distributed architecture can attain an accuracy as high as that of 2DCNN,
Stochastic Neighbor Embedding (t-SNE) technique is but the hyperparameter optimization is another laborious task,
adopted due to its ability of dimensionality reduction in vis- and change radically for different SDD tasks, thus it is more
ualizing high-dimensional datasets (Maaten & Hinton, practical to invoke other DL algorithms.
2008). Since deep learning models will extract multiple pat-
terns from raw data, thus using t-SNE provides a graphical
3.4. Case study 3: 3D bridge under moving load
presentation about how efficient each algorithm in cluster-
ing different damage scenarios. If input signals of the same For the third case study, the DL approaches are applied to a
damage class are well grouped and separated from other full-scale bridge structure. In stayed-cable bridges, cable is
classes, the DL algorithm is effective. In contrast, if data one of the most crucial components (Montoya, Deodatis,
14 H. V. DANG ET AL.
Figure 17. Damage localization accuracy for double damaged scenarios obtained by different Deep Learning algorithms and the corresponding tSNE visualizations.
Betti, & Waisman, 2015), having a great impact on the prestressed girders of 2.6 m depth and I-shaped transverse
dynamic behavior of the structure. If there is a loss in the steel beams spaced at 4.4 m intervals. The structure is sus-
prestressed force in cables, the structural rigidity will be sig- pended by a system of cables composed of 48 stays con-
nificantly reduced. The common method to examine the nected to either side of two outstanding pylons. The pylon
operational condition of cables is measuring vibrations of has the H shape with a height of 150 m, including two legs
each cable, calculating its eigenfrequency values then deriv- whose cross-section dimension varies gradually from
ing the actual prestressed force in cables from empirical for- 12 7.7 m at the base to 5.5 4.7 m at the top, linked by a
mulae. However, this method does not allow for directly transverse beam at the 87 m level. The details of the geometric
assessing the interaction between global dynamic responses dimensions of the bridge are illustrated in Figure 18(b, c). In
of the structure and the cable system. Alternatively, by using terms of cable parameters, the cross-section of each cable
the presented deep learning methods, one attempts to detect is 0.019 m2, and an average value of the effective cable ten-
the location of cable prone to prestressed force reduction sion is selected as 6256 kN after measurements of (Cruz &
via bridge vibration time series, which can be obtained from Miranda, 2017). The material parameters of the bridges are
accelerometer sensors installed across the bridge. as follows: concrete of C45/55 grade with Ec¼44.17 GPa,
The presented methodologies are applied to the Vasco da volumic density of 2400 kg/m3; structural steel S355J2G3
Gama cable-stayed bridge in Lisbon, Portugal (Figure 18(a)), with Es¼210 GPa and volumic density of 7850 kg/m3. For
for assessing tension reductions in cables based on dynamic the structural damping of the Vasco da Gama Bridge, it is
responses of the structure to crossing vehicles. The main assumed that a damping ratio of 0.3% is applied for all
structure of the bridge consists of a central span of 420 m vibration modes.
length, and three lateral spans of 62, 70.7 and 72 m either A finite element model of the bridge is developed to
side, leading to a total length of 829 m (Pedro, Oliveira, & simulate the structural behavior when a vehicle travels
Reis, 2010). The bridge deck is 31 m wide, made by a 25 cm across, (see Figure 19(a)) using Abaqus software. The bridge
thick concrete slab supported by two longitudinal deck is modeled by four noded S4R shell elements from the
STRUCTURE AND INFRASTRUCTURE ENGINEERING 15
Figure 18. a) Vasco da Gama bridge located in Lisbon, Portugal (Vasco de Gama bridge, 2011) b) Schematic representation of the bridge and measurement points’
location c) Pylon elevation with dimension in place.
Figure 19. a) Finite element model of the Vasco da Gama bridge built in Abaqus and b) a standard vehicular load (HS20 truck) according to American Association
of State Highway and Transportation Officials (AASHTO, 1998).
Table 10. Numerical input parameters for the 3D stayed-cable bridge case study.
Duration of Time step Mesh size Beam Cable Deck Connection Analysis
simulation (s) (s) (m) ele. type ele. type ele. type type method
60 1e4 1.0 B31 T3D2 S4R Zero-length Explicit
connector analysis
Abaqus element library. The pylons of the cable-stayed stressed action. The pre-stressed value per strand is obtained
bridges are modeled using 3D beam elements. The element by dividing the tension force by the strand area. In FEM,
has six degrees of freedom (dofs) at each node, including the tension is modeled as an initial condition, directly
three translations and three rotations around the x, y and z applied on cable elements at the beginning of the analysis
directions. The stay cables are modeled using two-noded using predefined field, mechanical category, where the type
T3D2 truss elements (Olamigoke, 2018) subjected to pre- is selected as stress. The pre-stressed value is the same for
16 H. V. DANG ET AL.
all cables except for the damaged one, whose value is virtu- below 30% tension loss in one stay-cable in detecting either
ally reduced by a prescribed damage level. Table 10 lists damage location or damage severity. Therefore, three levels
numerical input parameters for the 3D FEM of the stayed- of tension reduction investigated are 30%, 60%, and 100%
cable bridge in Abaqus. (total loss of tension). Furthermore, when working with
In terms of vehicular load, a standard HS20 truck, added-noise data, preliminary calculation with low-level ten-
according to the American Association of State Highway sion loss (about 5%) did not yield satisfying results then was
and Transportation Officials (AASHTO, 1998) specifications, not reported in this manuscript. The damage is arbitrarily
is selected in this study. The truck weighs 36 tons, has three introduced to one of 128 cables, at the beginning of the
axles with configuration and loading details highlighted as simulation. In total, 10000 simulations are realized, includ-
in Figure 19(b). Multiple cases of vehicular loads are consid- ing different damage scenarios, and various vehicular load
ered in the calculation, such as one or multiples vehicles cases, as summarized in Table 11. The learning process is
traveling at different velocities. Each truck is modeled by 3 conducted in a supervised fashion in which each series of
concentrated loads of 35, 145 and 145 kN, respectively, mov- dynamic responses of the bridge is labeled by levels of dam-
ing at a constant velocity on the centerline of the bridge age or location of damaged cables. The number of samples
deck. During movement, the distance between these three is uniformly distributed among classes, i.e., approximately
loads (4.3 m and 9.0 m) remains constant. In the Abaqus 80 samples (10000/128) for damage localization, and 2500
software, the subroutine DLoad is a convenient tool to (10000/4) for damage severity.
simulate such moving loads. The velocity of trucks is Figure 20 presents the validation accuracy and training
selected from the range [10, 15, 20, 25, 30] m/s, the number loss retrieved during training processes for two tasks: dam-
of trucks varies from 1 to 5, and the distance between two age severity and damage localization. The accuracy on the
consecutive vehicles is fixed at 100 m. The structural valid dataset is increased as a function of the number of
response is virtually measured at 30 different points (15 epochs, and the training loss decreases at the same time.
points each side) along the deck, as highlighted in Once the valid accuracy becomes stable, and the training
Figure 18(b). process could be terminated. It can be seen that for damage
Once a detailed 3D FEM of the bridge is constructed, severity tasks, 2DCNN outperforms the other algorithms
various damage scenarios are introduced into the FEM to with the converged accuracy of around 94%, followed by
generate a synthetic database of structure response. LSTM, 1DCNN, and MLP with accuracy of 90%, 85%, and
Following the experimental program conducted by 80%, respectively. It is also noteworthy that the learning
Nazarian, Ansari, Zhang, and Taylor (2016), methods using curve of 2DCNN becomes stable after only 20 epochs, while
the bridge deck’s responses were inefficient for damages of it requires more than 200 epochs for other methods. The
similar patterns are observed for damage localization tasks.
Table 11. Random parameters for Monte Carlo simulations of the stayed- Otherwise, the damage localization tasks receive higher
cable bridge structure.
accuracy than the damage severity tasks. A possible explan-
Parameter Number Velocity Tension Damage
of trucks (m/s) loss (%) location ation is that the influence of each cable on the dynamic
Distribution Uniform Uniform Uniform uniform behavior of the structure is not the same, a minor tension
Value [1–5] [10, 15, 20, 25, 30] [0, 30, 60, 90] [0–128] loss of a critical cable (long stay-cables connected to the
0: No damage. middle of the bridge) could cause damage equivalent to a
Figure 20. Evolution of training loss and validation accuracy in function of the number of epochs in the training process.
STRUCTURE AND INFRASTRUCTURE ENGINEERING 17
Table 12. Stayed-cable bridge’s damage detection results obtained with different Deep learning algorithms.
Damage severity Damage localization
Task
Model MLP 1DCNN LSTM 2DCNN MLP 1DCNN LSTM 2DCNN
Storage (Gb) 9.2 9.2 9.2 18.9 9.2 9.2 9.2 18.9
Time for 1 epoch (s) 5.1 11.1 24.9 354.0 4.7 9.5 10.6 290
Total traing time (s) 2550 5550 12450 17700 2350 4750 5300 14500
Testing accuracy 78.8% 87.5% 93.0% 98.9% 83.9% 86.0% 95.0 % 99.2%
The results are obtained after training with 500 epochs for MLP, 1DCNN, LSTM, and 50 epochs for 2DCNN.
Figure 21. Evolution of prediction accuracy versus the added-noise level for different Noise-to-Signal ratios.
medium tension loss of another cable (short stay-cables external factors such as device instability, environmental
close to the pylon). effects, systematic errors. Herein, we consider white-noise,
Table 12 summarizes the computed results of four Deep which is a classical yet essential problem for time series ana-
learning algorithms for the damage detection of the stayed- lysis (Z. Li et al., 2019), reflecting unknown ambient excita-
cable bridge. Although 2DCNN requires the minimal num- tion, usually addressed in the SHM literature (de Castro,
ber of epochs during training, the training time for each Baptista, & Ciampa, 2019). Moreover, present methods will
epoch (354 s) is significantly higher than that of other algo- automatically learn relevant patterns and mitigate the effect
rithms. Therefore, the total training time of 2DCNN is 1.4, of noise through their hierarchical architecture rather than
3.2, and 6.9 times higher than that of LSTM (17,700 s versus resorting to a per-noise preprocessing step; thus, it is poten-
12,450 s), 1DCNN (5550s), and MLP (2550s). In addition, tially applicable to other types of perturbations. Herein, we
apart from the original time-series signals, 2DCNN needs to consider white-noise where its amplitude is defined using
store associated images of 2D time-frequency representation root mean square (RMS) value of the vibration data and is
of vibration signals, therefore its required storage is twice as represented as follows:
much. This drawback impedes the practicality of 2DCNN.
On the other hand, LSTM, which achieves significantly xinoise ¼ xi þ a:gi , (19)
higher accuracy in a commensurable running time without
additional storage, is promising solution in real-time SHM where xi is the vibration signal retrieved from sensor it,
applications. 1DCNN also bears potential for wider use due xinoise is added-noise data; gi is the white noise vector with
to its advantages as demonstrated in these case studies: zero mean and unit variance, a is the noise amplitude
higher accurate than MLP, very rapid, and no need for depended on the RMS of original data.
extended storage. In the Machine Learning literature, to increase the
robustness of models when dealing with noisy data (Yun,
Yi, & Bahng, 2001) suggested to train models with low-level
3.5. Damage identification with noisy data
added noise dataset instead of virgin datasets, referred as
In the following part, the robustness of the DL is investi- the noise injection learning method. Thereby, the models
gated with respect to time-history sensory data contami- are retrained with training data augmented by noise-to-sig-
nated by noise. Noise can be attributed to various sources: nal ratios (NSR) 3%, 5%, and 7% then tested with data cor-
one rooted from internal structures’ properties, other due to rupted by noise with NSR up to 20%.
18 H. V. DANG ET AL.
Kingma, D. P., & Ba, J. (2014). Adam: A method for stochastic opti- Nazarian, E., Ansari, F., Zhang, X., & Taylor, T. (2016). Detection of
mization. arXiv Preprint: 1412.6980. tension loss in cables of cable-stayed bridges by distributed monitor-
Kiranyaz, S., Avci, O., Abdeljaber, O., Ince, T., Gabbouj, M., & Inman, ing of bridge deck strains. Journal of Structural Engineering, 142(6),
D. J. (2019). 1D convolutional neural networks and applications: A 4016018. doi:10.1061/(ASCE)ST.1943-541X.0001463
survey. Preprint arXiv:1905.03554. Noguchi, T., & Nemati, K. M. (1995). Relationship between compres-
Kosmatka, S. H., Kerkhoff, B., & Panarese, W. C. (2019). Design and sive strength and modulus of elasticity of high strength concrete.
control of concrete mixtures. Skokie, IL: Portland Cement Journal of Structural and Construction Engineering (Transactions of
Association. AIJ), 60(474), 1–10. doi:10.3130/aijs.60.1_8
Lee, F. F., Karpathy, A., & Johnson, J. (2018). CS231n: Convolutional Olah, C. (2015). Understanding LSTM networks. Retrieved from
neural networks for visual recognition. Retrieved from http://cs231n.
https://colah.github.io/posts/2015-08-Understanding-LSTMs.
stanford.edu.
Olamigoke, O. (2018). Structural response of cable-stayed bridges to
Lei, J., Liu, C., & Jiang, D. (2019). Fault diagnosis of wind turbine
cable loss (Doctoral dissertation). University of Surrey.
based on long short-term memory networks. Renewable Energy, 133,
Pan, S. J., & Yang, Q. (2010). A survey on transfer learning. IEEE
422–432. doi:10.1016/j.renene.2018.10.031
Li, J., Dackermann, U., Xu, Y. L., & Samali, B. (2011). Damage identifi- Transactions on Knowledge and Data Engineering, 22(10),
cation in civil engineering structures utilizing pca-compressed 1345–1359. doi:10.1109/TKDE.2009.191
residual frequency response functions and neural network ensem- Pedro, J., Oliveira, J., & Reis, A. J. (2010). Nonlinear analysis of com-
bles. Structural Control and Health Monitoring, 18(2), 207–226. doi: posite steel–concrete cable-stayed bridges. Engineering Structures,
10.1002/stc.369 32(9), 2702–2716. doi:10.1016/j.engstruct.2010.04.041
Li, Z., Lam, C., Yao, J., & Yao, Q. (2019). On testing for high-dimen- Sekulovic, M., & Salatic, R. (2001). Nonlinear analysis of frames with
sional white noise. The Annals of Statistics, 47(6), 3382–3412. doi: flexible connections. Computers & Structures, 79(11), 1097–1107.
10.1214/18-AOS1782 doi:10.1016/S0045-7949(01)00004-9
Lin, Y. Z., Nie, Z. H., & Ma, H. W. (2017). Structural damage detec- Tang, Z., Chen, Z., Bao, Y., & Li, H. (2019). Convolutional neural net-
tion with automatic feature-extraction through deep learning. work-based data anomaly detection method using multiple informa-
Computer-Aided Civil and Infrastructure Engineering, 32(12), tion for structural health monitoring. Structural Control and Health
1025–1046. doi:10.1111/mice.12313 Monitoring, 26(1), e2296. doi:10.1002/stc.2296
Liu, H., & Zhang, Y. (2020). Bridge condition rating data modeling Vasco de Gama bridge. (2011). Retrieved from https://arab-trip.com/
using deep learning algorithm. Structure and Infrastructure wp-content/uploads/2019/07.
Engineering, 16(10), 1447–1414. doi:10.1080/15732479.2020.1712610 Wang, Y. R. P., & Gao, R. (2017). Virtualization and deep recognition
Los Alamos National Laboratory. (2002). Retrieved from http://www. for system fault classification. Journal of Manufacturing Systems, 44,
lanl.gov/projects/damage_id/index.htm. 310–316. doi:10.1016/j.jmsy.2017.04.012
Lu, C., Wang, Z., & Zhou, B. (2017). Intelligent fault diagnosis of roll- Yuan, M., Wu, Y., & Lin, L. (2016). Fault diagnosis and remaining use-
ing bearing using hierarchical convolutional network based health
ful life estimation of aero engine using lstm neural network. In
state classification. Advanced Engineering Informatics, 32, 139–151.
IEEE International Conference on Aircraft Utility Systems.
doi:10.1016/j.aei.2017.02.005
Yun, C. B., Yi, J. H., & Bahng, E. Y. (2001). Joint damage assessment
Ma, M., Sun, C., & Chen, X. (2018). Deep coupling autoencoder for
fault diagnosis with multimodal sensory data. IEEE Transactions on of framed structures using a neural networks technique. Engineering
Industrial Informatics, 14(3), 1137–1145. doi:10.1109/TII.2018. Structures, 23(5), 425–435. doi:10.1016/S0141-0296(00)00067-5
2793246 Zhang, Y., Miyamori, Y., Mikami, S., & Saito, T. (2019). Vibration-
Maaten, L. V. D., & Hinton, G. (2008). Visualizing data using t-SNE. based structural state identification by a 1-dimensional convolu-
Journal of Machine Learning Research, 9(11), 2579–2605. tional neural network. Computer-Aided Civil and Infrastructure
Montoya, A., Deodatis, G., Betti, R., & Waisman, H. (2015). Physics- Engineering, 34(9), 822–839. doi:10.1111/mice.12447
based stochastic model to determine the failure load of suspension Zhao, R., Wang, J., Yan, R., & Mao, K. (2016). Machine health moni-
bridge main cables. Journal of Computing in Civil Engineering, toring with LSTM networks. In 10th International Conference on
29(4), B4014002. doi:10.1061/(ASCE)CP.1943-5487.0000393 Sensing Technology.