Deep Learning and Inverse Problems
Ali Mohammad-Djafari (ORCID: 0000-0003-0678-7759), Ning Chu, Li Wang, Liang Yu
Presented at MaxEnt 2023: International Workshop on Bayesian Inference and Maximum Entropy
Methods in Science and Engineering, Max Planck Institut, Garching, Germany, July 3-7, 2023.
A modified and combined version of this paper will appear in MaxEnt2023 Proceedings.
Abstract
Machine Learning (ML) methods and tools have gained great success in many data, signal, image and video processing tasks, such as classification, clustering, object detection, semantic segmentation, language processing, Human-Machine interfaces, etc. In computer vision, image and video processing, these methods are mainly based on Neural Networks (NN), in particular Convolutional NN (CNN), and more generally Deep NN.
Inverse problems arise wherever we have indirect measurements. As, in general, these inverse problems are ill-posed, obtaining satisfactory solutions for them requires prior information. Different regularization methods have been proposed, where the problem becomes the optimization of a criterion with a likelihood term and a regularization term. The main difficulty, however, in high-dimensional real applications, remains the computational cost. Here, NN, and in particular Deep Learning (DL) surrogate models and approximate computation, can become very helpful.
In this work, we focus on NN and DL methods particularly adapted for inverse problems. We consider two cases: first, the case where the forward operator is known and used as a physics constraint; second, more general data-driven DL methods.
Keywords: Neural Networks, Deep Learning (DL), inverse problems, physics-based DL
1 Introduction
In science and engineering, we need to observe (measure) quantities. Some quantities are directly observable (e.g., length), and some others are not (e.g., temperature). For example, to measure the temperature, we need an instrument (thermometer) that measures the length of the liquid in the thermometer tube, which can be related to the temperature. We may also want to observe its variation in time or its spatial distribution. One way to measure the spatial distribution of the temperature is to use an Infra-Red (IR) camera. But, in general, all these instruments give indirect measurements, related to what we really want to measure through some mathematical relation, called the forward model. We then have to infer the desired unknowns from the observed data, using this forward model or a surrogate one [1].
As, in general, many inverse problems are ill-posed, many classical methods for finding well-posed solutions for them are based on regularization theory. We may mention, in particular, those based on the optimization of a criterion with two parts: a data-model output matching criterion and a regularization term. Different criteria for these two terms and a great number of standard and advanced optimization algorithms have been proposed and used with great success. When these two terms are distances, they can have a Bayesian Maximum A Posteriori (MAP) interpretation, where they correspond, respectively, to the likelihood and prior probability models [1].
The Bayesian approach gives more flexibility in choosing these terms via the likelihood and the prior probability distributions. This flexibility goes much further with hierarchical models and appropriate hidden variables [2]. Also, the possibility of estimating the hyper-parameters gives much more flexibility for semi-supervised methods.
However, full Bayesian computations can become very heavy, in particular when the forward model is complex and the evaluation of the likelihood requires a high computational cost. In those cases, simpler surrogate models can become very helpful to reduce the computational costs, but we then have to account for uncertainty quantification (UQ) of the obtained results [3]. Neural Networks (NN), in their diversity, such as Convolutional NN (CNN), Deep Learning (DL), etc., have become fast and computationally cheap surrogate forward models for this purpose.
In the last three decades, Machine Learning (ML) methods and algorithms have gained great success in many computer vision (CV) tasks, such as classification, clustering, object detection, semantic segmentation, etc. These methods are mainly based on Neural Networks (NN), in particular Convolutional NN (CNN), Deep NN, etc. [4-10].
Using these methods directly for inverse problems, as intermediate pre-processing, or as tools for fast approximate computation in different steps of regularization or Bayesian inference has also seen success, but not yet as much as it could. Recently, Physics-Informed Neural Networks have gained great success in many inverse problems, proposing interactions between the Bayesian formulation of forward models, optimization algorithms, and ML-specific algorithms for intermediate hidden variables. These methods have become very helpful for obtaining approximate practical solutions to inverse problems in real-world applications [6, 8, 11-17].
In this paper, first, in Section 2, a few general ideas of ML, NN and DL are summarized; then, in Sections 3, 4 and 5, we focus on the NN and DL methods for inverse problems. First, we present some cases where we know the forward model and its adjoint. Then, we consider the case where we may not have this knowledge and want to propose directly data-driven DL methods [18, 19].
2 Machine Learning and Neural Networks: basic ideas
The main idea in Machine Learning is first to learn a model ϕ_W(x) from a great number of input-output training data; for example, in a supervised classification problem, labeled data (x_i, c_i), i = 1, · · · , N:

(x_i, c_i), i = 1, · · · , N → Learning step → W,

where the weights W of the NN model ϕ_W(x) are obtained. Then, when a new case (test input x_j) appears, the learned weights W are used to give a decision c_j = ϕ_W(x_j).
[Figure 1 diagram: training data (x_i, c_i), i = 1, · · · , N feed a machine learning algorithm that produces the model ϕ_W(x); novel data x_j then pass through the model to give the output c_j.]
Figure 1: Basic Machine Learning process: first learn a model, then use it. The learning step needs a rich enough database, which is costly. Once the model is learned and tested, its use is easy, fast and cheap.
For the quadratic regularization criterion J(f) = ∥g − Hf∥² + λ∥f∥², the solution f̂ = (H^tH + λI)^{-1}H^tg can be written in the equivalent forms f̂ = Ag, f̂ = BH^tg and (up to a factor 1/λ absorbed in the weights) f̂ = H^tCg, where A = (H^tH + λI)^{-1}H^t, B = (H^tH + λI)^{-1} and C = ((1/λ)HH^t + I)^{-1}.
These relations can be presented schematically as:
g → A → f̂,    g → H^t → B → f̂,    g → C → H^t → f̂.
As we can see, these relations directly induce a linear feed-forward NN structure. In particular, if H represents a convolution operator, then H^t, H^tH and HH^t do too, as well as the operators B and C. Thus the whole inversion can be modelled by a CNN [5, 10].
For the case of Computed Tomography (CT), the first operation is equivalent to an analytic inversion, the second corresponds to Back-Projection followed by 2D filtering in the image domain, and the third corresponds to the famous Filtered Back-Projection (FBP), which is implemented on classical CT scanners. These three cases are illustrated in Figure 2.
[Figure 2 diagram: (top) analytical inversion g → A → f̂ as a direct NN; (middle) g → H^t → B → f̂, back-projection followed by 2D filtering by a NN; (bottom) g → C → H^t → f̂, filtering by a NN followed by back-projection.]
Figure 2: Three linear NN structures which are derived directly from the quadratic regularization inversion method. The right part of this figure is adapted from [10].
In a second example, the inversion is expressed in a transform domain D (e.g., a wavelet or other sparsifying transform) with a sparsity-enforcing (ℓ1) regularization term, which leads to a thresholded solution:

f̂ = Dẑ and ẑ = S_{1/λ}(D^tg),   (4)

which can be presented schematically as:

g → D^t → Thresholding → ẑ → D → f̂,  or equivalently,  g → two-layer CNN → f̂.
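A minimal sketch of this two-layer analysis-thresholding-synthesis structure (ours; the orthonormal DCT is only a stand-in for the unspecified transform D, and the threshold follows Eq. (4)):

import numpy as np
from scipy.fft import dct, idct

def soft_threshold(x, t):
    # Soft-thresholding operator S_t(x)
    return np.sign(x) * np.maximum(np.abs(x) - t, 0.0)

def two_layer_inversion(g, lam):
    z = dct(g, norm="ortho")                  # layer 1: analysis, D^t g
    z_hat = soft_threshold(z, 1.0 / lam)      # point-wise nonlinearity S_{1/lambda}
    return idct(z_hat, norm="ortho")          # layer 2: synthesis, f_hat = D z_hat

g = np.random.default_rng(1).standard_normal(128)   # toy data
f_hat = two_layer_inversion(g, lam=5.0)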
3.3 Third example: a Deep Learning equivalent of iterative gradient-based algorithms
One of the classical iterative methods for linear inverse problems is based on the gradient descent method to optimize J(f) = ∥g − Hf∥²:

f^(k+1) = f^(k) + αH^t(g − Hf^(k)) = (I − αH^tH)f^(k) + αH^tg,

where the solution of the problem is obtained recursively. It is well known that, when the forward model operator H is singular or ill-conditioned, this iterative algorithm starts by converging, but may then easily diverge. One experimental method to obtain an acceptable approximate solution is simply to stop after K iterations. This idea can be translated to a Deep Learning NN with K layers, each layer representing one iteration of the algorithm. See Figure 3.
[Figure 3 diagram: f^(1) passes through K identical layers, each computing f^(k+1) = (I − αH^tH)f^(k) + αH^tg, to produce f^(K).]
Considering now the ℓ1-regularized criterion

J(f) = ∥g − Hf∥₂² + λ∥f∥₁,   (7)

the same gradient step combined with a thresholding (proximal) step gives the iterations

f^(k+1) = S_θ((I − αH^tH)f^(k) + αH^tg),

where S_θ is a soft-thresholding operator and α ≤ 1/L, with L = max|eig(H^tH)| the Lipschitz constant of the gradient of the data term. When H is a convolution operator, then:
• (I − αH^tH)f^(k) can also be approximated by a convolution and thus considered as a filtering operator;
• αH^tg can be considered as a bias term and is also a convolution operator; and
• S_{θ=λα} is a nonlinear point-wise operator. In particular, when f is a positive quantity, this soft-thresholding operator can be compared to the ReLU activation function of a NN. See Figure 4.
[Figure 4 diagram: one layer of the unrolled algorithm, computing f^(k+1) = S_θ((I − αH^tH)f^(k) + αH^tg) from f^(k) and the bias αH^tg.]
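The following sketch (ours; H, λ and K are arbitrary illustrative choices) unrolls these thresholded gradient iterations as a K-layer network with shared filtering weight W = I − αH^tH and bias b = αH^tg:

import numpy as np

def soft_threshold(x, t):
    return np.sign(x) * np.maximum(np.abs(x) - t, 0.0)

def unrolled_ista(g, H, lam, K):
    L = np.max(np.linalg.eigvalsh(H.T @ H))     # Lipschitz constant of H^t H
    alpha = 1.0 / L
    W = np.eye(H.shape[1]) - alpha * (H.T @ H)  # shared layer weight I - alpha H^t H
    b = alpha * (H.T @ g)                       # bias term alpha H^t g
    f = np.zeros(H.shape[1])
    for _ in range(K):                          # one network layer per iteration
        f = soft_threshold(W @ f + b, lam * alpha)
    return f

rng = np.random.default_rng(2)
H = rng.standard_normal((30, 50))               # toy underdetermined operator
f_true = np.zeros(50)
f_true[[5, 17, 33]] = [1.0, -2.0, 1.5]          # sparse ground truth
g = H @ f_true + 0.01 * rng.standard_normal(30)
f_hat = unrolled_ista(g, H, lam=0.1, K=200)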
In all these examples, we could directly obtain the structure of the NN from the forward model and known parameters. However, these approaches have some difficulties, which lie in the determination of the structure of the NN. For example, in the first example, obtaining the structure of B depends on the regularization parameter λ. The same difficulty arises for determining the shape and the threshold level of the Thresholding block of the network in the second example. The regularization parameter, as well as many other hyper-parameters, is needed to create the NN structure and weights. In practice, we can decide, for example, on the number and structure of the layers of a DL
[Figure 6 diagram: a chain of K layers producing f̂^(1), …, f̂^(K); at each layer k, the data g enter through W0 and are added to W^(k) applied to the previous layer's output.]
Figure 6: The K layers of a DL NN equivalent to K iterations of an iterative gradient-based optimization algorithm. The simplest solution is to choose W0 = αH^t and W^(k) = W = (I − αH^tH), k = 1, · · · , K. A more robust, but more costly, option is to learn all the layers W^(k) = (I − α^(k)H^tH), k = 1, · · · , K.
network, but as their corresponding weights depend on many unknown or difficult-to-fix parameters, ML may become of help. In the following, we first consider the training part of a general ML method. Then, we will see how to include the physics-based knowledge of the forward model in the structure of learning.
One possible decomposition of the NN structure uses the pseudo-inverse of the forward operator:

H† = [H^tH]^{-1}H^t  or  H† = H^t[HH^t]^{-1},   (9)
[Figure 7 diagram: training data (g_k, f_k), k = 1, · · · , K pass through a fixed physics-based part, f̃_k = H^tg_k, then a trainable part whose parameters are obtained as B̂ = arg min_B Σ_{k=1}^K ∥f_k − ϕ(Bf̃_k)∥² + λR(B); at test time, new data g pass through the same fixed part and the learned B̂.]
Figure 7: Training (top) and testing (bottom) steps in the first use of the physics-based ML approach.
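A small sketch of this training scheme (ours; ϕ is taken as the identity and R(B) = ∥B∥²_F, so the trainable part has a closed-form ridge-regression solution; all sizes are illustrative):

import numpy as np

rng = np.random.default_rng(3)
m, n, K = 40, 60, 500
H = rng.standard_normal((m, n))                     # toy forward operator

F = rng.standard_normal((K, n))                     # ground-truth examples f_k
G = F @ H.T + 0.01 * rng.standard_normal((K, m))    # data g_k = H f_k + noise
F_tilde = G @ H                                     # fixed physics part: H^t g_k

lam = 1e-2
# Trainable part: B = argmin_B sum_k ||f_k - B f_tilde_k||^2 + lam ||B||_F^2
B = np.linalg.solve(F_tilde.T @ F_tilde + lam * np.eye(n), F_tilde.T @ F).T

g_test = H @ rng.standard_normal(n)                 # novel measurement
f_hat = B @ (H.T @ g_test)                          # test step: fixed part, then B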
The Singular Value Decomposition (SVD) of the operators [H^tH] and [HH^t] gives another possible decomposition of the NN structure. Let us note

[H^tH] = U∆U′  and  [HH^t] = V∆V′,   (10)

where ∆ is a diagonal matrix containing the singular values, and U and V contain the corresponding eigenvectors. This can be used to decompose W into four operators:

W = V′∆UH^t  or  W = H^tV∆U′,   (11)
where three of them can be fixed and only one, ∆, is trainable. It is interesting to note that when the forward operator H has a shift-invariant (convolution) property, the operators U and V′ correspond, respectively, to the FT and IFT operators, and the diagonal elements of ∆ correspond to the FT of the impulse response of the convolution forward operator. So, we will have a fixed layer corresponding to H^t, which can be interpreted as matched filtering, then a fixed FT layer, which is a feed-forward linear network, a trainable filtering part corresponding to the diagonal elements of ∆, and a fourth fixed layer corresponding to the IFT. See Figure 8.
Figure 8: A four-layer NN with three fixed physics-based layers, corresponding to H^t, U′ (FT) and V (IFT), and one trainable layer corresponding to ∆.
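A hedged sketch of this four-layer structure for a 1-D convolutional H (ours; the kernel and the parametrization of the trainable diagonal are illustrative assumptions), written so the diagonal is trainable by backpropagation:

import torch

class FourLayerInverter(torch.nn.Module):
    # Fixed matched filter H^t, fixed FFT, trainable diagonal, fixed IFFT.
    def __init__(self, h):
        super().__init__()
        self.h_fft = torch.fft.fft(h)                     # spectrum of the blur kernel
        n = h.numel()
        self.diag_re = torch.nn.Parameter(torch.ones(n))  # trainable diagonal (real part)
        self.diag_im = torch.nn.Parameter(torch.zeros(n)) # trainable diagonal (imag part)

    def forward(self, g):
        G = torch.fft.fft(g)                              # fixed FT layer
        Gm = torch.conj(self.h_fft) * G                   # fixed H^t (matched filtering)
        D = torch.complex(self.diag_re, self.diag_im)     # trainable filtering part
        return torch.fft.ifft(D * Gm).real                # fixed IFT layer

n = 64
h = torch.zeros(n)
h[:5] = torch.tensor([0.1, 0.2, 0.4, 0.2, 0.1])           # toy convolution kernel
net = FourLayerInverter(h)
f_hat = net(torch.randn(n))                               # differentiable w.r.t. the diagonal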
The main issue is the limited number of input-output data examples {(f, g)_k, k = 1, 2, ..., K} available for the training step of the network.

The scheme that we presented is general and can be extended to any multi-layer NN and DL. In fact, if we had a great number of data-ground-truth examples {(f, g)_k, k = 1, 2, ..., K}, with K much larger than the number of elements W_{m,n} of the weighting parameters W, then we would not even need the forward model H. This can be possible for very low-dimensional problems [10]. But, in general, in practice we do not have enough data, so some prior or regularizer is needed to obtain a usable solution.
[Figure 10 diagram: input IR image g → denoising (C1 − Th − C2) → deconvolution (C3 − Thr − C4) → SegNet → segmented image.]
Figure 10: The proposed NN, with four groups of layers, for the denoising, deconvolution and segmentation of IR images.
Figure 11: Example of expected results. First row: a simulated IR image (left), its ground-truth labels (middle), and the result of the deconvolution and segmentation (right). Second row: a real IR image (left) and the result of its deconvolution and segmentation (right).
To train this NN, we can generate synthetic images with different known shapes to serve as ground truth, and simulate the blurring effect of temperature diffusion via convolution with different appropriate point spread functions, adding some noise to generate realistic images. We can also use black-body thermal sources, for which we know the shape and the exact temperature, and acquire different images under different conditions. All these images can be used for the training of the network.
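A minimal sketch of this data-generation step (ours; rectangular hot spots, a Gaussian PSF and a fixed noise level stand in for the actual simulation):

import numpy as np
from scipy.ndimage import gaussian_filter

def make_pair(rng, size=64, sigma=2.0, noise=0.02):
    # One (ground truth, simulated IR image) training pair.
    f = np.zeros((size, size))
    x0, y0 = rng.integers(10, size - 25, size=2)       # random position
    w, h = rng.integers(5, 15, size=2)                 # random extent
    f[y0:y0 + h, x0:x0 + w] = rng.uniform(0.5, 1.0)    # rectangular hot spot
    g = gaussian_filter(f, sigma)                      # diffusion blur (Gaussian PSF)
    g += noise * rng.standard_normal(f.shape)          # measurement noise
    return f, g

rng = np.random.default_rng(4)
pairs = [make_pair(rng) for _ in range(100)]           # 100 simulated training images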
We propose then to use a DL structure with four groups of layers, as shown in Figure 10, and to train it with one hundred artificially generated images and one hundred images obtained with a black-body experiment. The trained model can then be used for the desired task on a test set of images. In Figure 11, we show one such expected result. More details will be given in a forthcoming paper.
7 Conclusions and Challenges
Classical methods for inverse problems are mainly based on regularization methods or on Bayesian
inference with a connection between them via the Maximum A Posteriori (MAP) point estimation.
The Bayesian approach gives more flexibility, in particular for the determination of the regularization parameter. However, whether deterministic or Bayesian, the computations remain a great problem for high-dimensional problems.
Recently, Machine Learning (ML) methods have become a great help for some aspects of these difficulties. Nowadays, ML, Neural Networks (NN), Convolutional NN (CNN) and Deep Learning (DL) methods have obtained great success in classification, clustering, object detection, speech and face recognition, etc. But they need a great number of training data, and they may fail very easily, in particular for inverse problems.
In fact, using only data-based NN, without any specific structure coming from the forward model (physics), may work for small-size problems. However, the progress arrives via their interaction with the model-based methods. In fact, the success of CNN and DL methods greatly depends on the appropriate choice of the network structure. This choice can be guided by the model-based methods [3, 4, 10, 20-25].
In this work, we presented a few examples of such interactions. We explored a few cases: first, when the forward operator is known; then, when we use the forward model partially or in the transform domain. As we could see, the main contribution of ML and NN tools can be in reducing the costs of the inversion method once an appropriate model is trained. However, to obtain a good model, there is a need for sufficiently rich data and a good network structure obtained from the physics knowledge of the problem at hand.
For inverse problems where the forward models are nonlinear and complex, NN and DL may be of great help. However, we may still need to choose the structure of the NN via an approximate forward model and approximate Bayesian inversion [11, 12, 14].
References
[1] A. Mohammad-Djafari, "Inverse problems in imaging science: from classical regularization methods to state-of-the-art Bayesian methods," in International Image Processing, Applications and Systems Conference, pp. 1-2, Nov 2014.
[2] H. Ayasso and A. Mohammad-Djafari, "Joint NDT image restoration and segmentation using Gauss-Markov-Potts prior models and variational Bayesian computation," IEEE Transactions on Image Processing, vol. 19, pp. 2265-2277, Sept 2010.
[3] Y. Zhu and N. Zabaras, “Bayesian deep convolutional encoder–decoder networks for surrogate
modeling and uncertainty quantification,” Journal of Computational Physics, 2018.
[4] M. Unser, K. H. Jin, and M. T. McCann, “A review of convolutional neural networks for inverse
problems in imaging,” ArXiv, 2017.
[5] I. Y. Chun, Z. Huang, H. Lim, and J. Fessler, “Momentum-net: Fast and convergent iterative neu-
ral network for inverse problems,” IEEE Transactions on Pattern Analysis and Machine Intelligence,
pp. 1–1, 2020.
[6] Z. Fang, “A high-efficient hybrid physics-informed neural networks based on convolutional neu-
ral network,” IEEE Transactions on Neural Networks and Learning Systems, pp. 1–13, 2021.
[7] G. Ongie, A. Jalal, C. A. Metzler, R. G. Baraniuk, A. G. Dimakis, and R. Willett, “Deep learning
techniques for inverse problems in imaging,” IEEE Journal on Selected Areas in Information Theory,
vol. 1, no. 1, pp. 39–56, 2020.
[8] D. Gong, Z. Zhang, Q. Shi, A. van den Hengel, C. Shen, and Y. Zhang, “Learning deep gradient
descent optimization for image deconvolution,” IEEE Transactions on Neural Networks and Learning
Systems, vol. 31, no. 12, pp. 5468–5482, 2020.
[9] S. Ren, K. Sun, C. Tan, and F. Dong, “A two-stage deep learning method for robust shape recon-
struction with electrical impedance tomography,” IEEE Transactions on Instrumentation and Mea-
surement, vol. 69, no. 7, pp. 4887–4897, 2020.
[10] A. Lucas, M. Iliadis, R. Molina, and A. K. Katsaggelos, “Using deep neural networks for inverse
problems in imaging: Beyond analytical methods,” IEEE Signal Processing Magazine, 2018.
[11] M. Raissi, P. Perdikaris, and G. E. Karniadakis, “Physics informed deep learning (part i): Data-
driven solutions of nonlinear partial differential equations,” arXiv preprint arXiv:1711.10561, 2017.
[12] M. Raissi, P. Perdikaris, and G. E. Karniadakis, “Physics informed deep learning (part ii): Data-
driven discovery of nonlinear partial differential equations,” arXiv preprint arXiv:1711.10566, 2017.
[13] Y. Chen, L. Lu, G. E. Karniadakis, and L. D. Negro, “Physics-informed neural networks for inverse
problems in nano-optics and metamaterials,” arXiv: Computational Physics, 2019.
[14] M. Raissi, P. Perdikaris, and G. E. Karniadakis, “Physics-informed neural networks: A deep learn-
ing framework for solving forward and inverse problems involving nonlinear partial differential
equations,” Journal of Computational Physics, 2019.
[15] D. Gilton, G. Ongie, and R. Willett, “Neumann networks for linear inverse problems in imaging,”
IEEE Transactions on Computational Imaging, vol. 6, pp. 328–343, 2020.
[16] K. de Haan, Y. Rivenson, Y. Wu, and A. Ozcan, “Deep-learning-based image reconstruction and
enhancement in optical microscopy,” Proceedings of the IEEE, vol. 108, no. 1, pp. 30–50, 2020.
[17] H. K. Aggarwal, M. P. Mani, and M. Jacob, "MoDL: Model-based deep learning architecture for inverse problems," IEEE Transactions on Medical Imaging, vol. 38, no. 2, pp. 394-405, 2019.
[18] A. Mohammad-Djafari, "Hierarchical Markov modeling for fusion of X-ray radiographic data and anatomical data in computed tomography," in Proceedings IEEE International Symposium on Biomedical Imaging, pp. 401-404, July 2002.
[19] A. Mohammad-Djafari, "Regularization, Bayesian inference and machine learning methods for inverse problems," Entropy, vol. 23, no. 12, p. 1673, 2021.
[20] T. Meinhardt, M. Moeller, C. Hazirbas, and D. Cremers, “Learning proximal operators: Using de-
noising networks for regularizing inverse imaging problems,” arXiv: Computer Vision and Pattern
Recognition, 2017.
[21] S. Vettam and M. John, "Regularized deep learning with a non-convex penalty," arXiv: Machine Learning, 2019.
[22] R. Guidotti, A. Monreale, F. Turini, D. Pedreschi, and F. Giannotti, “A survey of methods for
explaining black box models,” arXiv: Computers and Society, 2018.
[23] K. H. Jin, M. T. McCann, and M. Unser, “A review of convolutional neural networks for inverse
problems in imaging,” ArXiv, 2017.
[24] J. H. R. Chang, C.-L. Li, B. Poczos, B. V. K. V. Kumar, and A. C. Sankaranarayanan, “One net-
work to solve them all — solving linear inverse problems using deep projection models,” arXiv:
Computer Vision and Pattern Recognition, 2017.
[25] S. Mo, N. Zabaras, X. Shi, and J. Wu, “Deep autoregressive neural networks for high-dimensional
inverse problems in groundwater contaminant source identification,” arXiv: Machine Learning,
2018.