Deep image prior

Deep image prior is a type of convolutional neural network used to enhance a given image with no prior training data other than the image itself. A neural network is randomly initialized and used as a prior to solve inverse problems such as noise reduction, super-resolution, and inpainting. Image statistics are captured by the structure of a convolutional image generator rather than by any previously learned capabilities.

Method

Background

Inverse problems such as noise reduction, super-resolution, and inpainting can be formulated as the optimization task $x^{*}=\arg\min_{x}E(x;x_{0})+R(x)$, where $x$ is an image, $x_{0}$ is a corrupted representation of that image, $E(x;x_{0})$ is a task-dependent data term, and $R(x)$ is the regularizer. This forms an energy minimization problem.

A deep neural network can serve as a generator (decoder) $x=f_{\theta }(z)$ that maps a random code vector $z$ to an image $x$, where $\theta$ denotes the network parameters.

The image corruption method used to generate $x_{0}$ is selected for the specific application.

Specifics

In this approach, the $R(x)$ prior is replaced with the implicit prior captured by the neural network (where $R(x)=0$ for images that can be produced by the deep neural network and $R(x)=+\infty$ otherwise). This yields the equation for the minimizer $\theta ^{*}=\arg\min_{\theta }E(f_{\theta }(z);x_{0})$ and the result of the optimization process $x^{*}=f_{\theta ^{*}}(z)$.
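The substitution can be written out step by step. Treating the prior as an indicator function over the set of images the network can produce (the set notation below is introduced here for illustration only), the regularized problem over $x$ reduces to an unregularized problem over $\theta$:

$$R(x)=\begin{cases}0, & x=f_{\theta }(z)\ \text{for some}\ \theta \\ +\infty , & \text{otherwise}\end{cases}$$

$$\min_{x}\,E(x;x_{0})+R(x)\;=\;\min_{x\in \{f_{\theta }(z)\}}E(x;x_{0})\;=\;\min_{\theta }\,E(f_{\theta }(z);x_{0})$$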

The minimizer $\theta ^{*}$ is found with an optimizer (typically gradient descent) that starts from randomly initialized parameters and descends to a local minimum. The restored image is then $x^{*}=f_{\theta ^{*}}(z)$.
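The mechanics of this optimization can be illustrated with a minimal sketch. It is not the network from the original work: the generator here is a hypothetical stand-in (a three-tap circular 1-D convolution of a fixed random code), chosen so the gradient can be written by hand without any libraries. The function names, signal length, step size, and noise level are all assumptions made for illustration.

```python
import random

def generator(theta, z):
    """Toy f_theta(z): circular 1-D convolution of the fixed code z with kernel theta.
    This is a stand-in for a convolutional network, not a real one."""
    n, k = len(z), len(theta)
    return [sum(theta[j] * z[(i + j - k // 2) % n] for j in range(k))
            for i in range(n)]

def energy(x, x0):
    """Quadratic data term E(x; x0) = ||x - x0||^2."""
    return sum((a - b) ** 2 for a, b in zip(x, x0))

def fit(x0, z, theta_init, steps=200, lr=0.005):
    """Plain gradient descent on theta for E(f_theta(z); x0); z stays fixed."""
    n, k = len(z), len(theta_init)
    theta = list(theta_init)
    for _ in range(steps):
        residual = [a - b for a, b in zip(generator(theta, z), x0)]
        # dE/dtheta_j = 2 * sum_i residual_i * z[(i + j - k//2) mod n]
        grad = [2.0 * sum(residual[i] * z[(i + j - k // 2) % n] for i in range(n))
                for j in range(k)]
        theta = [t - lr * g for t, g in zip(theta, grad)]
    return theta

rng = random.Random(0)
n = 16
clean = [1.0 if 4 <= i < 12 else 0.0 for i in range(n)]   # hypothetical clean signal
x0 = [c + rng.gauss(0.0, 0.1) for c in clean]             # corrupted observation
z = [rng.uniform(-1.0, 1.0) for _ in range(n)]            # fixed random code
theta_init = [rng.uniform(-0.1, 0.1) for _ in range(3)]   # random initialization

theta_star = fit(x0, z, theta_init)
x_star = generator(theta_star, z)                         # x* = f_{theta*}(z)
print(energy(generator(theta_init, z), x0), "->", energy(x_star, x0))
```

A three-parameter kernel cannot reproduce the signal well, so the sketch shows only the procedure: the code $z$ is held fixed, only $\theta$ is updated, and the output image is read off the generator once the descent stops.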

Applications

Denoising

The principle of denoising is to recover an image $x$ from a noisy observation $x_{0}$, where $x_{0}=x+\epsilon$. The distribution of the noise $\epsilon$ is sometimes known (e.g., from profiling sensor and photon noise^[2]) and may optionally be incorporated into the model, though the method also works well for blind denoising, where the noise distribution is unknown.

The quadratic energy function $E(x;x_{0})=||x-x_{0}||^{2}$ is used as the data term. Plugging it into the equation for $\theta ^{*}$ yields the optimization problem $\min_{\theta }||f_{\theta }(z)-x_{0}||^{2}$.

Super-resolution

Super-resolution is used to generate a higher-resolution version $x$ of a low-resolution image $x_{0}$. The data term is set to $E(x;x_{0})=||d(x)-x_{0}||^{2}$, where $d(\cdot )$ is a downsampling operator such as Lanczos that decimates the image by a factor $t$.
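The data term above can be sketched in a few lines. For simplicity, this example assumes a box (block-average) filter for $d(\cdot )$ rather than Lanczos, and represents images as nested Python lists; both choices, and the function names, are illustrative assumptions.

```python
def downsample(x, t):
    """Decimate a 2-D image by factor t using block averaging.
    A simple stand-in for d(.); this is not a Lanczos filter."""
    h, w = len(x) // t, len(x[0]) // t
    return [[sum(x[i * t + a][j * t + b] for a in range(t) for b in range(t)) / (t * t)
             for j in range(w)]
            for i in range(h)]

def sr_energy(x, x0, t):
    """E(x; x0) = ||d(x) - x0||^2 for a high-resolution candidate x
    and a low-resolution observation x0."""
    d = downsample(x, t)
    return sum((d[i][j] - x0[i][j]) ** 2
               for i in range(len(x0)) for j in range(len(x0[0])))

# A 4x4 candidate whose 2x2 blocks average exactly to the 2x2 observation.
x_high = [[1.0, 1.0, 2.0, 2.0],
          [1.0, 1.0, 2.0, 2.0],
          [3.0, 3.0, 4.0, 4.0],
          [3.0, 3.0, 4.0, 4.0]]
x0_low = [[1.0, 2.0],
          [3.0, 4.0]]
print(sr_energy(x_high, x0_low, 2))
```

Many different high-resolution images downsample to the same $x_{0}$, so the data term alone does not single out one of them. The choice among them is left to the prior imposed by the generator.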

Inpainting

Inpainting is used to reconstruct a missing area in an image $x_{0}$. The missing pixels are marked by a binary mask $m\in \{0,1\}^{H\times V}$, which is 0 at missing pixels and 1 at known pixels. The data term is defined as $E(x;x_{0})=||(x-x_{0})\odot m||^{2}$ (where $\odot$ is the Hadamard product).

The intuition is that the loss is computed only on the known pixels, yet the network learns enough about the image to fill in the unknown parts even though the loss does not include those pixels. This strategy can be used to remove image watermarks by treating the watermark as missing pixels in the image.
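A small sketch makes the role of the mask concrete. It assumes images and masks stored as nested Python lists with mask value 1 at known pixels and 0 at missing ones; the function name and sample values are illustrative.

```python
def inpainting_energy(x, x0, m):
    """E(x; x0) = ||(x - x0) ⊙ m||^2, where m is 1 at known pixels and 0 at missing ones."""
    return sum(((x[i][j] - x0[i][j]) * m[i][j]) ** 2
               for i in range(len(x0)) for j in range(len(x0[0])))

x0 = [[1.0, 2.0],
      [3.0, 0.0]]          # bottom-right pixel is missing (its stored value is ignored)
m = [[1, 1],
     [1, 0]]               # mask: 0 marks the missing pixel

x_a = [[1.0, 2.0], [3.0, 9.0]]   # matches all known pixels, arbitrary fill-in
x_b = [[1.0, 2.0], [3.0, -5.0]]  # same known pixels, different fill-in
print(inpainting_energy(x_a, x0, m), inpainting_energy(x_b, x0, m))
```

Both candidates receive the same energy because they differ only at the masked-out pixel. The value placed there is therefore determined entirely by what the generator produces, not by the data term.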

This article uses material from the Wikipedia article Deep_Image_Prior, and is written by contributors. Text is available under a CC BY-SA 4.0 International License; additional terms may apply. Images, videos and audio are available under their respective licenses.

References

[1] https://sites.skoltech.ru/app/data/uploads/sites/25/2018/04/deep_image_prior.pdf

[2] jo (2012-12-11). "profiling sensor and photon noise .. and how to get rid of it". darktable.

[3] "DmitryUlyanov/Deep-image-prior". GitHub. 3 June 2021.

[4] https://apod.nasa.gov/apod/astropix.doc
