3D_reconstruction_from_multiple_images

3D reconstruction from multiple images

Creation of a 3D model from a set of images

3D reconstruction from multiple images is the creation of three-dimensional models from a set of images. It is the reverse process of obtaining 2D images from 3D scenes.

An image is a projection of a 3D scene onto a 2D plane, and depth is lost in the process. The 3D point corresponding to a given image point is constrained to lie on that point's line of sight, but from a single image it is impossible to determine which point on this line was imaged. If two images are available, the position of the 3D point can be found as the intersection of the two projection rays, a process referred to as triangulation. The key to this process is the set of relations between multiple views: corresponding points must satisfy geometric constraints, and these constraints are determined by the poses and the calibration of the cameras.
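The triangulation step described above can be sketched as follows. This is a minimal illustration using the common linear (DLT) formulation, which is one of several ways to intersect two projection rays and is not taken from the article. The camera matrices, the calibration values and the helper names `triangulate_dlt` and `project` are assumptions chosen for the example.

```python
import numpy as np

def triangulate_dlt(P1, P2, x1, x2):
    """Linear (DLT) triangulation of one point from two views.

    P1, P2: 3x4 camera projection matrices.
    x1, x2: (u, v) image coordinates of the same scene point in each view.
    Returns the 3D point in inhomogeneous coordinates.
    """
    P1, P2 = np.asarray(P1, float), np.asarray(P2, float)
    # Each view gives two linear equations in the homogeneous point X:
    # u * (p3 . X) - (p1 . X) = 0 and v * (p3 . X) - (p2 . X) = 0.
    M = np.stack([
        x1[0] * P1[2] - P1[0],
        x1[1] * P1[2] - P1[1],
        x2[0] * P2[2] - P2[0],
        x2[1] * P2[2] - P2[1],
    ])
    # The solution is the right singular vector of the smallest singular value.
    _, _, Vt = np.linalg.svd(M)
    X = Vt[-1]
    return X[:3] / X[3]

def project(P, X):
    """Project an inhomogeneous 3D point X with camera P to image coordinates."""
    x = P @ np.append(X, 1.0)
    return x[:2] / x[2]

# Hypothetical setup: two identical cameras, the second shifted 1 unit along x.
calib = np.array([[800.0, 0, 320], [0, 800, 240], [0, 0, 1]])
P1 = calib @ np.hstack([np.eye(3), np.zeros((3, 1))])
P2 = calib @ np.hstack([np.eye(3), np.array([[-1.0], [0], [0]])])

X_true = np.array([0.3, -0.2, 5.0])
X_est = triangulate_dlt(P1, P2, project(P1, X_true), project(P2, X_true))
```

With noise-free projections the two rays meet exactly and `X_est` reproduces `X_true`. With noisy measurements the rays generally do not intersect, and the linear solution is only an approximation of the best point.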

In recent decades there has been a strong demand for 3D content in computer graphics, virtual reality and communication, which has shifted the requirements placed on reconstruction systems. Many existing systems for constructing 3D models are built around specialized hardware (e.g. stereo rigs), which makes them too costly for these new applications. This gap has stimulated the use of ordinary digital imaging devices such as cameras. An early method was proposed by Tomasi and Kanade.^[2] They used an affine factorization approach to extract 3D structure from image sequences. However, the assumption of orthographic projection is a significant limitation of this system.

Mathematical description of reconstruction

Given a group of 3D points $\{w_{j}\}$ viewed by N cameras with matrices $\{P^{i}\}_{i=1\ldots N}$ , define $m_{j}^{i}\simeq P^{i}w_{j}$ to be the homogeneous coordinates of the projection of the $j^{th}$ point onto the $i^{th}$ camera. The reconstruction problem can then be stated as follows: given the group of pixel coordinates $\{m_{j}^{i}\}$ , find the corresponding set of camera matrices $\{P^{i}\}$ and the scene structure $\{w_{j}\}$ such that

$m_{j}^{i}\simeq P^{i}w_{j}\qquad (1)$

Generally, without further restrictions, only a projective reconstruction is obtained.^[4]^[5] If $\{P^{i}\}$ and $\{w_{j}\}$ satisfy (1), then so do $\{P^{i}T\}$ and $\{T^{-1}w_{j}\}$ for any 4 × 4 nonsingular matrix T, because $P^{i}TT^{-1}w_{j}=P^{i}w_{j}$ .

A projective reconstruction can be computed from point correspondences alone, without any a priori information.
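The projective ambiguity described above can be checked numerically. The sketch below uses an arbitrary camera, a few arbitrary points and a hand-picked nonsingular matrix `T`; all of these values are assumptions made for illustration only.

```python
import numpy as np

def project(P, w):
    """Project homogeneous 3D points w (4xN) with camera P (3x4) to pixels (2xN)."""
    m = P @ w
    return m[:2] / m[2]

# Hypothetical camera and scene points (one homogeneous point per column).
calib = np.array([[800.0, 0, 320], [0, 800, 240], [0, 0, 1]])
P = calib @ np.hstack([np.eye(3), np.zeros((3, 1))])
w = np.array([[0.3, -0.5, 1.0],
              [-0.2, 0.4, 0.1],
              [5.0, 4.0, 6.0],
              [1.0, 1.0, 1.0]])

# Strictly diagonally dominant, hence nonsingular. The last row is not
# (0, 0, 0, 1), so T is a genuinely projective (non-affine) transformation.
T = np.array([[2.0, 0.2, 0.0, 0.5],
              [0.0, 2.0, 0.3, -1.0],
              [0.1, 0.0, 3.0, 2.0],
              [0.05, -0.02, 0.01, 1.0]])

P_new = P @ T                  # transformed camera
w_new = np.linalg.inv(T) @ w   # transformed structure

m_original = project(P, w)
m_transformed = project(P_new, w_new)
```

Both pairs produce the same image points even though `w_new` differs from `w`, which is why image measurements alone cannot distinguish between the two reconstructions.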

Auto-calibration

In auto-calibration (or self-calibration), the camera motion and parameters are recovered first, using the rigidity of the scene. The structure can then be readily calculated. Two methods implementing this idea are presented below:

Kruppa equations

With a minimum of three displacements, we can obtain the internal parameters of the camera using a system of polynomial equations due to Kruppa,^[6] which are derived from a geometric interpretation of the rigidity constraint.^[7]^[8]

The unknown in the Kruppa equations is the matrix $K=AA^{\top }$ , called the Kruppa coefficient matrix, where $A$ is the matrix of intrinsic parameters. Once K is known, the intrinsic parameters can be obtained easily by Cholesky factorization:

K={\begin{bmatrix}k_{1}&k_{2}&k_{3}\\k_{2}&k_{4}&k_{5}\\k_{3}&k_{5}&1\\\end{bmatrix}}
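As a rough sketch of the Cholesky step, assume that $A$ is upper triangular with a positive diagonal and is normalized so that its bottom-right entry is 1 (a common convention for intrinsic matrices, assumed here rather than stated in the text). A standard Cholesky routine returns a lower-triangular factor, so the example factorizes the inverse of K instead. The function name and the numerical values are hypothetical.

```python
import numpy as np

def intrinsics_from_kruppa_matrix(K):
    """Recover an upper-triangular A with K = A A^T and A[2, 2] = 1.

    Assumes K is symmetric positive definite and A has a positive diagonal.
    """
    K = np.asarray(K, float)
    # np.linalg.cholesky returns a lower-triangular L with K^-1 = L L^T.
    # Since K^-1 = A^-T A^-1 and A^-T is lower triangular, L equals A^-T.
    L = np.linalg.cholesky(np.linalg.inv(K))
    A = np.linalg.inv(L.T)
    # K is only defined up to scale, so fix the scale of A afterwards.
    return A / A[2, 2]

# Hypothetical intrinsic matrix: focal lengths, skew and principal point.
A_true = np.array([[800.0, 0.5, 320.0],
                   [0.0, 780.0, 240.0],
                   [0.0, 0.0, 1.0]])
K = A_true @ A_true.T          # bottom-right entry of K is 1, as in the text
A_rec = intrinsics_from_kruppa_matrix(K)
```

Factorizing the inverse is one of several equivalent ways to obtain an upper-triangular factor. Other orderings of the factorization would work equally well.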

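Hartley ^[9] later proposed a simpler form. Let the singular value decomposition of the fundamental matrix $F$ be $F=UDV^{\top }$ , where $U$ and $V$ are orthogonal matrices and $D=\operatorname {diag} (r,s,0)$ contains the singular values of $F$ .

The Kruppa equations can then be rewritten in terms of $r$ , $s$ and the columns of $U$ and $V$ (the derivation can be found in ^[9]).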

Mendonça and Cipolla

This method is based on the rigidity constraint. A cost function is designed that takes the intrinsic parameters as arguments and the fundamental matrices as parameters. Here ${F}_{ij}$ denotes the fundamental matrix between views $i$ and $j$ , and ${A}_{i}$ and ${A}_{j}$ the corresponding intrinsic parameter matrices.
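The text does not give the cost function itself. A commonly cited form uses the fact that the essential matrix $E_{ij}=A_{j}^{\top }F_{ij}A_{i}$ of a rigid motion has two equal nonzero singular values, and penalizes their relative difference. The sketch below follows that idea. The exact weighting, the convention $x_{j}^{\top }F_{ij}x_{i}=0$, the function names and the synthetic motion are assumptions made for this example.

```python
import numpy as np

def skew(t):
    """Cross-product matrix, so that skew(t) @ v equals np.cross(t, v)."""
    return np.array([[0.0, -t[2], t[1]],
                     [t[2], 0.0, -t[0]],
                     [-t[1], t[0], 0.0]])

def mendonca_cipolla_cost(intrinsics, fundamentals, weights=None):
    """Sum over view pairs of w_ij * (s1 - s2) / s2 for E_ij = A_j^T F_ij A_i.

    intrinsics:   dict {view index: 3x3 intrinsic matrix A}
    fundamentals: dict {(i, j): 3x3 matrix F_ij}, assuming x_j^T F_ij x_i = 0
    weights:      optional dict {(i, j): confidence weight}, default 1
    """
    cost = 0.0
    for (i, j), F in fundamentals.items():
        E = intrinsics[j].T @ F @ intrinsics[i]
        s = np.linalg.svd(E, compute_uv=False)  # sorted in descending order
        w = 1.0 if weights is None else weights[(i, j)]
        cost += w * (s[0] - s[1]) / s[1]
    return cost

# Synthetic two-view example with a hypothetical rigid motion (R, t).
theta = 0.3
R = np.array([[np.cos(theta), 0.0, np.sin(theta)],
              [0.0, 1.0, 0.0],
              [-np.sin(theta), 0.0, np.cos(theta)]])
t = np.array([1.0, 0.2, 0.1])
A_true = np.array([[800.0, 0, 320], [0, 800, 240], [0, 0, 1]])
A_inv = np.linalg.inv(A_true)
F = A_inv.T @ skew(t) @ R @ A_inv      # fundamental matrix of the pair

A_wrong = A_true.copy()
A_wrong[0, 0] = A_wrong[1, 1] = 500.0  # deliberately wrong focal length

cost_true = mendonca_cipolla_cost({0: A_true, 1: A_true}, {(0, 1): F})
cost_wrong = mendonca_cipolla_cost({0: A_wrong, 1: A_wrong}, {(0, 1): F})
```

The cost is close to zero at the true intrinsic parameters and larger for the wrong focal length. In a self-calibration setting it would be minimized over the intrinsic parameters with a nonlinear optimizer, which is not shown here.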

Stratification

Methods based on the concept of stratification have also been proposed.^[10] Starting from a projective structure, which can be calculated from correspondences alone, the reconstruction is upgraded to a Euclidean one by making use of all the available constraints. With this idea the problem can be stratified into different levels: depending on the constraints available, it can be analyzed at the projective, affine or Euclidean level.

The stratification of 3D geometry

Usually, the world is perceived as a 3D Euclidean space. In some cases, however, it is not possible to use the full Euclidean structure of 3D space. The simplest stratum is the projective one, followed by the affine stratum as an intermediate layer, and finally the Euclidean stratum. The concept of stratification is closely related to the groups of transformations acting on geometric entities: the projective stratum corresponds to projective transformations (homographies), the affine stratum to affine transformations, and the Euclidean stratum to Euclidean transformations.

Suppose that a fixed scene is captured by two or more perspective cameras and that the correspondences between visible points in different images are already given. In practice, however, matching is an essential and extremely challenging problem in computer vision. Here, we suppose that $n$ 3D points $A_{i}$ are observed by $m$ cameras with projection matrices $P_{j},j=1,\ldots ,m.$ Neither the positions of the points nor the projection matrices of the cameras are known; only the projections $a_{ij}$ of the $i^{th}$ point in the $j^{th}$ image are known.

Projective reconstruction

Simple counting indicates that there are $2nm$ independent measurements and only $11m+3n$ unknowns, so the problem should be solvable given enough points and images. In homogeneous coordinates the equations can be written as:

$a_{ij}\sim P_{j}A_{i}\qquad i=1,\ldots ,n,~~j=1,\ldots ,m\qquad (2)$

Equation (2) is unchanged when a nonsingular 4 × 4 transformation $H$ is applied to the projection matrices, $P_{j}$ → $P_{j}H^{-1}$ , and to the world points, $A_{i}$ → $HA_{i}$ . Hence, without further constraints, the reconstruction is determined only up to an unknown projective deformation of the 3D world.
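The counting argument can be made concrete with a short calculation. The sketch below refines the count in the text by subtracting the 15 degrees of freedom of the 4 × 4 transformation H (16 entries, minus one for overall scale), since these can never be determined. This refinement and the function name are assumptions added for illustration. The result is only a necessary condition from counting, not a guarantee that a particular configuration is solvable.

```python
def min_points_for_projective(m):
    """Smallest n with 2*n*m >= 11*m + 3*n - 15, for m >= 2 views.

    2*n*m : measurements (two coordinates per point per image)
    11*m  : unknowns of m projection matrices (3x4, up to scale)
    3*n   : unknowns of n 3D points
    15    : degrees of freedom of the projective ambiguity H
    """
    if m < 2:
        raise ValueError("at least two views are needed")
    n = 1
    while 2 * n * m < 11 * m + 3 * n - 15:
        n += 1
    return n

points_two_views = min_points_for_projective(2)
points_three_views = min_points_for_projective(3)
```

For two views the count gives 7 points, since 4n ≥ 3n + 7, and for three views it gives 6 points, since 6n ≥ 3n + 18.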

Affine reconstruction

See affine space for more detailed information about computing the location of the plane at infinity ${\Pi }_{\infty }$ . The simplest way is to exploit prior knowledge, for example that certain lines in the scene are parallel or that a point lies one third of the way between two others.

Prior constraints on the camera motion can also be used. By analyzing different images of the same point, one can obtain a line in the direction of motion. The intersection of several such lines is the point at infinity in the motion direction, which provides one constraint on the affine structure.

This article uses material from the Wikipedia article 3D_reconstruction_from_multiple_images, and is written by contributors. Text is available under a CC BY-SA 4.0 International License; additional terms may apply. Images, videos and audio are available under their respective licenses.

References

[1] Soltani, A. A., Huang, H., Wu, J., Kulkarni, T. D., & Tenenbaum, J. B. Synthesizing 3D Shapes via Modeling Multi-View Depth Maps and Silhouettes With Deep Generative Networks. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (pp. 1511-1519). GitHub. 6 March 2020.

[2] C. Tomasi and T. Kanade. Shape and motion from image streams under orthography: A factorization approach. International Journal of Computer Vision, 9(2):137-154, 1992.

[3] A. Laurentini. The visual hull concept for silhouette-based image understanding. IEEE Transactions on Pattern Analysis and Machine Intelligence, 16(2):150–162, February 1994. doi:10.1109/34.273735.

[4] R. Mohr and E. Arbogast. It can be done without camera calibration. Pattern Recognition Letters, 12:39-43, 1991.

[5] O. Faugeras. What can be seen in three dimensions with an uncalibrated stereo rig? In Proceedings of the European Conference on Computer Vision, pages 563-578, Santa Margherita L., 1992.

[6] E. Kruppa. Zur Ermittlung eines Objektes aus zwei Perspektiven mit innerer Orientierung. Sitz.-Ber.Akad.Wiss., Wien, math. naturw. Kl., Abt. IIa., 122:1939-1948, 1913.

[7] S. J. Maybank and O. Faugeras. A theory of self-calibration of a moving camera. International Journal of Computer Vision, 8(2):123-151, 1992.

[8] O. Faugeras and S. Maybank. Motion from point matches: multiplicity of solutions. International Journal of Computer Vision, 4(3):225-246, June 1990.

[9] R. I. Hartley. Kruppa's equations derived from the fundamental matrix. IEEE Transactions on Pattern Analysis and Machine Intelligence, 19(2):133-135, February 1997.

[10] M. Pollefeys. Self-calibration and metric 3D reconstruction from uncalibrated image sequences. PhD thesis, ESAT-PSI, KU Leuven, 1999.

[11] R. Hartley and A. Zisserman. Multiple view geometry in computer vision. Cambridge University Press, 2nd edition, 2003.

[12] "Medical Visualization: What is it and what's it for?". GarageFarm. 2018-02-18. Retrieved 2018-02-18.

[13] M. J. Pearcy. Stereo radiography of lumbar spine motion. Acta Orthop Scand Suppl, 1985.

[14] C. E. Aubin, J. Dansereau, F. Parent, H. Labelle and J. A. de Guise. Morphometric evaluations of personalised 3D reconstructions and geometric models of the human spine. Med Biol Eng Comput, 1997.

[15] S. Hosseinian and H. Arefi. 3D Reconstruction from multiview medical X-ray images: Review and evaluation of existing methods (PDF).

[16] S. Laporte, W. Skalli, J. A. de Guise, F. Lavaste and D. Mitton. A biplanar reconstruction method based on 2D and 3D contours: application to distal femur. Comput Methods Biomech Biomed Engin, 6(1):1–6, 2003. doi:10.1080/1025584031000065956. PMID 12623432. S2CID 3206752.

[17] G. Scott Owen. HyperVis. ACM SIGGRAPH Education Committee, the National Science Foundation (DUE-9752398), and the Hypermedia and Visualization Laboratory, Georgia State University.
