[

* vincent.gaudilliere@uni.lu [ gilles.simon@loria.fr [ marie-odile.berger@inria.fr [ [

Abstract

In computer vision, camera pose estimation from correspondences between 3D geometric entities and their projections into the image has been a widely investigated problem. Although most state-of-the-art methods exploit low-level primitives such as points or lines, the emergence of very effective CNN-based object detectors in the recent years has paved the way to the use of higher-level features carrying semantically meaningful information. Pioneering works in that direction have shown that modelling 3D objects by ellipsoids and 2D detections by ellipses offers a convenient manner to link 2D and 3D data. However, the mathematical formalism most often used in the related litterature does not enable to easily distinguish ellipsoids and ellipses from other quadrics and conics, leading to a loss of specificity potentially detrimental in some developments. Moreover, the linearization process of the projection equation creates an over-representation of the camera parameters, also possibly causing an efficiency loss. In this paper, we therefore introduce an ellipsoid-specific theoretical framework and demonstrate its beneficial properties in the context of pose estimation. More precisely, we first show that the proposed formalism enables to reduce the ellipsoid pose estimation problem to a position or orientation-only estimation problem in which the remaining unknowns can be derived in closed-form. Then, we demonstrate that it can be further reduced to a 1 Degree-of-Freedom (1DoF) problem and provide the analytical expression of the pose as a function of that unique scalar unknown. We illustrate our theoretical considerations by visual examples. Finally, we release this work in order to contribute towards more efficient resolutions of ellipsoid-related pose estimation problems.

Pose estimation, Object modeling, Ellipsoid, Ellipse

\jyear

2022

Perspective- $1$ -Ellipsoid]Perspective- $1$ -Ellipsoid: Formulation, Analysis and Solutions of the Ellipsoid Pose Estimation Problem in Euclidean Space

[1,2]\fnmVincent \surGaudillière

2]\fnmGilles \surSimon

2]\fnmMarie-Odile \surBerger

1]\orgdivSnT - Interdisciplinary Centre for Security, Reliability and Trust, \orgnameUniversity of Luxembourg, \orgaddress\street29 Avenue John F. Kennedy, \postcodeL-1855, \cityLuxembourg, \countryLuxembourg

2]\orgnameLoria - Inria - Université de Lorraine, \orgaddress\streetCampus Scientifique, BP 239, \postcodeF-54506, \cityVandoeuvre-lès-Nancy, \countryFrance

1 Introduction

Estimating the relative pose between a camera and a scene has been representing a core aspect of computer vision for many years. Indeed, this task is at the root of many applications, from robot navigation (Bonin-FontOO08) to Augmented Reality (MarchandUS16).

Historically, pose estimation has been addressed by leveraging 2D-3D correspondences between low-level geometric features such as points (P $n$ P: Perspective- $n$ -Point)(LepetitMF09; Hartley2004) or lines (P $n$ L: Perspective- $n$ -Line)(PnL). More recently, the field has been significantly impacted by the raise of deep learning and pose estimation is now widely addressed by end-to-end trainable methods (HoqueAXMW21). However, while deep learning has proven to be indispensable in solving the problem of perception, it is still not the best choice in terms of accuracy throughout all steps of a pose estimation pipeline, as figured out in a recent object pose estimation challenge (KisantalSPIMD20). Indeed, in this challenge, the two most accurate methods were hybrid approaches in which keypoints are located by deep regression models then used as inputs to a P $n$ P-based solver.

Following the same proven hybrid approach, the appearance of very effective object detectors in the recent years (Redmon_2016_CVPR; Redmon_2017_CVPR; YOLOv3; LiuAESRFB16; Girshick_2014_CVPR; Girshick_2015_ICCV) has been enabling the substitution of low-level primitives (e.g. points, lines), often extracted in droves and carrying limited semantic information, by object-level features providing a deeper scene understanding at a lower computational matching cost. Therefore, the choice of object representation has become crucial.

While modeling 3D objects by cuboids along with their 2D projections by bounding boxes (i.e. outputs of most object detectors) has been investigated in the context of pose computation (context_relevance_iros_2017; wide_baseline_2018; LiMD19), it appears that the ellipse-ellipsoid modeling paradigm has the unparalleled advantage of analytically linking 2D and 3D models (Hartley2004; Eberly-backproj). In other words, ellipsoids always project onto planes in the form of ellipses, and the underlying closed-form projection equation can be leveraged to efficiently solve pose estimation problems (Crocco_2016_CVPR; 7919240; IROS; ISMAR; IROS2; RAL). As an indicator of that increasingly attractive research direction, more and more object detectors have been modeling object projections by ellipses instead of traditional bounding boxes whose sides are parallel to image axes (Li19; PanFWZR21; ZhaoJFLY21; ZinsSB20; DongRPI21; abs-2101-05212).

However, performing pose estimation at the level of objects through ellipse/ellipsoid-modeling has mainly been formulated under the standard projective geometry formalism (Hartley2004) (Crocco_2016_CVPR; Gay_2017_ICCV; 7919240; QSLAM; ROB-059; abs-2004-05303; abs-2110-08977), and mostly through least square estimations where the unknowns are general quadric surfaces. This framework may present limitations since ellipses (resp. ellipsoids) are specific categories of conics (quadrics) and since the linearization of the projection equation increases the numbers of apparent unknowns (see Section 2.1). In addition, these papers do not address the case of a small number of ellipse-ellipsoid correspondences, which is of high practical importance when a few objects are observed or when computing the solutions with minimal sampling size in RANSAC algorithms (FischlerB81).

In this paper, we address the fundamental problem of camera pose estimation from one ellipse-ellipsoid correspondence, referred to as Perspective- $1$ -Ellipsoid (P $1$ E) in what follows. Conversely, it consists in reconstructing an ellipsoid of known size from its projected ellipse given the camera intrinsic parameters (ellipsoid pose estimation).

There are several interests in solving the P $1$ E problem. First, on the theoretical side, and except in the case of a spherical object, we demonstrate that the solutions are a variety of dimension 1 and we provide an effective way to reconstruct the camera trajectory (i.e. solutions). This problem has been addressed in WokesP10 in the particular case of a spheroid (specific ellipsoid having an axis of revolution) but was never considered for general ellipsoids. To the best of our knowledge, we are the first to propose a constructive solution to the P $1$ E problem without resorting to any additional approximation nor prior knowledge.

The proposed formalism relies on Cartesian coordinates instead of homogeneous ones (Hartley2004), and this enables us to develop an ellipsoid-specific framework. In this study, we also consider two particular cases of important practical interest: (i) computing the ellipsoid position when the orientation is known (ii) computing the orientation when the position is known.

Besides the theoretical aspects, solving the P $1$ E problem opens the way towards automatic positioning solutions in texture-less or low-textured environments, for instance leveraging several ellipse-ellipsoid correspondences or one with several point pairs. Industrial or other indoor scenes, in which objects would be approximated by ellipsoids, represent concrete places that could take advantage of these results.

The paper is organized as follows: In Section 2.1, we discuss the current State-of-the-Art object-based pose estimation methods and the limits of the homogeneous representation of ellipses and ellipsoids. In Section 3, we present the Euclidean formulation of the ellipsoid pose estimation problem, previously introduced in Eberly-backproj. Sections 4, 5 and 6 contain our core contributions:

In Section 4, we exhibit several mathematical properties of the P $1$ E solutions needed in the demonstrations of sections 5 and 6.
Section 5 is dedicated to the specific case of P $1$ E where either the position or the orientation of the ellipsoid is known. We bring out that the problem formulation induces an inherent decoupling between orientation and position, one of which being possibly inferred in closed-form from the other one. The orientation to position solver was introduced in IROS; ISMAR then leveraged in IROS2; RAL.
The general P $1$ E problem is solved in Section 6. We demonstrate that the 6DoF ellipsoid pose estimation problem can be reduced to a 1DoF problem, and present an effective way to reconstruct the solutions.

2 Related Work

2.1 Quadrics-based Pose Estimation

Most methods proposing to solve pose estimation problems at the level of objects using ellipsoid modeling are based on the projective geometry formalism. Any quadric $Q$ is thus linked to its projected conic $C$ by the Equation

C^{*} = P Q^{*} P^{⊤}

where $P = K [R$ $t]$ is the camera projection matrix (Hartley2004) and $Q^{*}, C^{*}$ are the dual forms of $Q$ and $C$ ( $M^{*} = M^{- 1}$ up to a scale factor for a non-singular symmetric matrix $M$ ). It is important noting that, under that formalism, ellipses and ellipsoids can be distinguished from other conics and quadrics only by certain algebraic conditions on $C$ and $Q$ entries, these conditions being difficult to leverage in practice.

Quadric-modeling of objects has often been implemented in the context of Semantic SLAM (ROB-059) for improving the process accuracy through multi-objective optimization (QSLAM; abs-2110-08977; abs-2004-05303). On a theoretical level, Crocco_2016_CVPR addresses the object-based Structure-from-Motion (SfM) problem and introduces an analytical solution to reconstruct both quadric and affine camera poses. The problem is solved in a least square sense with a matrix $P$ over-represented by a $6 \times 10$ Kronecker product $P \otimes P$ . This work is extended with CAD model priors in Gay_2017_ICCV, while 7919240 present a closed-form solution to the problem of reconstructing a quadric from three calibrated pinhole camera views in which the object projections are detected. However, in this method, nothing ensures that the reconstructed quadric is an ellipsoid, forcing the authors to add a costly post-processing non-linear optimization of the results.

Another limitation while using homogeneous quadrics and over-representation of $P$ appears in P12Q. In this method, the so-called gold-standard algorithm used to retrieve the camera projection matrix from 2D-3D point correspondences (Hartley2004) is adapted to conic-quadric correspondences. To compute $P$ , 12 conic-quadric pairs are required whereas only 6 point pairs are sufficient in the same context.

We argue that these limitations may be due in part to the fact that ellipse and ellipsoid homogeneous formulations are not clearly enough distinguished from other members of their geometric families, and also to a non-minimal representation of the projection matrix. In our paper and to overcome these difficulties, we thus propose an ellipsoid-specific theoretical framework and highlight its advantages.

2.2 Perspective- $1$ -Spheroid

In WokesP10, a comprehensive study of the spheroid pose estimation problem is conducted. To this end, the authors introduce a spheroid-specific parameterization of the problem that enables solving it but prevent from extrapolating the method to the general case of ellipsoids.

In a nutshell, the authors demonstrates that the spheroid pose estimation problem has two distinct solutions. In 6.4, we retrieve the same result by restricting our general formulation to the case of spheroids.

3 Formulation of the Ellipsoid Pose Estimation Problem

3.1 Problem Statement

Following the notations introduced in Eberly-backproj and presented in Fig. 1, we consider an ellipsoid $(C, A)$ defined by Equation

(X - C)^{⊤} A (X - C) = 1,

where $C$ is the center of the ellipsoid, $A$ is a real positive definite matrix characterizing its orientation and size, and $X$ is any point on it.

Given a center of projection $E$ and a projection plane of normal $N$ which does not contain $E$ , the projection of the ellipsoid is an ellipse of center $K$ and of semi-diameters $a$ et $b$ . Its principal directions are represented by unit vectors $U$ and $V$ , such that ${U, V, N}$ is an orthonormal set.

Figure 1: Illustrating the projection plane, projection center, ellipsoid and projected ellipse.

3.1.1 Projection Cone

The projection cone $(E, B)$ refers to the cone of vertex $E$ tangent to the ellipsoid. According to Eberly-backproj, it is characterized by matrix

B d e f = A Δ Δ^{⊤} A - (Δ^{⊤} A Δ - 1) A,

where $Δ = E - C$ , so that the points $X$ belonging to the projection cone are those who satisfy the Equation $(X - E)^{⊤} B (X - E) = 0$ . Note that $B$ is a real, symmetric and invertible matrix which has two eigenvalues of the same sign and the third one of the opposite sign.

3.1.2 Backprojection Cone

The backprojection cone $(E, B^{'})$ refers to the cone generated by the lines passing through $E$ and any point on the ellipse. Eberly shows that it is characterized by matrix

B^{'} d e f = P^{⊤} M P - Q,

where

	$M$	$d e f = U U^{⊤} / a^{2} + V V^{⊤} / b^{2},$
	$W$	$d e f = N / (N \cdot (K - E)),$
	$P$	$d e f = I - (K - E) W^{⊤},$
	$Q$	$d e f = W W^{⊤} .$

Here again, the points $X$ on the backprojection cone are those who meet $(X - E)^{⊤} B^{'} (X - E) = 0$ , while $B^{'}$ shares $B$ properties (real, symmetric, invertible with signature (2,1) or (1,2)).

3.1.3 The Cone Alignment Equation

Given an ellipsoid, a central projection (center and plane) and an ellipse in the projection plane, the ellipse is the projection of the ellipsoid if and only if the projection and backprojection cones are aligned (Eberly-backproj), i.e. if and only if there is a non-zero scalar $σ$ such that $B = σ B^{'}$ :

A Δ Δ^{⊤} A + μ A = σ B^{'},

(1)

where $μ = 1 - Δ^{⊤} A Δ$ .

Note that $μ$ encapsulates the relative configuration between the camera center $E$ and the ellipsoid center $C$ . It is thus negative if $E$ is outside the ellipsoid and positive otherwise.

Equation (1), to whom we refer as the Cone Alignment Equation, encodes the ellipsoid pose estimation problem. An equivalent formulation is given by Equation (1’) (see proof of equivalence in Appendix A):

Δ Δ^{⊤} = A^{- 1} - \frac{μ}{σ} B^{' - 1} .

(1’)

3.2 Pose Problem Analysis

While Equation (1) has been established in the camera coordinate frame, it is also valid in the ellipsoid coordinate frame, where matrix $A_{e l l}$ is diagonal, and in the cone coordinate frame where $B_{c o n e}^{'}$ is diagonal. Since $A$ and $B^{'}$ are symmetric, both can be diagonalized using an orthogonal matrix $P$ such that $P^{- 1} = P^{T}$ . For that reason, Equation (1) remains the same whatever the choice of the coordinate frame (camera, ellipsoid or cone) in which matrices and vectors are expressed. In the following, if there is no restriction on the coordinate frame, we will adopt notations without subscript. Otherwise, subscripts $c a m$ , $c o n e$ or $e l l$ will be used.

One can theoretically distinguish between camera pose estimation, which consists in estimating the pose of the camera with respect to its environment, and its reference frame counterpart, i.e. object pose estimation, which consists in estimating the pose of an object with respect to the camera. Fundamentally, the sought transformations are the inverse of each other thus, in this paper, we may focus on estimating the pose of the ellipsoid or the pose of the camera, according to the most convenient setup for mathematical developments, without loss of generality.

The givens of the problem are the ellipse detected in the image, the camera intrinsic parameters and the ellipsoid size (i.e. lengths of its three radii). Therefore, $B_{c o n e}^{'}$ , $B_{c a m}^{'} =^{c a m} R_{c o n e} B_{c o n e}^{'}^{c a m} R_{c o n e}^{⊤}$ and $A_{e l l}$ are known, as well as $E_{c o n e}$ , $E_{c a m}$ and $C_{e l l}$ (whose entries are all zero). Since expressions of $A$ and $B^{'}$ in specific coordinate frames are known, their eigenvalues are also known. In addition, matrix properties such as trace and determinant, that are fully constrained by the eigenvalues, are also known.

In the paper, we most often work in the camera frame. We thus aim at computing vector $Δ_{c a m} = E_{c a m} - C_{c a m}$ and matrix

	$A_{c a m}$	$=^{c a m} R_{e l l} A_{e l l}^{c a m} R_{e l l}^{⊤}$
		$=^{c a m} R_{e l l} ⎛ ⎜ ⎝ \begin{matrix} 1 / a^{2} & 0 & 0 0 & 1 / b^{2} & 0 0 & 0 & 1 / c^{2} \end{matrix} ⎞ ⎟ ⎠^{c a m} R_{e l l}^{⊤},$

from which we then retrieve the ellipsoid position $C_{c a m}$ and orientation $^{c a m} R_{e l l}$ .

In the other case, we aim at computing $Δ_{e l l}$ and $B_{e l l}^{'}$ to then derive the camera position $E_{e l l}$ and orientation $^{c a m} R_{e l l}^{⊤}$ .

Finally, vector $Δ$ encodes the relative position between the ellipsoid and the camera, while couple ${A, B^{'}}$ characterizes their relative orientation, denoted $^{c a m} R_{e l l}$ . Indeed, its expression in the camera frame is

{^{c a m} R_{e l l} A_{e l l}^{c a m} R_{e l l}^{⊤}, B_{c a m}^{'}},

while its expression in the ellipsoid frame is

{A_{e l l},^{c a m} R_{e l l}^{⊤} B_{c a m}^{'}^{c a m} R_{e l l}},

recalling that $A_{e l l}$ and $B_{c a m}^{'}$ are known.

Solving Equation (1) therefore consists in determining the value(s) of $σ$ and the expressions of $A, B^{'}, Δ$ in a common coordinate frame (camera or ellipsoid).

4 Properties of the Solutions

In this section, we demonstrate some properties of the solutions of (1) and exhibit relationships between the different variables. These results are new except Result 1, which was demonstrated in Eberly-backproj. It must be noted that the positive definite nature of matrix $A$ , i.e. what differentiates an ellipsoid from any other quadric, plays a fundamental role in the demonstrations of these properties.

4.1 Link with a Generalized Eigenvalue Problem

Let $(A, B^{'}, Δ, σ)$ be a set of solutions of Equation (1). Result 1 shows that they are also solutions of a generalized eigenvalue problem (GoluVanl96).

Result 1.

If $A, B^{'}, Δ$ and $σ$ satisfy

A Δ Δ^{⊤} A + (1 - Δ^{⊤} A Δ) A = σ B^{'},

(1)

they also satisfy

A Δ = σ B^{'} Δ .

(2)

Proof.

Let’s right-multiply (1) by $Δ$ . Since $Δ^{⊤} A Δ$ is a scalar, the right hand term can be simplified:

	$σ B^{'} Δ$	$= (A Δ Δ^{⊤} A + (1 - Δ^{⊤} A Δ) A) Δ$
		$= (Δ^{⊤} A Δ) A Δ + A Δ - Δ^{⊤} A Δ A Δ$
		$= A Δ .$

∎

Since $B^{'}$ is invertible, the generalized eigenvectors and eigenvalues of pair ${A, B^{'}}$ are the eigenvectors and eigenvalues of matrix $B^{' - 1} A$ .

4.2 Generalized Eigenvalues of ${A, B^{'}}$

Result 2.

The couple ${A, B^{'}}$ has exactly two distinct generalized eigenvalues, that are non-zero and of opposite signs.

Proof.

The generalized eigenvalues of ${A, B^{'}}$ are non-zero because $B^{' - 1} A$ is not singular.

We can then observe that

Q (x) = μ x^{2} - (μ + 1) σ x + σ^{2}

is an annihilator polynomial of $B^{' - 1} A$ (see proof in Appendix B):

μ (B^{' - 1} A)^{2} - (μ + 1) σ B^{' - 1} A + σ^{2} I = {0}.

(3)

In linear algebra, the minimal polynomial $π (.)$ is defined as the monic annihilator polynomial having the lowest possible degree. It can be shown (lang2002) that (i) $π (.)$ divides any annihilator polynomial and (ii) the roots of $π (.)$ are identical to the roots of the characteristic polynomial. Since $Q$ is an annihilator polynomial of degree 2, we can thus infer that $B^{' - 1} A$ , and thus ${A, B^{'}}$ , has at most two distinct eigenvalues.

We are now going to prove by contradiction that ${A, B^{'}}$ has exactly two distinct eigenvalues. Let’s thus assume that the couple has only one eigenvalue with multiplicity 3 denoted $σ_{0}$ .

Since $A$ is positive definite and $B^{'}$ is symmetric, the couple ${A, B^{'}}$ has the following properties (GoluVanl96) (Corollary 8.7.2, p. 462):

their generalized eigenvalues are real,
their reducing subspaces are of the same dimension as the multiplicity of the associated eigenvalues,
their generalized eigenvectors form a basis of $R^{3}$ , and those with distinct eigenvalues are $A$ -orthogonal.

According to property 2. above, we have

d i m (K e r (A - σ_{0} B^{'})) = 3,

i.e.

A = σ_{0} B^{'},

which is impossible because $A$ represents an ellipsoid whereas $B^{'}$ represents a cone. So ${A, B^{'}}$ has exactly two distinct generalized eigenvalues.

Let’s then denote $σ_{1}$ (multiplicity 1) and $σ_{2}$ (multiplicity 2) these two eigenvalues. Observing that $\frac{1}{σ_{1}}$ and $\frac{1}{σ_{2}}$ are the generalized eigenvalues of ${B^{'}, A}$ , we can write, according to minimax (Theorem 3)

\begin{matrix} \forall X & \in R^{3} ∖ {0}, m i n (\frac{1}{σ_{1}}, \frac{1}{σ_{2}}) \leq \frac{X^{⊤} B^{'} X}{X^{⊤} A X} \leq m a x (\frac{1}{σ_{1}}, \frac{1}{σ_{2}}) \end{matrix}

If $σ_{1}$ and $σ_{2}$ were of the same sign, then $\forall X \in R^{3} ∖ {0}$ , $X^{⊤} B^{'} X$ would be of that sign (since $X^{⊤} A X > 0$ ). Yet, it is impossible since $B^{'}$ is neither positive nor negative definite (cone). We thus conclude that the two distinct eigenvalues are of opposite signs. ∎

4.3 Characterization of $σ$

Let’s denote $σ_{1}$ (multiplicity 1) and $σ_{2}$ (multiplicity 2) the two generalized eigenvalues of ${A, B^{'}}$ .

Result 3.

$σ$ is the generalized eigenvalue of ${A, B^{'}}$ with multiplicity 1:

σ = σ_{1} .

(4)

Proof.

Let’s consider $(σ_{1}, δ_{1})$ and $(σ_{2}, δ_{2})$ the generalized eigenvalues and eigenvectors of ${A, B^{'}}$ , such that $∥ δ_{i} ∥ = 1$ .

We are going to prove (4) by contradiction.

Let’s suppose that there is $k \in R^{*}$ such that $(A, σ_{2}, k δ_{2})$ is solution of Equation (1).

By injecting these values into (1), we therefore have

B^{'} - \frac{1}{σ_{2}} A = M A,

where

M = \frac{k^{2}}{σ_{2}} (A δ_{2} δ_{2}^{⊤} - δ_{2}^{⊤} A δ_{2} I) .

According to property 2 of the proof of Result 2,

d i m (K e r (B^{'} - \frac{1}{σ_{2}} A)) = 2,

whence, since A is invertible,

d i m (K e r (M A)) = d i m (K e r (M)) = 2 .

However, defining

δ_{2}^{⊥} = {X \in R^{3} / X ⊥ δ_{2}}

the subspace of dimension 2 orthogonal to $δ_{2}$ , we observe that, $\forall X \in δ_{2}^{⊥}$ ,

	$M X$	$= \frac{k^{2}}{σ_{2}} A δ_{2} δ_{2}^{⊤} X - \frac{k^{2}}{σ_{2}} δ_{2}^{⊤} A δ_{2} X$
		$= \frac{k^{2}}{σ_{2}} A δ_{2} (δ_{2} \cdot X) - \frac{k^{2}}{σ_{2}} δ_{2}^{⊤} A δ_{2} X$
		$= - \frac{k^{2}}{σ_{2}} δ_{2}^{⊤} A δ_{2} X .$

Since $A$ is positive definite, $δ_{2}^{⊤} A δ_{2} > 0$ , whence it comes

\forall X \in δ_{2}^{⊥} ∖ {0}, % M X \neq 0 .

It means that

δ_{2}^{⊥} \cap K e r (M) = {0},

whence the direct sum

δ_{2}^{⊥} ⨁ K e r (M)

is a subspace of $R^{3}$ of dimension

d i m (δ_{2}^{⊥}) + d i m (K e r (M)) = 2 + 2 = 4 .

We end up with a contradiction since $4 > d i m (R^{3}) = 3$ .

As a result, triplets $(A, σ_{2}, k δ_{2})$ cannot be solutions of (1), thus solutions are necessarily in the form $(A, σ_{1}, k δ_{1})$ , where $k \in R^{*}$ . ∎

4.4 Characterizations of $μ$

Result 4 demonstrates that the secondary scalar variable $μ$ is also closely linked to the generalized eigenvalues of ${A, B^{'}}$ .

Result 4.

$μ$ is equal to the ratio between the two generalized eigenvalues of ${A, B^{'}}$ :

μ = \frac{σ_{1}}{σ_{2}} .

(5)

Proof.

Trace of $B^{' - 1} A$ is given by its eigenvalues:

t r (B^{' - 1} A) = σ_{1} + 2 σ_{2} .

Whence, by squaring the matrix,

t r ((B^{' - 1} A)^{2}) = σ_{1}^{2} + 2 σ_{2}^{2} .

Therefore, since $t r (I) = 3$ and $σ = σ_{1}$ , applying the operator $t r ()$ to Equation (3) leads to

μ (σ_{1}^{2} + 2 σ_{2}^{2}) - σ_{1} (μ + 1) (σ_{1} + 2 σ_{2}) + 3 σ_{1}^{2} = 0,

which is equivalent to

μ σ_{2} (σ_{2} - σ_{1}) = σ_{1} (σ_{2} - σ_{1}),

i.e.

μ = \frac{σ_{1}}{σ_{2}} .

∎

Since $σ_{1}$ and $σ_{2}$ are of opposite signs, Equation (5) shows that $μ < 0$ .

Result 5 now highlights the connection between $μ$ and $σ$ .

Result 5.

$μ$ and $σ$ are linked through Equation (6):

μ = - \sqrt{\frac{d e t (B^{'})}{d e t (A)} σ^{3}} .

(6)

Proof.

Determinant of $B^{' - 1} A$ is given by its eigenvalues:

d e t (B^{' - 1} A) = σ_{1} σ_{2}^{2},

i.e.

\frac{d e t (A)}{d e t (B^{'})} = σ_{1} σ_{2}^{2} .

(7)

One obtains (6) by injecting (5) into (7) and using $μ < 0$ . ∎

4.5 Link between $σ$ and $∥ Δ ∥$

Result 6.

The scalar $σ$ and the camera-ellipsoid distance $∥ Δ ∥$ are linked through Equation (8):

{(t r (B^{' - 1}))}^{2} σ = \frac{d e t (A)}{d e t (B^{'})} {(t r (A^{- 1}) - ∥ Δ ∥^{2})}^{2} .

(8)

Proof.

Injecting Equations (4) and (5) into (1’) leads to

Δ Δ^{⊤} = A^{- 1} - \frac{1}{σ_{2}} B^{' - 1} .

Applying $t r ()$ then squaring:

\frac{1}{σ_{2}^{2}} {(t r (B^{' - 1}))}^{2} = {(t r (A^{- 1}) - ∥ Δ ∥^{2})}^{2} .

(9)

Furthermore, injecting (4) into (7) leads to the following expression for $\frac{1}{σ_{2}^{2}}$ :

\frac{1}{σ_{2}^{2}} = \frac{d e t (B^{'})}{d e t (A)} σ .

(10)

Equation (8) is then obtained by injecting (10) into (9). ∎

5 Decoupling between Orientation and Position

In this section, we consider two sub-problems of significant practical interest: (i) computing the ellipsoid position when the orientation is known and (ii) computing the orientation when the position is known. We demonstrate that the position can be inferred in closed-form from the orientation (Section 5.1, Result 7), while the latter can be analytically derived from the former up to the ellipsoid symmetries (Section 5.2, Result 8). We assimilate these properties to a decoupling phenomenon between orientation and position.

5.1 Position from Orientation

In this case, $A_{c a m}$ or $B_{e l l}^{'}$ is known. Since $A_{e l l}$ and $B_{c a m}^{'}$ are also known, eigenvalues of ${A, B^{'}}$ , $σ$ (using Result 3) and $μ$ (Result 4) can be retrieved. Result 7 then provides that $Δ$ is unique and fully determined.

Result 7.

Assuming that the relative camera-ellipsoid orientation is known, their relative position is given by

Δ = k δ_{1},

(11)

where

⎧ ⎪ ⎪ ⎪ ⎪ ⎪ ⎪ ⎨ ⎪ ⎪ ⎪ ⎪ ⎪ ⎪ ⎩ \begin{matrix} δ_{1} is a unit % generalized eigenvector of {A, B^{'}} corresponding to σ_{1}, δ_{1} \cdot N < 0, k = \sqrt{t r (A^{- 1}) - \frac{1}{σ_{2}} t r (B^{' - 1})} . \end{matrix}

Proof.

Injecting Equations (4), (5) and $Δ = k δ_{1}$ into Equation (1’) leads to

k^{2} δ_{1} δ_{1}^{⊤} = A^{- 1} - \frac{1}{} σ_{2} B^{' - 1} .

Whence, by applying $t r ()$ ,

k^{2} ∥ δ_{1} ∥^{2} = k^{2} = t r (A^{- 1}) - \frac{1}{σ_{2}} t r (B^{' - 1}) .

The two vectors $\pm \sqrt{k^{2}} δ_{1}$ define ellipsoid centers $C$ that are symetric with respect to the camera center $E$ . The only one that satisfy the chirality constraint (ellipsoid located in front of the camera) is the one whose dot product with vector $N$ is negative (see Fig. 1). ∎

Result 7 highlights the fact that solving Equation (1) may consist only in determining $A_{c a m}$ or $B_{e l l}^{'}$ . Indeed, $σ$ and $Δ$ can then be uniquely derived. In other words, the relative position is fully constrained by the orientation.

As mentioned above, this result is of high practical interest. Indeed, getting the camera orientation, e.g. from physical sensors or image analysis, is usually easier than getting the camera position, and especially indoors where the GPS is useless. In a multi-object scene, the fact that only one ellipse-ellipsoid association is needed to compute the pose allows using a RANSAC-like strategy with low combinatorial cost both to detect the wrong associations and to choose the correct one when a label is shared by several objects.

A re-localization algorithm based on this strategy was presented in (ISMAR). The system operates in real time from YOLO detections (YOLOv3) and IMU data or vanishing points – both methods were assessed. Figure 2 shows a few qualitative results obtained with images from the standard RGB-D TUM dataset (SturmEEBC12). Quantitative results as well as a detailed analysis of the advantages and limitations of this algorithm can be found in (ISMAR). Other applications of Result 7 are presented in (IROS; IROS2; RAL).

Figure 2: Camera relocalization based on the decoupling between orientation and position, here applied to images from the RGB-D TUM dataset (ISMAR). The first row shows the detected boxes, the inscribed ellipsoids (in yellow) and the outlines of the reprojected ellipsoids (in green), with the automatically generated labels. The blue ellipses correspond to the reprojected ellipsoids when the QuadricSLAM residual error is minimized (QSLAM). The second row shows the reprojected ellipsoids (in green the inliers, in white the outliers and undetected ellipsoids) when using the method in (ISMAR).

5.2 Orientation from Position

In this case $Δ_{c a m}$ or $Δ_{e l l}$ is known. $A_{c a m}$ and $B_{e l l}^{'}$ are unknown, but since their eigenvalues are known, $d e t (A)$ , $t r (A^{- 1})$ , $d e t (B^{'})$ and $t r (B^{' - 1})$ are known. $σ$ can thus be deduced from Result 6 and $μ$ from Result 5. Result 8 then explains how to retrieve $A_{c a m}$ or $B_{e l l}^{'}$ and how to derive the orientation in closed-form, up to the cone or ellipsoid symmetries.

Result 8.

Assuming that the relative camera-ellipsoid position is known, their relative orientation is given by eigenvectors of

B_{e l l}^{'} = \frac{1}{σ} (A_{e l l} Δ_{e l l} Δ_{e l l}^{⊤} A_{e l l} + μ A_{e l l})

(1)

in the ellipsoid reference frame, or of

A_{c a m} = \frac{σ}{μ} (B_{c a m}^{'} - σ B_{c a m}^{'} Δ_{c a m} Δ_{c a m}^{⊤} B_{c a m}^{'})

(12)

in the camera reference frame.

Proof.

Injecting (2) into (1) leads to

σ^{2} B^{'} Δ Δ^{⊤} B^{'} + μ A = σ B^{'},

whence (12) by isolating $A$ . ∎

Once for instance $A_{c a m}$ is retrieved, one can compute its eigenvalue decomposition:

A_{c a m} =^{c a m} R_{e l l} ⎛ ⎜ ⎝ \begin{matrix} 1 / a^{2} & 0 & 0 0 & 1 / b^{2} & 0 0 & 0 & 1 / c^{2} \end{matrix} ⎞ ⎟ ⎠^{c a m} R_{e l l}^{⊤} .

For a triaxial ellipsoid ( $a \neq b \neq c$ ), there are 4 solutions for $^{c a m} R_{e l l}$ . Therefore, the ellipsoid orientation can be analytically derived from its position up to the ellipsoid symmetries.

6 Closed-form Solutions

In this section, we introduce the core contribution of the paper, that is closed-form solutions to the general P $1$ E problem. Explicit 1DoF solutions are provided based on the fact that the ellipsoid is triaxial (all $A$ eigenvalues are different) or not, and the cone is circular (two $B^{'}$ eigenvalues are equal) or not.

In Section 6.1, we first present the different types of ellipsoids and cones along with their possible co-occurences. An overview of the solutions is given in Section 6.2. In Section 6.3, we consider the case of a triaxial ellipsoid and present for the first time a Necessary and Sufficient Condition (NSC) on $σ$ to be solution of Equation (1). Then we derive the analytical expressions of the other variables as functions of $σ$ . In Section 6.4, we address the case of the spheroid (ellipsoid with an axis of revolution). That part enables to retrieve, from another formalism, the results presented in WokesP10. In Section 6.5, we finally present the solutions for the sphere.

In what follows, the problem is solved either in the ellipsoid or in the cone coordinate frame. In brief, the choice is linked to the ability to define a frame associated to the considered structure without ambiguities. The case of the triaxial ellipsoid can thus be addressed in both frames since the two structures are unambiguous. The case of the spheroid is different, and depending on the properties of the cone, solutions are derived in one or the other frame.

6.1 Preliminaries: Co-occurences of Ellipsoid and Cone Types

In this paper, we address the full ellipsoid pose estimation problem, i.e. we cover every possible types of ellipsoids and thus cones (see Appendix D). However, a specific type of ellipsoid cannot necessarily be tangent to any type of cone, what we refer to as possible or impossible co-occurence.

Obviously, only circular cones can be tangent to a sphere. Furthermore, we are going to prove that only a non-circular elliptic cone can be tangent to a triaxial ellipsoid.

Let’s prove it by contradiction, and assume that the projection cone has a revolution axis (circular cone).

Let’s also assume that the ellipsoid center $C$ does not belong to that axis. Since the ellipsoid is tangent to the cone, any new ellipsoid obtained by rotating the original one around the cone revolution axis shall still be tangent to the cone, thus be solution of (1). Yet, in this case, the locus of ellipsoid centers would be a circle located in a plane orthogonal to that axis and whose center would belong to it. Every center would thus be at a fixed distance to the cone vertex $E$ , whence there would be an infinite number of $Δ$ solutions for the same $σ$ , given (8). However, this contradicts Equation (19). Therefore, the center of the ellipsoid must belong to the revolution axis of the cone.

If the ellipsoid center belonged to the cone revolution axis, then $Δ$ would be parallel to that axis, i.e. would be an eigenvector of $B^{'}$ , whence of $A$ given (1’). However, in such a case, the symmetries of the cone-ellipsoid pair would impose that the tangent ellipse (intersection between the ellipsoid and the polar plane derived from $E$ (Wylie)) belongs to a plane orthogonal to the cone revolution axis, that is also a principal axis of the ellipsoid. Therefore, that tangent ellipse should be both a circle (orthogonal section of a circular cone) and a non-circular ellipse (section of an ellipsoid by a plane parallel to one of its principal planes), which is impossible. Therefore, the cone cannot have a revolution axis.

In brief, Table 1 summarizes the possible and impossible co-occurences between ellipsoid and cone types.

		Ellipsoid
		Triaxial	Spheroid	Sphere
Projection cone	Non-circular	✓	✓	$\times$
Projection cone	Circular	$\times$	✓	✓

Table 1: Possible co-occurences of ellipsoids and projection cones according to their types. ✓ indicates that ellipsoid and projection cone of the corresponding types may occur simultaneously.

\times

indicates that they cannot.

6.2 Overview of the Solutions

In the rest of this section, we determine the solutions of the Cone Alignment Equation (1), and derive the camera-ellipsoid relative poses. To this end, we distinguish between the three different types of ellipsoids (triaxial, spheroid, sphere). We demonstrate, in particular, that

there is an infinite number of triaxial fixed-size ellipsoids that are tangent to a given backprojection cone (Fig. 2),
as already demonstrated in WokesP10, there are only two fixed-size spheroids solutions (see Fig. 4).

In the first case, the infinite number of ellipsoids tangent to the cone (or, conversely, the infinite number of cones tangent to the ellipsoid) explains the infinite number of camera solutions (see Fig. 5), and provides a parameterization of them. In the second case, the infinite number of change of basis matrices between the spheroid and the camera explains the infinite number of camera solutions (see Fig. 6). The mathematical developments leading to these results are presented below.

Figure 3: Loci of the centers (black) and principal axes endpoints (red, green, blue) of the ellipsoids solutions. A video is available²²footnotemark: 2.

Figure 4: The two spheroids solutions with a non-circular backprojection cone.

Figure 5: Triaxial ellipsoid: locus of cone vertices i.e. camera centers.

Figure 6: Spheroid: locus of cone vertices i.e. camera centers.

6.3 The Triaxial Ellipsoid

6.3.1 Solving for $σ$

In practice, not all $σ$ values give rise to a solution of the problem. When the ellipsoid has three distinct radii (triaxial), Theorem 1 provides a characterization of the scalars $σ$ solutions of (1).

Theorem 1.

Let’s denote $(λ_{A, 1}, λ_{A, 2}, λ_{A, 3})$ the three distinct eigenvalues of $A$ . Then $σ = d m^{2}$ is solution of Equation (1) if and only if the three entries of vector $M_{A}^{- 1} V (m)$ are all non-negative:

M_{A} = ⎛ ⎜ ⎜ ⎝ \begin{matrix} 1 & 1 & 1 λ_{A, 1} & λ_{A, 2} & λ_{A, 3} λ_{A, 1}^{2} & λ_{A, 2}^{2} & λ_{A, 3}^{2} \end{matrix} ⎞ ⎟ ⎟ ⎠,

(13)

V (m) = ⎛ ⎜ ⎜ ⎜ ⎝ \begin{matrix} t r (A^{- 1}) - \frac{t r (B^{' - 1})}{d} m 1 - m^{3} t r (B^{'}) d m^{2} - t r (A) m^{3} \end{matrix} ⎞ ⎟ ⎟ ⎟ ⎠,

(14)

with

	$d$	$= \sqrt[3]{\frac{d e t (A)}{d e t (B^{'})}},$		(15)
	$m$	$= \sqrt[3]{μ} .$		(16)

It must be noted that $m$ is the only unknown parameter of vector V(m) since all the other ones derive from $A$ and $B^{'}$ eigenvalues.

Proof.

$⟹$ Let’s assume that Equation (1’) is satisfied:

A Δ Δ^{⊤} A + μ A = σ B^{'} .

(1)

Therefore, equivalent Equation (1’) is also satisfied:

Δ Δ^{⊤} = A^{- 1} - \frac{μ}{σ} B^{' - 1} .

(1’)

Since the trace of a product of matrices does not depend on the order of the matrices, we have

	$t r (A Δ Δ^{⊤} A)$	$= t r ((Δ^{⊤} A) (A Δ))$
		$= t r (Δ^{⊤} A^{2} Δ)$
		$= Δ^{⊤} A^{2} Δ % {(scalar)},$

and, similarly

t r (Δ Δ^{⊤}) = Δ^{⊤} Δ .

Given, in addition, $μ = 1 - Δ^{⊤} A Δ$ , applying $t r ()$ to Equations (1’) and (1) leads to the following system:

⎧ ⎪ ⎪ ⎨ ⎪ ⎪ ⎩ \begin{matrix} Δ^{⊤} Δ = t r (A^{- 1}) - \frac{μ}{σ} t r (B^{' - 1}) Δ^{⊤} A Δ = 1 - μ Δ^{⊤} A^{2} Δ = σ t r (B^{'}) - μ t r (A) . \end{matrix}

Although the two scalar unknowns $μ$ and $σ$ appear in the right hand side, they can be expressed as functions of a third unknown. Indeed, denoting

m = \sqrt[3]{μ} {(unknown)},

and

d = \sqrt[3]{\frac{d e t (A)}{d e t (B^{'})}} {(known)},

Equation (6) can be rewritten

m^{3} = - \sqrt{\frac{σ^{3}}{d^{3}}},

whence, by raising it to the power of $2 / 3$ ,

σ = d m^{2} .

(17)

Furthermore, given

μ = m^{3} {(definition)},

we have

\frac{μ}{σ} = \frac{m}{d} .

Therefore, $Δ$ is solution of the following system with unknown $m$ :

⎧ ⎪ ⎪ ⎪ ⎨ ⎪ ⎪ ⎪ ⎩ \begin{matrix} Δ^{⊤} Δ = t r (A^{- 1}) - \frac{t r (B^{' - 1})}{d} m Δ^{⊤} A Δ = 1 - m^{3} Δ^{⊤} A^{2} Δ = t r (B^{'}) d m^{2} - t r (A) m^{3} . \end{matrix}

(18)

The above equations are independent from the considered coordinate frame. Considering the ellipsoid frame and denoting $(Δ_{e l l, x}, Δ_{e l l, y}, Δ_{e l l, z})^{⊤}$ the corresponding expression of $Δ$ , the above system can be rewritten:

i.e.

M_{A} ⎛ ⎜ ⎜ ⎜ ⎝ \begin{matrix} Δ_{e l l, x}^{2} Δ_{e l l, y}^{2} Δ_{e l l, z}^{2} \end{matrix} ⎞ ⎟ ⎟ ⎟ ⎠ = V (m) .

Since $A$ eigenvalues are all different (triaxial ellipsoid), Vandermonde matrix $M_{A}$ is not singular (cf Appendix C). Therefore, the system can be inverted:

⎛ ⎜ ⎜ ⎜ ⎝ \begin{matrix} Δ_{e l l, x}^{2} Δ_{e l l, y}^{2} Δ_{e l l, z}^{2} \end{matrix} ⎞ ⎟ ⎟ ⎟ ⎠ = M_{A}^{- 1} V (m) .

(19)

Left hand side elements are all non-negative, whence the result.

$⟸$ Let’s now assume that the three entries of $M_{A}^{- 1} V (m)$ are all non-negative.

Let $Δ_{e l l} = (Δ_{e l l, x}, Δ_{e l l, y}, Δ_{e l l, z})^{⊤}$ be a vector such that

⎛ ⎜ ⎜ ⎜ ⎝ \begin{matrix} Δ_{e l l, x}^{2} Δ_{e l l, y}^{2} Δ_{e l l, z}^{2} \end{matrix} ⎞ ⎟ ⎟ ⎟ ⎠ = M_{A}^{- 1} V (m) .

Such a definition is possible due to the positivity of the three entries.

One can therefore demonstrate, with the help of a formal calculus software (the corresponding Maple code is provided in Appendix E as reference), that, irrespective of the sign assumptions made for $Δ_{e l l}$ entries, the matrix

\frac{σ}{μ} (A_{e l l}^{- 1} - Δ_{e l l} Δ_{e l l}^{⊤}) = \frac{d}{m} (A_{e l l}^{- 1} - Δ_{e l l} Δ_{e l l}^{⊤})

has the same eigenvalues with same multiplicities as $B^{' - 1}$ . Given that both matrices are diagonalizable (since symmetric), this amounts to say that they are similar, and thus that Equation (1’) is satisfied. ∎

6.3.2 Solving for Camera Poses

Theorem 2.

Considering a triaxial ellipsoid, each $σ$ solution of (1) gives rise to eight backprojection cones $(E, B^{'})$ tangent to the ellipsoid. These cones are symmetric with respect to the three principal planes of the ellipsoid (see Fig. 7).

In addition, each backprojection cone defines two camera solutions (see Fig. 8).

Figure 7: Illustrating the eight backprojection cones tangent to the triaxial ellipsoid for a given $σ$ value.

Proof.

Theorem 1 provides a NSC on $σ$ to be solution of (1). Moreover, its proof exhibits that vectors $Δ$ solutions are expressed in the ellipsoid frame in the form

Δ_{e l l} = ⎛ ⎜ ⎜ ⎜ ⎜ ⎜ ⎜ ⎝ \begin{matrix} \pm \sqrt{Δ_{e l l, x}^{2}} \pm \sqrt{Δ_{e l l, y}^{2}} \pm \sqrt{Δ_{e l l, z}^{2}} \end{matrix} ⎞ ⎟ ⎟ ⎟ ⎟ ⎟ ⎟ ⎠

(20)

where

⎛ ⎜ ⎜ ⎜ ⎝ \begin{matrix} Δ_{e l l, x}^{2} Δ_{e l l, y}^{2} Δ_{e l l, z}^{2} \end{matrix} ⎞ ⎟ ⎟ ⎟ ⎠ = M_{A}^{- 1} V (m)

There are thus eight vectors $Δ_{e l l}$ solutions for a given $m$ (thus $σ$ ), and they are symmetric with respect to the three principal planes of the ellipsoid.

The cone vertices (i.e. camera positions) can then be derived:

E_{e l l} = C_{e l l} + Δ_{e l l} = Δ_{e l l} .

Figure 8: Illustrating the two cameras compatible with each backprojection cone tangent to the triaxial ellipsoid.

Let us now solve for camera orientations. Equation (1) provides the expression of $B_{e l l}^{'}$ :

B_{e l l}^{'} = \frac{1}{d m^{2}} A_{e l l} Δ_{e l l} Δ_{e l l}^{⊤} A_{e l l} + \frac{m}{d} A_{e l l}

(21)

Orientations $^{c a m} R_{e l l}^{⊤}$ of the cameras then verify:

B_{e l l}^{'} =^{c a m} R_{e l l}^{⊤} B_{c a m}^{'}^{c a m} R_{e l l} .

Since the cone is non-circular (see Section 6.1), $B_{e l l}^{'}$ and $B_{c a m}^{'}$ eigenvectors are defined with minimum ambiguity. By arbitrarily fixing the directions of $B_{e l l}^{'}$ eigenvectors for instance, it then remains four ways of choosing the directions of $B_{c a m}^{'}$ eigenvectors so that the change of basis matrix $^{c a m} R_{e l l}$ is a rotation matrix. Yet, over the four resulting orientations, only two leads to an ellipsoid located in front of the camera.

∎

To summarize, to each $σ$ corresponds sixteen camera poses covering eight different positions (see Fig. 8).

6.4 The Spheroid

When the ellipsoid has a revolution axis (i.e. spheroid), we use a different approach since Vandermonde matrix $M_{A}$ is now singular and thus cannot be inverted. We then determine the set of spheroids tangent to the backprojection cone, and distinguish between the two possible types of cone. It is worth noting that this problem has already been addressed in WokesP10 using a different parameterization. The authors especially show that in the general case (non-circular elliptic cone), there are only two tangent spheroids, and we retrieve this result below.

6.4.1 The Non-circular Elliptic Cone

Let us first consider a non-circular elliptic cone. Expressing the Cone Alignment Equation in the cone coordinate frame, and given that the three $B^{'}$ eigenvalues are different, $σ$ solutions can be characterized in a similar way to Theorem 1.

Result 9.

Let’s denote $(λ_{B^{'}, 1}, λ_{B^{'}, 2}, λ_{B^{'}, 3})$ the three distinct eigenvalues of $B^{'}$ . Then $σ = d m^{2}$ is solution of Equation (1) if and only if the three entries of vector $M_{B^{'}}^{- 1} V^{'} (m)$ are all non-negative:

M_{B^{'}} = ⎛ ⎜ ⎜ ⎝ \begin{matrix} 1 & 1 & 1 λ_{B^{'}, 1} & λ_{B^{'}, 2} & λ_{B^{'}, 3} λ_{B^{'}, 1}^{2} & λ_{B^{'}, 2}^{2} & λ_{B^{'}, 3}^{2} \end{matrix} ⎞ ⎟ ⎟ ⎠,

(22)

V^{'} (m) = ⎛ ⎜ ⎜ ⎜ ⎝ \begin{matrix} 1 & 0 & 0 0 & \frac{1}{d m^{2}} & 0 0 & 0 & \frac{1}{d^{2} m^{4}} \end{matrix} ⎞ ⎟ ⎟ ⎟ ⎠ V (m) .

(23)

Proof.

The proof is based on the exact same arguments as the proof of Theorem 1. In particular, $M_{B^{'}}^{- 1} V^{'} (m)$ is related to the expression $Δ_{c o n e}$ of $Δ$ in the cone frame:

⎛ ⎜ ⎜ ⎜ ⎝ \begin{matrix} Δ_{c o n e, x}^{2} Δ_{c o n e, y}^{2} Δ_{c o n e, z}^{2} \end{matrix} ⎞ ⎟ ⎟ ⎟ ⎠ = M_{B^{'}}^{- 1} V^{'} (m)

∎

It is interesting noting that the above result is also valid in the case of a triaxial ellipsoid since the cone is then non-circular (Section 6.1). It can therefore be used to reconstruct the ellipsoids in the camera coordinate frame (see Fig. 2).

Unlike the triaxial ellipsoid for which there is an infinite number of $σ$ solutions, each one giving rise to a fixed number (16) of camera poses, we demonstrate in Theorem 3 that there is only one $σ$ solution for the spheroid, which gives rise to an infinite number of camera poses.

Let’s consider $λ_{A, s i n g l e}$ (multiplicity 1) and $λ_{A, d o u b l e}$ (multiplicity 2) the eigenvalues of $A$ . Let’s also consider $(λ_{B^{'}, 1}, λ_{B^{'}, 2}, λ_{B^{'}, 3})$ the eigenvalues of $B^{'}$ , where $λ_{B^{'}, 1}$ and $λ_{B^{'}, 2}$ have the same sign (opposed to the sign of $λ_{B^{'}, 3}$ ). Finally, let’s assume, even if it means exchanging the roles, that $| λ_{B^{'} 1} | > | λ_{B^{'} 2} |$ .

Theorem 3.

Considering a spheroid along with a non-circular backprojection cone, there is only one $σ$ value solution of Equation (1):

(24)

That $σ$ value gives rise to two spheroids tangent to the cone, that are symmetric with respect to one of the cone principal planes (see Fig. 4).

Proof.

According to Theorem 9, $σ = d m^{2}$ is solution of (1) if and only if the three entries of the following vector are all non-negative:

⎛ ⎜ ⎜ ⎜ ⎝ \begin{matrix} Δ_{c o n e, x}^{2} (m) Δ_{c o n e, y}^{2} (m) Δ_{c o n e, z}^{2} (m) \end{matrix} ⎞ ⎟ ⎟ ⎟ ⎠ = M_{B^{'}}^{- 1} V^{'} (m) .

Yet, developing the right hand term leads to the following system:

⎧ ⎪ ⎪ ⎪ ⎪ ⎪ ⎪ ⎪ ⎪ ⎪ ⎪ ⎨ ⎪ ⎪ ⎪ ⎪ ⎪ ⎪ ⎪ ⎪ ⎪ ⎪ ⎩ \begin{matrix} Δ_{c o n e, x}^{2} (m) = \frac{1}{m^{2}} P_{1} (m) Δ_{c o n e, y}^{2} (m) = \frac{1}{m^{2}} P_{2} (m) Δ_{c o n e, z}^{2} (m) = \frac{1}{m^{2}} P_{3} (m) \end{matrix}

where

and where

⎧ ⎪ ⎪ ⎪ ⎪ ⎪ ⎪ ⎪ ⎪ ⎪ ⎪ ⎪ ⎪ ⎪ ⎪ ⎨ ⎪ ⎪ ⎪ ⎪ ⎪ ⎪ ⎪ ⎪ ⎪ ⎪ ⎪ ⎪ ⎪ ⎪ ⎩ \begin{matrix} k_{1} = \frac{- λ_{B^{'}, 2} λ_{B^{'}, 3}}{λ_{B^{'}, 1} (λ_{B^{'}, 1} - λ_{B^{'}, 2}) (λ_{B^{'}, 1} - λ_{B^{'}, 3}) d} k_{2} = \frac{- λ_{B^{'}, 1} λ_{B^{'}, 3}}{λ_{B^{'}, 2} (λ_{B^{'}, 2} - λ_{B^{'}, 1}) (λ_{B^{'}, 2} - λ_{B^{'}, 3}) d} k_{3} = \frac{- λ_{B^{'}, 1} λ_{B^{'}, 2}}{λ_{B^{'}, 3} (λ_{B^{'}, 3} - λ_{B^{'}, 1}) (λ_{B^{'}, 3} - λ_{B^{'}, 2}) d} \end{matrix}

The locus of scalars $m$ solutions is the subset of $R$ on which $P_{1} (x)$ , $P_{2} (x)$ and $P_{3} (x)$ are all non-negative.

To study the variations of these three polynomials, four cases–described in Table 2 and depending on the relative order of $A$ and $B^{'}$ eigenvalues–need to be considered. Only case #1 is addressed below, since other cases can be solved using a similar reasoning.

	$λ_{B^{'}, 2} < 0$	$λ_{B^{'}, 2} > 0$
$λ_{A, s i n g l e} < λ_{A, d o u b l e}$	#1	#2
$λ_{A, s i n g l e} > λ_{A, d o u b l e}$	#3	#4

Table 2: Configurations of the problem depending on

A

and

B^{'}

eigenvalues.

Let’s denote $S_{i}$ the root of $P_{i} (x)$ with multiplicity 1, and $D_{i}$ the root with multiplicity 2, such that

S_{i} = \frac{λ_{B^{'}, i}}{λ_{A, s i n g l e}} d and D_{i} = \frac{λ_{B^{'}, i}}{λ_{A, d o u b l e}} d .

In configuration #1, studying the variations of every $P_{i}$ leads to only one solution to obtain simultaneous non-negative values, which is the root of $P_{1}$ with multiplicity 2: $x = D_{1}$ (the proof is given in Appendix G).

The unique $σ$ solution is then given by

	$σ$	$= d D_{1}^{2}$
		$= \frac{λ_{A, s i n g l e} λ_{B^{'}, 1}}{λ_{B^{'}, 2} λ_{B^{'}, 3}},$

after replacing $d$ by its expression as a function of $A$ and $B^{'}$ eigenvalues.

Furthermore, since $D_{1}$ is a root of $P_{1} (x)$ , vectors $Δ_{c o n e}$ expressed in the cone frame verify:

Then

Δ_{c o n e} = ⎛ ⎜ ⎜ ⎜ ⎜ ⎜ ⎜ ⎝ \begin{matrix} 0 \pm \sqrt{\frac{1}{D_{1}^{2}} P_{2} (D_{1})} \pm \sqrt{\frac{1}{D_{1}^{2}} P_{3} (D_{1})} \end{matrix} ⎞ ⎟ ⎟ ⎟ ⎟ ⎟ ⎟ ⎠

Since the sign of the third entry ( $Δ_{c o n e, z}$ ) is fixed under the chirality constraint (ellipsoid located in front of the camera), there remains two possible expressions for $Δ_{c o n e}$ . The two resulting vectors are symmetric with respect to the cone principal plane whose normal is the eigenvector corresponding to eigenvalue $λ_{B^{'}, 2}$ .

Equation (12) then provides the expressions of $A$ in the cone coordinate frame:

A_{c o n e} = \frac{σ}{μ} B_{c o n e}^{'} - \frac{σ^{2}}{μ} B_{c o n e}^{'} Δ_{c o n e} Δ_{c o n e}^{⊤} B_{c o n e}^{'} .

One can therefore derive the expressions of $Δ$ and $A$ in the camera frame using $^{c a m} R_{c o n e}$ (known):

	$Δ_{c a m}$	$=^{c a m} R_{c o n e} Δ_{c o n e},$
	$A_{c a m}$	$=^{c a m} R_{c o n e} A_{c o n e}^{c a m} R_{c o n e}^{⊤},$

then the spheroid centers are:

C_{c a m} = E_{c a m} - Δ_{c a m} = - Δ_{c a m}

∎

Now that the spheroid solutions have been determined in the camera coordinate frame, one can deduce the poses of camera solutions.

Result 10.

Considering a spheroid along with a non-circular backprojection cone, the axial symmetry of the spheroid leads to an infinite number of camera solutions. The solutions belong to two planes orthogonal to the revolution axis of the spheroid and located at the same distance from its center (see Fig 6).

Proof.

Since there is only one possible value for $σ$ , vectors $Δ$ all have the same norm (cf. Result 6), i.e. the cameras are located at a fixed distance from the spheroid center.

Furthermore, orientations $^{c a m} R_{e l l}^{⊤}$ of these cameras verify:

A_{c a m} =^{c a m} R_{e l l} A_{e l l}^{c a m} R_{e l l}^{⊤}

Since the spheroid has a revolution axis, arbitrarily fixing $A_{e l l}$ eigenvectors for instance leaves two choices for one of $A_{c a m}$ eigenvectors (the one corresponding to the revolution axis) and an infinite number of choices for the other two. ∎

6.4.2 The Circular Cone

Considering the case of a circular cone (elliptic cone with a revolution axis), we are going to demonstrate that there is only one spheroid tangent to it (Result 11).

In this case, $B^{'}$ has two distinct eigenvalues. Let’s denote them $λ_{B^{'}, s i n g l e}$ (multiplicity 1) and $λ_{B^{'}, d o u b l e}$ (multiplicity 2).

Result 11.

Considering a spheroid along with a circular backprojection cone, there is only one $σ$ value solution of Equation (1):

σ = \frac{λ_{A, s i n g l e}}{λ_{B^{'}, s i n g l e}} .

(25)

That $σ$ value gives rise to one spheroid tangent to the cone, and both revolution axes coincide (See Fig. 9). The distance between the cone vertex and the spheroid center is given by:

∥ Δ ∥ = \sqrt{\frac{1}{λ_{A, s i n g l e}} (1 - \frac{λ_{A, s i n g l e} λ_{B^{'}, d o u b l e}}{λ_{A, d o u b l e} λ_{B^{'}, s i n g l e}})} .

(26)

Figure 9: The spheroid solution with a circular backprojection cone.

Proof.

Given that $A$ has two distinct eigenvalues, its minimal polynomial is

π_{A} (x) = (x - λ_{A, s i n g l e}) (x - λ_{A, d o u b l e}) .

$π_{A}$ being an annihilator polynomial of $A$ , evaluating $π_{A} (A)$ gives

A^{2} = α A + β I,

where

	$α$	$= λ_{A, s i n g l e} + λ_{A, d o u b l e},$
	$β$	$= - λ_{A, s i n g l e} λ_{A, d o u b l e} .$

Left-multiplying by $Δ^{⊤}$ and right-multiplying by $Δ$ gives

Δ^{⊤} A^{2} Δ = α Δ^{⊤} A Δ + β Δ^{⊤} Δ .

(27)

Injecting the expressions of $Δ^{⊤} A^{2} Δ$ , $Δ^{⊤} A Δ$ and $Δ^{⊤} Δ$ as functions of $m$ from (18) into (27) and observing that

t r (A) = λ_{A, s i n g l e} + 2 λ_{A, d o u b l e}

and that

t r (A^{- 1}) = \frac{1}{λ_{A, s i n g l e}} + \frac{2}{λ_{A, d o u b l e}},

we obtain, after simplification

m^{3} = \frac{t r (B^{'}) d}{λ_{A, d o u b l e}} m^{2} + \frac{λ_{A, s i n g l e} t r (B^{' - 1})}{d} m - \frac{λ_{A, s i n g l e}}{λ_{A, d o u b l e}} .

Let’s call $P_{s p h e r o i d} (x)$ the above polynomial whose $m$ is root:

	$P_{s p h e r o i d} (x) =$
	$x^{3} - \frac{t r (B^{'}) d}{λ_{A, d o u b l e}} x^{2} - \frac{λ_{A, s i n g l e} t r (B^{' - 1})}{d} x + \frac{λ_{A, s i n g l e}}{λ_{A, d o u b l e}} .$

Developing $t r (B^{'})$ and $t r (B^{' - 1})$ , one can observe that it can be rewritten

P_{s p h e r o i d} (x) = (x - \frac{λ_{B^{'}, s i n g l e}}{λ_{A, d o u b l e}} d) {(x - \frac{λ_{B^{'}, d o u b l e}}{λ_{A, d o u b l e}} d)}^{2} .

At this stage, one can note that the sign of $d$ is the sign of $λ_{B^{'}, s i m p l e}$ :

d = \sqrt[3]{\frac{d e t (A)}{d e t (B^{'})}} = \sqrt[3]{\frac{λ_{A, s i n g l e} λ_{A, d o u b l e}^{2}}{λ_{B^{'}, s i n g l e} λ_{B^{'}, d o u b l e}^{2}}} .

Thus the signs of the roots of $P_{s p h e r o i d} (x)$ are:

\frac{λ_{B^{'}, s i n g l e}}{λ_{A, d o u b l e}} d > 0 and % \frac{λ_{B^{'}, d o u b l e}}{λ_{A, d o u b l e}} d < 0 .

Since $m < 0$ , the only possible value for it is the second one. Therefore, $σ$ is given by:

	$σ$	$= d {(\frac{λ_{B^{'}, d o u b l e}}{λ_{A, d o u b l e}} d)}^{2}$
		$= \frac{λ_{A, s i n g l e}}{λ_{B^{'}, s i n g l e}} .$

Let’s now focus on $Δ$ , and consider the first two equations of System 18:

⎧ ⎨ ⎩ \begin{matrix} Δ^{⊤} Δ = t r (A^{- 1}) - \frac{t r (B^{' - 1})}{d} m Δ^{⊤} A Δ = 1 - m^{3} \end{matrix}

Considering the ellipsoid frame, its left hand side can be rewritten in a matrix form:

(\begin{matrix} 1 & 1 λ_{A, s i n g l e} & λ_{A, d o u b l e} \end{matrix}) (\begin{matrix} Δ_{e l l, x}^{2} Δ_{e l l, y}^{2} + Δ_{e l l, z}^{2} \end{matrix})

That Vandermonde matrix is not singular given that $λ_{A, s i n g l e} \neq λ_{A, d o u b l e}$ , thus the system can be inverted.

After developing the right hand side, we finally obtain

Δ_{e l l, x}^{2} = \frac{1}{λ_{A, s i n g l e}} (1 - \frac{λ_{A, s i n g l e} λ_{B^{'}, d o u b l e}}{λ_{A, d o u b l e} λ_{B^{'}, s i n g l e}}),

and

Δ_{e l l, y}^{2} + Δ_{e l l, z}^{2} = 0 .

Therefore,

Δ_{e l l} = ⎛ ⎜ ⎜ ⎜ ⎜ ⎜ ⎝ \begin{matrix} \pm \sqrt{\frac{1}{λ_{A, s i n g l e}} (1 - \frac{λ_{A, s i n g l e} λ_{B^{'}, d o u b l e}}{λ_{A, d o u b l e} λ_{B^{'}, s i m p l e}})} 00 \end{matrix} ⎞ ⎟ ⎟ ⎟ ⎟ ⎟ ⎠ .

(28)

$Δ$ is thus an eigenvector of $A$ corresponding to eigenvalue $λ_{A, s i n g l e}$ , i.e. coincides with the revolution axis of the spheroid.

Equation (2) then requires that $Δ$ is also an eigenvector of $B^{'}$ . It must be the one corresponding to the revolution axis of the cone since, if not, the ellipsoid center would be located outside of the cone. In that respect, both axes of revolutions (cone and spheroid) coincide, and $∥ Δ ∥$ is given by:

∥ Δ ∥ = \sqrt{\frac{1}{λ_{A, s i n g l e}} (1 - \frac{λ_{A, s i m p l e} λ_{B^{'}, d o u b l e}}{λ_{A, d o u b l e} λ_{B^{'}, s i n g l e}})} .

∎

$A_{c o n e}$ can then be obtained from (12), then $A_{c a m}$ and $C_{c a m}$ using $^{c a m} R_{c o n e}$ (known).

Now the spheroid solution has been determined in the camera frame, we can infer the corresponding camera poses.

Result 12.

When considering a spheroid along with a circular backprojection cone, there are two solutions for camera position, that are located on the revolution axis of the spheroid and at the same distance from its center, and an infinite number of solutions for camera orientation.

Proof.

Equation (28) gives the two possible solutions for $Δ$ . For reasons of symmetry, both of them are actual solutions.

Furthermore, orientations $^{c a m} R_{e l l}^{⊤}$ of these cameras verify:

A_{c a m} =^{c a m} R_{e l l} A_{e l l}^{c a m} R_{e l l}^{⊤}

Just as when considering a non-circular elliptic cone (Result 10), we conclude on the infinite number of camera orientations. ∎

6.5 The Sphere

When the ellipsoid is a sphere, the matrix $A$ has the same expression in every basis:

A = λ_{A, t r i p l e} I,

(29)

where $R = \frac{1}{\sqrt{λ_{A, t r i p l e}}}$ is the sphere radius.

Given its observation as an circle in the image, there is obviously an infinite number of camera solutions, that are located at the same distance from the sphere center.

Using the formalism of our study, we can precise these properties with the following result:

Result 13.

When considering a sphere, there is only one $σ$ solution of Equation (1):

σ = \frac{λ_{A, t r i p l e}}{λ_{B^{'}, s i n g l e}} .

(30)

That $σ$ value defines a unique sphere tangent to the cone, whose center belongs the revolution axis of the cone. The distance from the cone vertex to sphere center is given by:

∥ Δ ∥ = \sqrt{\frac{1}{λ_{A, t r i p l e}} (1 - \frac{λ_{B^{'}, d o u b l e}}{λ_{B^{'}, s i n g l e}})} .

(31)

The locus of camera positions is then a sphere with radius $∥ Δ ∥$ around the ellipsoid center.

Proof of Result 13 is provided in Appendix H.

6.6 Examples of Retrieved Poses

We provide in Fig. 10 a few examples of retrieved camera trajectories from one ellipse-ellipsoid correspondence on a real scene from the T-LESS dataset (SturmEEBC12). Ellipsoidal models of objects (all triaxial) were reconstructed using 7919240. Ellipsoids have then been reprojected into one image of the sequence using the groundtruth projection matrix. Finally, the trajectories of the camera solutions were retrieved using our method (Section 6.3). In the figure, ellipse and ellipsoid colors coincide with those of the trajectories. Naturally, all the trajectories intersect at the ground-truth camera position $E$ . This example illustrate the practical interest of our method. However, a comprehensive study of numerical aspects, e.g. noise robustness, are outside the scope of this paper.

Figure 10: Examples of camera trajectories corresponding to six different ellipse-ellipsoid pairs. The image is extracted from the T-LESS dataset (SturmEEBC12).

7 Conclusion

We propose in this paper a complete characterization of the P $1$ E problem and noticeably extend previous works that only partially addressed this problem. Besides its theoretical interest, this paper also proposes a constructive solution of the camera trajectories. The closed-form solution provided for the position-from-orientation case has proven its practical interest. The orientation-from-position solution, even if not yet exploited, may also represent a convenient manner to simplify the pose estimation problem.

Future investigations concern numerical aspects and especially a sensitivity analysis of the method to image noise. Another important concern will be the joint use of the method with a minimal number of other image features, such as points or other ellipse-ellipsoid correspondences, to ensure the computation of a unique solution.

Declarations

The authors declare that they have no conflict of interest.

Appendix A Equivalent Problem Formulations

To prove that Equations (1) and (1’) are equivalent, we demonstrate below that $(???)$ implies $(???)$ then $(???)$ implies $(???)$ .

$(???) ⟹ (???)$ Multiply (1) on the right by $Δ$ to obtain $A Δ = σ B^{'} Δ$ (see Appendix B). Whence

A Δ Δ^{⊤} A + μ A = σ B^{'} ⟹ σ B^{'} - σ B^{'} Δ Δ^{⊤} A = μ A

Left-multiplying by $B^{' - 1}$

σ I - σ Δ Δ^{⊤} A = μ B^{' - 1} A

Then right-multiplying by $A^{- 1}$

σ A^{- 1} - σ Δ Δ^{⊤} = μ B^{' - 1}

Finally ( $σ \neq 0$ )

A^{- 1} - Δ Δ^{⊤} = \frac{μ}{σ} B^{' - 1}

$(???) ⟹ (???)$ Multiply (1’) on the right by $A$ then on the left by $B^{'}$ to obtain

B^{'} - B^{'} Δ Δ^{⊤} A = \frac{μ}{σ} A

Then right-multiply by $Δ$ to obtain $μ B^{'} Δ = \frac{μ}{σ} A Δ$ , whence ( $μ \neq 0$ ) $A Δ = σ B^{'} Δ$ .

Injecting that result into he previous equation leads to

σ B^{'} - A Δ Δ^{⊤} A = μ A

Appendix B Proving that $Q (B^{' - 1} A) = 0$

Replacing (2) into (1), we obtain:

σ^{2} B^{'} Δ Δ^{⊤} B^{'} - (σ Δ^{⊤} B^{'} Δ - 1) A = σ B^{'}

We can then deduce the following expression for A:

A = \frac{σ}{1 - σ Δ^{⊤} B^{'} Δ} (B^{'} - σ B^{'} Δ Δ^{⊤} B^{'})

Whence, denoting $I$ the identity matrix and defining $f = \frac{σ}{1 - σ Δ^{⊤} B^{'} Δ}$ , then left-multiplying by $B^{' - 1}$ , we obtain

B^{' - 1} A

= f (I - σ Δ Δ^{⊤} B^{'})

Squaring that expression leads to

	$(B^{'}$	$^{- 1} A)^{2} = f^{2} (I - σ Δ Δ^{⊤} B^{'})^{2}$
		$= f^{2} (I - 2 σ Δ Δ^{⊤} B^{'} + σ^{2} Δ (Δ^{⊤} B^{'} Δ) Δ^{⊤} B^{'})$
		$= f^{2} (I - 2 σ Δ Δ^{⊤} B^{'} + σ^{2} (Δ^{⊤} B^{'} Δ) Δ Δ^{⊤} B^{'})$
		$= f^{2} (I - σ (2 - σ Δ^{⊤} B^{'} Δ) Δ Δ^{⊤} B^{'})$

Defining $μ = 1 - σ Δ^{⊤} B^{'} Δ = 1 - Δ^{⊤} A Δ$ :

	$(B^{' - 1} A)^{2}$	$= f^{2} (I - σ (μ + 1) Δ Δ^{⊤} B^{'})$
		$= f^{2} ((μ + 1) (I - σ Δ Δ^{⊤} B^{'}) - μ I)$
		$= f (μ + 1) B^{' - 1} A - f^{2} μ I$
		$= \frac{σ}{μ} (μ + 1) B^{' - 1} A - \frac{σ^{2}}{μ} I$

Finally, we have

μ (B^{' - 1} A)^{2} = σ (μ + 1) B^{' - 1} A - σ^{2} I

Whence, denoting $Q (x) = μ x^{2} - (μ + 1) σ x + σ^{2}$ ,

Q (B^{' - 1} A) = 0

Appendix C Vandermonde Matrix

A Vandermonde matrix is a matrix with the terms of a geometric progression in each row or column:

V = ⎛ ⎜ ⎜ ⎜ ⎜ ⎜ ⎜ ⎜ ⎜ ⎝ \begin{matrix} 1 & 1 & 1 & \dots & 1 x_{1} & x_{2} & x_{3} & \dots & x_{n} x_{1}^{2} & x_{2}^{2} & x_{3}^{2} & \dots & x_{n}^{2} ⋮ & ⋮ & ⋮ & ⋱ & ⋮ x_{1}^{m - 1} & x_{2}^{m - 1} & x_{3}^{m - 1} & \dots & x_{n}^{m - 1} \end{matrix} ⎞ ⎟ ⎟ ⎟ ⎟ ⎟ ⎟ ⎟ ⎟ ⎠ .

The determinant of a square Vandermonde matrix (when $m = n$ ) is given by

d e t (V) = \prod 1 \leq i < j \leq n (x_{j} - x_{i}) .

Therefore, $V$ is not singular (i.e. $d e t (V) \neq 0$ ) if and only if all $x_{i}$ are distinct.

Appendix D Ellipsoid and Cone types

See Tables 3 and 4.

Table 3: The different types of ellipsoids.

Table 4: The different types of elliptic cones.

Appendix E Theorem 1: Maple code

⬇

> with(linalg);

> A:=matrix([[lA1,0,0],[0,lA2,0],[0,0,lA3]]);

> B:=matrix([[lB1,0,0],[0,lB2,0],[0,0,lB3]]);

> M_A:=transpose(vandermonde([lA1,lA2,lA3]));

> d:=(det(A)/det(B))^(1/3);

> V:=transpose(matrix([[trace(inverse(A))-

trace(inverse(B))*m/d,1-m^3,

trace(B)*d*m^2-trace(A)*m^3]]));

> Delta2:=multiply(inverse(M_A),V);

> Delta:=transpose(matrix([[sqrt(Delta2[1,1]),

sqrt(Delta2[2,1]),sqrt(Delta2[3,1])]]));

> inv_B:=evalm(d/m*(inverse(A)-multiply(Delta,

transpose(Delta))));

> eigenvalues(inv_B);

Appendix F Theorem 9: Maple code

⬇

> with(linalg);

> A:=matrix([[lA1,0,0],[0,lA2,0],[0,0,lA2]]);

> B:=matrix([[lB1,0,0],[0,lB2,0],[0,0,lB3]]);

> M_B:=transpose(vandermonde([lB1,lB2,lB3]));

> d:=(det(A)/det(B))^(1/3);

> V_:=transpose(matrix([[trace(inverse(A))-

trace(inverse(B))*m/d,1-m^3,

trace(B)*d*m^2-trace(A)*m^3]]));

> V:=multiply(inverse(matrix([[1,0,0],[0,d*m^2

,0],[0,0,d^2*m^4]])),V_);

> Delta2:=multiply(inverse(M_B),V);

> Delta:=transpose(matrix([[sqrt(Delta2[1,1]),

sqrt(Delta2[2,1]),sqrt(Delta2[3,1])]]));

> inv_A:=evalm(multiply(Delta,transpose(Delta))

+evalm(m/d*inverse(B)));

> eigenvalues(inv_A);

Appendix G Solving the Polynomial Equation in Theorem 3

In case #1, the signs of $B^{'}$ eigenvalues ensure that

d = \sqrt[3]{\frac{λ_{A, s i n g l e} λ_{A, d o u b l e}^{2}}{λ_{B^{'}, 1} λ_{B^{'}, 2} λ_{B^{'}, 3}}} > 0,

then that

k_{1} < 0, k_{2} > 0 and k_{3} < 0 .

Let’s denote $S_{i}$ the root of $P_{i} (x)$ with multiplicity 1, and $D_{i}$ the root with multiplicity 2, such that

S_{i} = \frac{λ_{B^{'}, i}}{λ_{A, s i n g l e}} d and D_{i} = \frac{λ_{B^{'}, i}}{λ_{A, d o u b l e}} d .

Considering the signs of $λ_{B^{'}, i}$ and the fact that $λ_{A, s i m p l e} < λ_{A, d o u b l e}$ , it comes

	$S_{1}$	$< D_{1} < 0,$
	$S_{2}$	$< D_{2} < 0,$
	$0$	$< D_{3} < S_{3} .$

One can therefore note that the roots of $P_{1} (x)$ and $P_{2} (x)$ are negative, while those of $P_{3} (x)$ are positive. Since, in addition, $k_{3} < 0$ , we have

\forall x \leq 0, P_{3} (x) > 0.

Let’s now focus on the signs of $P_{1} (x)$ and $P_{2} (x)$ to determine the locus of possible $m$ values. Since $λ_{B^{'}, 1} < λ_{B^{'}, 2}$ , their roots verify

	$S_{1}$	$< S_{2},$
	$D_{1}$	$< D_{2} .$

Therefore, one can distinguish between two configurations regaring the roots order:

S_{1} < D_{1} < S_{2} < D_{2},

S_{1} < S_{2} < D_{1} < D_{2} .

For the first case, the variations of the three polynomials are presented in Table 5.

$x$		$S_{1}$		$D_{1}$		$S_{2}$		$D_{2}$
$P_{1} (x)$	+	0	-	0			-
$P_{2} (x)$		-				0	+	0	+
$P_{3} (x)$					+

Table 5: Signs of

P_{i} (x)

when

S_{1} < D_{1} < S_{2} < D_{2}

One can observe that the three polynomials are never all non-negative, thus this configuration is impossible. For the second case, however, there is one value for which all three polynomials are non-negative: $D_{1}$ (see Table 6).

$x$			$S_{1}$	$S_{2}$		$D_{1}$	$D_{2}$
$P_{1} (x)$	+		0	-		0	-
$P_{2} (x)$		-		0		+	0	+
$P_{3} (x)$					+

Table 6: Signs of

P_{i} (x)

when

S_{1} < S_{2} < D_{1} < D_{2}

Appendix H Proof of Result 13

Proof.

Left-multiplying $A = λ_{A, t r i p l e} I$ by $Δ^{⊤}$ right-multiplying it by $Δ$ , we obtain

Δ^{⊤} A Δ = λ_{A, t r i p l e} Δ^{⊤} Δ .

Injecting the first two equations of System (18), we then have

1 - m^{3} = λ_{A, t r i p l e} (t r (A^{- 1}) - \frac{t r (B^{' - 1})}{d} m) .

Developing $t r (A^{- 1})$ and $t r (B^{' - 1})$ leads to

1 - m^{3} = 3 - \frac{λ_{A, t r i p l e}}{d} \frac{λ_{B^{'}, d o u b l e} + 2 λ_{B^{'}, s i m p l e}}{λ_{B^{'}, s i m p l e} λ_{B^{'}, d o u b l e}} m

Furthermore, one can observe that

d = \sqrt[3]{\frac{d e t (A)}{d e t (B^{'})}} = \frac{λ_{A, t r i p l e}}{λ_{B^{'}, s i m p l e}^{1 / 3} λ_{B^{'}, d o u b l e}^{2 / 3}} .

Whence, injecting this into the former equation:

	$0$	$= m^{3} - \frac{λ_{B^{'}, d o u b l e} + 2 λ_{B^{'}, s i m p l e}}{λ_{B^{'}, s i m p l e}^{2 / 3} λ_{B^{'}, d o u b l e}^{1 / 3}} m + 2$
		$= m^{3} - ⎛ ⎜ ⎝ \frac{λ_{B^{'}, d o u b l e}^{2 / 3}}{λ_{B^{'}, s i m p l e}^{2 / 3}} + 2 \frac{λ_{B^{'}, s i m p l e}^{1 / 3}}{λ_{B^{'}, d o u b l e}^{1 / 3}} ⎞ ⎟ ⎠ m + 2$

Denoting

R = \sqrt[3]{\frac{λ_{B^{'}, d o u b l e}}{λ_{B^{'}, s i m p l e}}},

the last equation means that $m$ is root of the polynomial

P_{s p h e r e} (x) = x^{3} - (R^{2} + \frac{2}{R}) x + 2 .

Yet, $R$ is an obvious root of this polynomial:

P_{s p h e r e} (x) = (x - R) (x^{2} + R x - \frac{2}{R})

Even if obtaining a formal expression of the two other roots is not straighforward, Vieta’s formulas provide the following constraints:

{\begin{matrix} x_{1} + x_{2} + R = 0, x_{1} x_{2} R = - 2 . \end{matrix}

If roots $x_{1}$ are $x_{2}$ complex, then $R$ is the only possible value for $m$ . If they are real, then, since $R < 0$ ( $B^{'}$ eigenvalues are of opposite signs), the second formula requires that $x_{1}$ and $x_{2}$ are of the same sign, and the first formula requires that they are positive. Finally,

m = R .

Corresponding $σ$ value is:

	$σ$	$= d R^{2}$
		$= \frac{λ_{A, t r i p l e}}{λ_{B^{'}, s i n g l e}^{1 / 3} λ_{B^{'}, d o u b l e}^{2 / 3}} \frac{λ_{B^{'}, d o u b l e}^{2 / 3}}{λ_{B^{'}, s i n g l e}^{2 / 3}}$
		$= \frac{λ_{A, t r i p l e}}{λ_{B^{'}, s i n g l e}} .$

Applying $t r ()$ to Equation (1’) gives the value of $∥ Δ ∥^{2}$ :

∥ Δ ∥^{2} = \frac{3}{λ_{A, t r i p l e}} - \frac{μ}{σ} (\frac{1}{λ_{B^{'}, s i n g l e}} + \frac{2}{λ_{B^{'}, d o u b l e}}) .

Given

σ = \frac{λ_{A, t r i p l e}}{λ_{B^{'}, s i m p l e}},

and

μ = m^{3} = \frac{λ_{B^{'}, d o u b l e}}{λ_{B^{'}, s i m p l e}} %,

the right hand side can be rewritten

\frac{3}{λ_{A, t r i p l e}} - \frac{λ_{B^{'}, d o u b l e}}{λ_{A, t r i p l e}} (\frac{1}{λ_{B^{'}, s i m p l e}} + \frac{2}{λ_{B^{'}, d o u b l e}}),

i.e.

	$∥ Δ ∥^{2}$	$= \frac{3}{λ_{A, t r i p l e}} - \frac{λ_{B^{'}, d o u b l e}}{λ_{B^{'}, s i m p l e}} \frac{1}{λ_{A, t r i p l e}} - \frac{2}{λ_{A, t r i p l e}}$
		$= (1 - \frac{λ_{B^{'}, d o u b l e}}{λ_{B^{'}, s i m p l e}}) \frac{1}{λ_{A, t r i p l e}} .$

∎

	Triaxial ellipsoid	Spheroid	Sphere
Illustration
Lengths	$a, b, c$	$a, b$	$r$
of principal
axes
Eigenvalues	$\frac{1}{a^{2}}, \frac{1}{b^{2}}, \frac{1}{c^{2}}$	$\frac{1}{a^{2}}, \frac{1}{b^{2}}, \frac{1}{b^{2}}$	$\frac{1}{r^{2}}, \frac{1}{r^{2}}, \frac{1}{r^{2}}$
of $A$
Signs of	$+, +, +$	$+, +, +$	$+, +, +$
eigenvalues
Characteristic	$P_{A} (x) = (x - a) (x - b) (x - c)$	$P_{A} (x) = (x - a) (x - b)^{2}$	$P_{A} (x) = (x - r)^{3}$
polynomial
of $A$
Minimal	$π_{A} (x) = (x - a) (x - b) (x - c)$	$π_{A} (x) = (x - a) (x - b)$	$π_{A} (x) = (x - r)$
polynomial
of $A$

polynomial of $B$	$P_{B} (x) = (x - λ_{1}) (x - λ_{2}) (x - λ_{3})$	$P_{B} (x) = (x - λ_{1}) (x - λ_{2})^{2}$
	Non-circular elliptic cone	Circular cone
Illustration
Eigenvalues of $B$	$λ_{1}, λ_{2}, λ_{3}$	$λ_{1}, λ_{2}, λ_{2}$
	$-, +, +$	$-, +, +$
Signs of eigenvalues	or	or
	$+, -, -$	$+, -, -$
Characteristic	$P_{B} (x) = (x - λ_{1}) (x - λ_{2}) (x - λ_{3})$	$P_{B} (x) = (x - λ_{1}) (x - λ_{2})^{2}$
Minimal	$π_{B} (x) = (x - λ_{1}) (x - λ_{2}) (x - λ_{3})$	$π_{B} (x) = (x - λ_{1}) (x - λ_{2})$
polynomial of $B$	$π_{B} (x) = (x - λ_{1}) (x - λ_{2}) (x - λ_{3})$	$π_{B} (x) = (x - λ_{1}) (x - λ_{2})$

[

Abstract

1 Introduction

2 Related Work

2.1 Quadrics-based Pose Estimation

2.2 Perspective-1-Spheroid

3 Formulation of the Ellipsoid Pose Estimation Problem

3.1 Problem Statement

3.1.1 Projection Cone

3.1.2 Backprojection Cone

3.1.3 The Cone Alignment Equation

3.2 Pose Problem Analysis

4 Properties of the Solutions

4.1 Link with a Generalized Eigenvalue Problem

Result 1.

Proof.

4.2 Generalized Eigenvalues of {A,B′}

Result 2.

Proof.

4.3 Characterization of σ

Result 3.

Proof.

4.4 Characterizations of μ

Result 4.

Proof.

Result 5.

Proof.

4.5 Link between σ and ∥Δ∥

Result 6.

Proof.

5 Decoupling between Orientation and Position

5.1 Position from Orientation

Result 7.

Proof.

5.2 Orientation from Position

Result 8.

Proof.

6 Closed-form Solutions

6.1 Preliminaries: Co-occurences of Ellipsoid and Cone Types

6.2 Overview of the Solutions

6.3 The Triaxial Ellipsoid

6.3.1 Solving for σ

Theorem 1.

Proof.

6.3.2 Solving for Camera Poses

Theorem 2.

Proof.

6.4 The Spheroid

6.4.1 The Non-circular Elliptic Cone

Result 9.

Proof.

Theorem 3.

Proof.

Result 10.

Proof.

6.4.2 The Circular Cone

Result 11.

Proof.

Result 12.

Proof.

6.5 The Sphere

Result 13.

6.6 Examples of Retrieved Poses

7 Conclusion

Declarations

Appendix A Equivalent Problem Formulations

Appendix B Proving that Q(B′−1A)=0

Appendix C Vandermonde Matrix

Appendix D Ellipsoid and Cone types

Appendix E Theorem 1: Maple code

Appendix F Theorem 9: Maple code

Appendix G Solving the Polynomial Equation in Theorem 3

Appendix H Proof of Result 13

Proof.

References

2.2 Perspective- $1$ -Spheroid

4.2 Generalized Eigenvalues of ${A, B^{'}}$

4.3 Characterization of $σ$

4.4 Characterizations of $μ$

4.5 Link between $σ$ and $∥ Δ ∥$

6.3.1 Solving for $σ$

Appendix B Proving that $Q (B^{' - 1} A) = 0$