This is the continuation to Fundamentals of parameter estimation - Part II. Exercise 3/8 from my optimal estimation course. The focus is on random vectors and unbiased linear MMSE estimation.

At the end of this exercise I should understand insights about the concept of covariance matrices and about unbiased linear MMSE estimation.

Context

Prior knowledge

I have a ship. The parameter vector to estimate is the position vector of that ship $x = [x y]^{T}$ . The prior knowledge, obtained via dead reckoning, is captured as a prior expectation $μ_{x}$ and a covariance matrix which expresses the prior uncertainty that we have about the position $C_{x}$ .

Measurement

In order to increase the accuracy, the navigator of the ship measures the direction $φ$ of a beacon, e.g. lighthouse, relative to the ship as in the Figure below.

The beacon has a known reference position $x_{0}$ . The line of sight is defined by the position of the beacon and by the measured direction $θ$ . The compass reading gives $θ = φ + △ θ$ . The following equation defines the line of sight in the $(ξ, η)$ plane:

x_{0} sin θ - y_{0} cos θ = ξ sin θ - η cos θ

The measurement model

The relation between the ship’s true position $x = (x, y)$ and the true bearing $φ$ is:

x_{0} sin φ - y_{0} cos φ = x sin φ - y cos φ

or, by substituting $φ = θ - △ θ$

x_{0} sin (θ - Δ θ) - y_{0} cos (θ - Δ θ) = x sin (θ - Δ θ) - y cos (θ - Δ θ)

The relation between the ship’s true position $x$ and the observed direction $θ$ is nonlinear. To get a linear approximation, we apply a truncated Taylor series expansion to the sine and cosine functions:

sin (θ - Δ θ) \approx sin θ - Δ θ cos θ

cos (θ - Δ θ) \approx cos θ + Δ θ sin θ

⇓ after some rearrangements

x_{0} sin θ - y_{0} cos θ \approx x sin θ - y cos θ + Δ θ ((x_{0} - x) cos θ + (y_{0} - y) sin θ)

Since $θ \approx φ$ , the factor $(x_{0} - x) cos θ + (y_{0} - y) sin θ$ almost equals $(x_{0} - x) cos φ + (y_{0} - y) sin φ$ . The latter equals the distance $d$ between beacon and ship. Therefore:

x_{0} sin θ - y_{0} cos θ \approx x sin θ - y cos θ + d Δ θ

This can be written in the form $z = H x + v$ with the following definitions $⎩ ⎨ ⎧ z = x_{0} sin θ - y_{0} cos θ H = [sin θ - cos θ] v = d Δ θ$

The distance $d$ is unknown, but can be estimated from prior knowledge $μ_{x}$ of the ship’s position and the position of the beacon: $d \approx ∣∣ x_{0} - μ_{x} ∣∣$ . Assuming the measurement of the bearing has an uncertainty of $σ_{Δ θ}$ , the standard deviation of $v$ is $σ_{v} = d σ_{Δ θ}$ [radians].

The Case

Physical units are Nautical miles (Nm).

Uncertainty regions and principal axes

For normal distributions $p (x) = \frac{1}{( 2 π ) ^{N} ∣ C _{x} ∣} exp (- \frac{( x - μ _{x} ) ^{T} C _{x}^{- 1} ( x - μ _{x} )}{2})$ , the equation for the contour simplifies to: $(x - μ_{x})^{T} C_{x}^{- 1} (x - μ_{x}) = k^{2} with k = 1, 2, 3$ .

The eigenvectors and eigenvalues are solutions of $C_{x} v = λ v$ and the corresponding scaling factors are $a_{m} = λ_{m}$

First topic: Determine the eigenvalues and eigenvectors of $C_{x}$ and draw the associated uncertainty region.

1.1 Generate a set of points on a circle with unit radius. The centre of the circle is positioned at the origin

1.2 Scale the $x$ and $y$ coordinates of these points in accordance with the scaling factors $a_{0}$ and $a_{1}$ . The resulting points form an ellipse with the right shape, but not with the right orientation and position.

So based on the Figure 5 above, I need to extract the $a_{0}$ and $a_{1}$ scaling factors of the ellipse defined by $C_{x}$ . The corresponding scaling factors are $a_{m} = λ_{m}$ .

1.3 Rotate the set of points in accordance with the direction of the principal axes. The eigenvector-matrix is a rotation matrix.

This is really just a no-brainer, since the eigenvector-matrix is in itself the Rotation Matrix I need to apply.

1.4+1.5 Shift the whole set to the position determined by $μ_{x}$ . Plot the curve defined by the resulting set of points.

Again, I simply add to each axis the values from $μ_{x}$ .

Second topic: Add the line of sight to the Figure. From the context, the uncertainty of the measured bearing is the standard deviation $σ_{Δ θ}$ . The range $[θ - σ_{Δ θ}, θ + σ_{Δ θ}]$ defines an uncertainty region in the shape of a 2D cone. Visualize this cone in the graph by adding two dashed lines.

The line of sight is a line starting from the beacon position $x_{0}$ going in the direction of the measured bearing $θ$ . So I must simply apply the equation:

y_{los} - y_{0} = m (x_{los} - x_{0})

where $m = tan (θ)$ .

For the upper and lower bounds of the LoS (the 2D cone), I can simply reapply the formula like this:

y = y_{0} + tan (θ \pm σ_{Δ θ}) (x - x_{0})

In the Figure above, in the left plot, the lines do not intersect the uncertainty region if I consider the compass reading of $θ = 35$ . This would indicate that the prior position estimate and the compass measurement are pointing at slightly different locations (so the reading of the compass is off). To see what the true bearing should be, I applied $a t an 2$ between the bearing and the prior estimate, and I get the result of $\sim 41.63$ degrees. In this case, the line of sight would pass straight through the ship’s position in the right plot.

Third topic: The linearized measurement function replaces the cone by a bar (i.e. two parallel lines). The width of this bar is $2 σ_{v}$ . Calculate it and show the results.

According to the document, $σ_{v} = d σ_{Δ θ}$ , where $d = ∣∣ x_{0} - μ_{x} ∣∣$ is the euclidean distance between the bearing and the prior position estimate. According to the calculations, the initial width of the bar is $\sim 4.2033 N m$ .

Although $θ$ is the real measurement, I can use $z$ as a derived measurement instead.

z = x_{0} sin (θ) - y_{0} cos (θ)

Okk, but what is $z$ ? What does it represent?

While the actual physical measurement is the bearing angle $θ$ , its relationship to the ship’s position is non-linear. To make this usable for linear estimation, the measurement model is linearized using a Taylor series expansion. This process groups the known variables—the beacon’s position $(x_{0}, y_{0})$ and the measured angle $θ$ - into a single known scalar value $z$ .

Geometrically, the absolute value of z represents the shortest, perpendicular distance from the origin (0, 0) to the measured line of sight. Because it is a signed value, the positive or negative sign simply indicates which side of the origin the line falls on.

Basically, the true relationship is non-linear and I linearize it through the standard linear format $z = H x + v$ . I will need it in the unbiased linear MMSE estimator.

The linearized bar consists of two parallel lines defined by

x sin (θ) - y cos (θ) = z \pm σ_{v}

After rearranging, I get $y = x tan (θ) - \frac{z \pm σ _{v}}{c o s ( θ )}$

According to the plot, the results do make sense, since the cone and the bar are approximately equal in width near the ship’s position (they actually overlap, since the dashed line of the cone is no longer visible), which is where the linearization is valid. Further away from that, the approximation becomes less accurate.

Fourth topic: Determine the derived measurement $z$ , the measurement matrix $H$ , and the Kalman Gain matrix. The covariance matrix of the measurement noise is $C_{v} = σ_{v}^{2}$ . Next, calculate the unbiased linear MMSE estimate of the position and the corresponding (error) covariance matrix.

For the first part, I already had to compute the derived measurement $z$ in the last question, and its value is $z \sim - 24.5576$ which makes sense. The minus sign signals that the side on which the shortest perpendicular falls on the line of sight is to the left of the origin. The actual distance would be $z = 24.5576 N m$ .

Since $H = [sin θ - cos θ]$ , the actual values of the measurement matrix would be $H = [0.5736, - 0.8192]$ . It maps the 2D position to the scalar measurement.

The Kalman Gain $K = C_{x} H^{T} (H C_{x} H^{T} + C_{v})^{- 1}$ (taken from eq. 3.33 from the book) weights how much to trust the measurement versus the prior knowledge. The actual values are $K \approx [0.419, - 0.8625]$ .

The updated estimate $μ_{post} = \hat{x}_{ulMMSE} (z) = μ_{x} + K (z - H μ_{x})$ depends mostly on the innovation $z - H μ_{x}$ . The Kalman Gain transforms the innovation into a correction term that represents the knowledge that we have gained from the measurements.

When you invert a covariance matrix, you get the information matrix. Just a reminder.

The updated covariance $C_{e} = (C_{x}^{- 1} + H^{T} C_{V}^{- 1} H)^{- 1}$ (taken from eq 3.44 from the book) represents the reduced uncertainty after incorporating the measurement. The actual values are $C_{e} \approx [10.4122 5.0316 5.0316 8.174]$ .

Fifth topic: Draw the uncertainty region of the estimate. That is, plot the posterior mean and covariance matrix.

Now, based on the updated information, the ship’s updated uncertainty region falls within the designated bounds of the line of sight. An interesting idea is that the bearing measurement only constrains the position perpendicular to the line of sight. Along the line of sight, distance remains uncertain, so the ellipse stretches in that direction (the red uncertainty region). Visibly, the Kalman gain K determined how much weight to give the measurement versus the prior. Since $σ_{v}$ was relatively small, the measurement was trusted and the uncertainty collapsed significantly in the perpendicular direction. Due to the innovation being nonzero, meaning the prior mean was not on the line of sight, the estimate got pulled onto it.

Sixth topic: Repeat questions 2 to 4 a number of times, but with varying values of $σ_{Δ θ}$ and explain what you see.

The term $σ_{Δ θ}$ directly influences the width of the linearized bar width and the covariance matrix of the measurement noise. Therefore, if the bearing uncertainty increases, then the update would take the measurement less into consideration, since the Kalman Gain has it in the denominator and the updated covariance matrix computes the error term based on its inverse.

However, since the uncertainty increases, that also increases the change of the initial guess to fall more and more within the linearized bar width. The updated position $μ_{post}$ is more than likely to fall within the bounds, but the uncertainty also increases. This suggests that the lower the bearing uncertainty, the better and more accurate will the updates be, and a narrower space for uncertainty.

An interesting observations is that as $σ_{Δ θ} \to \infty$ , the posterior converges towards the prior. That happens because $K \to 0$ and $C_{x_post} \to C_{x}$ , meaning the measurement holds no value and no influence to the update.

As $σ_{Δ θ} \to 0$ , there is no real uncertainty region, because it would mean we would trust the measurement completely and the posterior ellipse collapses onto the line of sight.

Seventh topic: Repeat question 2 up to 4 a number of times, but with varying $C_{x}$ by $α$ .

If $α$ increases, that means a larger prior uncertainty, which leads the Kalman Filter to trust the measurement much more. While the posterior will be placed inside the bar width, the uncertainty region grows bigger, which still translates to possible errors.