From what I observed, the teachers simply want us to be able to derive some of the forms we use in the algorithms. These derivations involve either taking partial derivatives or substituting some variables for others.
Closed-Form Scan Matching via Rotation-Translation Separation
Context
As in ICP, we consider two lidar scans represented as corresponding 3D point sets $\{p_i\}_{i=1}^{N}$ and $\{q_i\}_{i=1}^{N}$, where we would like to align each point $p_i$ of the data scan to its corresponding point $q_i$ in the model scan (reference). The rigid alignment problem seeks the transform $(R, t)$, with $R \in SO(3)$ and $t \in \mathbb{R}^3$, that minimizes the least-squares cost

$$E(R, t) = \sum_{i=1}^{N} \| R p_i + t - q_i \|^2.$$

It is stated in the lecture notes that centering the point sets allows the translation $t$ to be removed from the minimization over $R$, leading to a closed-form solution for both $R$ and $t$.
Exercises
- Show that the optimal translation satisfies $t^* = \bar{q} - R\bar{p}$, where $\bar{p} = \frac{1}{N}\sum_i p_i$ and $\bar{q} = \frac{1}{N}\sum_i q_i$ are the centroids.
- Substitute $t^*$ into $E(R, t)$ and show that the problem reduces to $\min_R \sum_i \| R p_i' - q_i' \|^2$, with the centered points $p_i' = p_i - \bar{p}$ and $q_i' = q_i - \bar{q}$.
- Show that minimizing $E$ is equivalent to maximizing $\sum_i q_i'^\top R\, p_i'$.
Solutions
(a) We differentiate $E$ w.r.t. the translation $t$ (the letter $t$ here means translation, not time) and set the gradient to zero:

$$\frac{\partial E}{\partial t} = \sum_{i=1}^{N} 2\,(R p_i + t - q_i) = 0,$$

therefore,

$$t^* = \bar{q} - R\bar{p}.$$
(b) Insert $t^* = \bar{q} - R\bar{p}$ into the error term:

$$R p_i + t^* - q_i = R(p_i - \bar{p}) - (q_i - \bar{q}) = R p_i' - q_i'.$$

Hence

$$E(R) = \sum_{i=1}^{N} \| R p_i' - q_i' \|^2.$$
(c) Expand the squared norm:

$$\| R p_i' - q_i' \|^2 = \| q_i' \|^2 + \| R p_i' \|^2 - 2\, q_i'^\top R\, p_i'.$$

For intuition on the cross-term: $q_i'^\top R p_i' = \|q_i'\|\,\|R p_i'\|\cos\theta$, which is largest when the angle $\theta$ between $R p_i'$ and $q_i'$ is $0$, i.e. when the rotation aligns the centered points. The rest is covered in ICP, but the idea is that the first two terms do not depend on $R$, so we minimize only the cross-term.
Okay, I actually want to see why those first two terms are constant with respect to $R$.
First of all, $\| q_i' \|^2$ obviously does not depend on $R$.

It's the second term I'm more interested in: $\| R p_i' \|^2$. By definition of the Euclidean norm, $\|x\|^2 = x^\top x$. So $\| R p_i' \|^2 = (R p_i')^\top (R p_i') = p_i'^\top R^\top R\, p_i'$. And here is the actual trick: since the rotation matrix is orthogonal, $R^\top R = I$, so $\| R p_i' \|^2 = \| p_i' \|^2$, and the term itself does not depend on $R$.
Then, since the first two terms are constants w.r.t. $R$,

$$\arg\min_R E(R) = \arg\min_R \left( -2 \sum_i q_i'^\top R\, p_i' \right).$$

Minimizing a negative quantity is the same as maximizing the positive one:

$$\arg\min_R E(R) = \arg\max_R \sum_i q_i'^\top R\, p_i'.$$

Thus, minimizing $E$ is equivalent to maximizing $\sum_i q_i'^\top R\, p_i'$. I did mix the notation here and there and got lost in the transposes, but the idea remains.
This does not mean that rotation and translation are geometrically independent in $SE(3)$; rather, the particular structure of the Euclidean least-squares cost combined with centroid subtraction yields a decomposition that is valid only for this point-set alignment objective. This separation does not generalize to the pose residuals built on the logarithmic (log-map) error used in SLAM.
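The maximization over $R$ has a well-known closed-form solution via the SVD (the Kabsch algorithm, which is what closed-form ICP uses). A minimal NumPy sketch; the specific rotation, translation, and point cloud are a toy noise-free example of my own, not from the notes:

```python
import numpy as np

rng = np.random.default_rng(0)

# Ground-truth rigid transform: a rotation about z plus a translation.
theta = 0.7
c, s = np.cos(theta), np.sin(theta)
R_true = np.array([[c, -s, 0.0], [s, c, 0.0], [0.0, 0.0, 1.0]])
t_true = np.array([1.0, -2.0, 0.5])

P = rng.normal(size=(30, 3))      # data scan points p_i
Q = P @ R_true.T + t_true         # model scan points q_i = R p_i + t (noise-free)

# Center both point sets: this removes t from the minimization.
p_bar, q_bar = P.mean(axis=0), Q.mean(axis=0)
Pc, Qc = P - p_bar, Q - q_bar

# Maximizing sum_i q_i'^T R p_i' = tr(R H) with H = sum_i p_i' q_i'^T
# has the closed-form SVD solution R = V U^T (Kabsch).
H = Pc.T @ Qc                     # H = sum_i p_i' q_i'^T
U, _, Vt = np.linalg.svd(H)
D = np.diag([1.0, 1.0, np.linalg.det(Vt.T @ U.T)])  # guard against reflections
R_est = Vt.T @ D @ U.T
t_est = q_bar - R_est @ p_bar     # optimal translation t* = q_bar - R p_bar

print(np.allclose(R_est, R_true), np.allclose(t_est, t_true))  # True True
```

With exact correspondences and no noise the true transform is recovered to machine precision; in real ICP this solve sits inside the correspondence-update loop.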
Deriving the Recursive Bayesian SLAM Update
Context
Consider the SLAM state $(x_t, m)$, where

- $x_t$ is the robot pose at time $t$,
- $m$ is the static map.

Assume the motion model $p(x_t \mid x_{t-1}, u_t)$, the measurement model $p(z_t \mid x_t, m, c_t)$, and the posterior at the previous timestep $p(x_{t-1}, m \mid z_{1:t-1}, u_{1:t-1}, c_{1:t-1})$.
We remember the proportional Bayesian update rule, $p(a \mid b) \propto p(b \mid a)\, p(a)$, and the fact that in Markov processes the current robot state $x_t$ depends only on the previous state $x_{t-1}$ and the current IMU data (or the control input) $u_t$, not on earlier states.
Exercises
- Derive the general recursive Bayes filter update for SLAM:

$$p(x_t, m \mid z_{1:t}, u_{1:t}, c_{1:t}) = \eta\; p(z_t \mid x_t, m, c_t) \int p(x_t \mid x_{t-1}, u_t)\; p(x_{t-1}, m \mid z_{1:t-1}, u_{1:t-1}, c_{1:t-1})\; dx_{t-1}$$
- Show that the Bayes filter update translates into minimizing the Graph-SLAM least-squares objective:

$$\hat{x}_{0:t}, \hat{m} = \arg\min_{x_{0:t},\, m} \left[ \| x_0 - \mu_0 \|^2_{\Sigma_0} + \sum_{\tau=1}^{t} \| x_\tau - f(x_{\tau-1}, u_\tau) \|^2_{\Sigma_u} + \sum_{\tau=1}^{t} \| z_\tau - h(x_\tau, m) \|^2_{\Sigma_z} \right]$$
Clarifications
The process is mainly converting probabilities to penalties. The goal of Graph-SLAM is to find the most likely trajectory and map (the Maximum A Posteriori or MAP estimate) given all our sensor data and movements.
Some clarifications first, because I'm dumb as shit when it comes to mathematics. I really need to pick up a probability book :)
- $\eta$ is the normalization factor. Because the products and integrals on the right side of the equation generally do not integrate to 1 on their own, we use $\eta$ to scale the result back into a valid probability distribution.
- $c_t$ is the respective landmark correspondence (data association). Covered its usage in SLAM.
- $\propto$ is the "is proportional to" symbol.
- So we can frame it as:
- "The posterior probability of the state at time $t$, given all measurements, controls, and landmark correspondences from $0$ to $t$…
- is equal to the normalizer $\eta$…
- times the likelihood of the current measurement, $p(z_t \mid x_t, m, c_t)$, given the current state and landmark…
- times the integral over the previous state $x_{t-1}$…
- of the motion model $p(x_t \mid x_{t-1}, u_t)$ (you see it takes $u_t$ into consideration)…
- multiplied by the previous posterior."
- don’t judge me.
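Putting the narrated pieces together in one annotated display (I'm using $c_t$ as my symbol for the landmark correspondence at time $t$):

$$p(x_t, m \mid z_{1:t}, u_{1:t}, c_{1:t}) = \underbrace{\eta}_{\text{normalizer}}\;\underbrace{p(z_t \mid x_t, m, c_t)}_{\text{measurement likelihood}} \int \underbrace{p(x_t \mid x_{t-1}, u_t)}_{\text{motion model}}\;\underbrace{p(x_{t-1}, m \mid z_{1:t-1}, u_{1:t-1}, c_{1:t-1})}_{\text{previous posterior}}\; dx_{t-1}$$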
Solutions
(a)
We formulate the posterior at $t$ using the Bayes update rule. I remind myself that the measurement $z_t$ only depends on the current state $x_t$, the map $m$, and the landmark association $c_t$. Same for the others: the motion only depends on the previous state $x_{t-1}$ and the control $u_t$, etc.

Thus, the explicit (unmarginalized) posterior over the full trajectory is:

$$p(x_{0:t}, m \mid z_{1:t}, u_{1:t}, c_{1:t}) = \eta\; p(z_t \mid x_t, m, c_t)\; p(x_t \mid x_{t-1}, u_t)\; p(x_{0:t-1}, m \mid z_{1:t-1}, u_{1:t-1}, c_{1:t-1}).$$

(I guess we just apply the same two conditional-independence assumptions at every timestep, so the pattern generalizes.)
Step 2: Posterior at $t$ with the previous state marginalized out.

By substituting the equation from before and integrating over $x_{t-1}$:

$$p(x_t, m \mid z_{1:t}, u_{1:t}, c_{1:t}) = \eta\; p(z_t \mid x_t, m, c_t) \int p(x_t \mid x_{t-1}, u_t)\; p(x_{t-1}, m \mid z_{1:t-1}, u_{1:t-1}, c_{1:t-1})\; dx_{t-1}.$$

And if we unroll one more step, we can see the same recursive pattern holds for every $t$.
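A discrete (histogram) version of this predict/update cycle makes the structure concrete: the integral over $x_{t-1}$ becomes a sum, and $\eta$ is just renormalization. A toy 1-D sketch; the door positions and noise weights are made up for illustration:

```python
import numpy as np

# Histogram (grid) Bayes filter over 10 cells: the integral over the previous
# state x_{t-1} becomes a sum, and eta is just renormalization.
n = 10
belief = np.full(n, 1.0 / n)        # flat prior over position

# Measurement model p(z="door" | x): doors sit at cells 0 and 3 (invented map).
door_lik = np.full(n, 0.1)
door_lik[[0, 3]] = 0.9

def update(belief, likelihood):
    """Measurement update: multiply by the likelihood, then normalize (eta)."""
    posterior = likelihood * belief
    return posterior / posterior.sum()

def predict(belief, u):
    """Motion update: shift right by u cells with 10% under/overshoot."""
    new = np.zeros(n)
    for i, p in enumerate(belief):
        for step, w in [(u - 1, 0.1), (u, 0.8), (u + 1, 0.1)]:
            new[(i + step) % n] += p * w    # the "integral" over x_{t-1}
    return new

belief = update(belief, door_lik)   # z_1: we see a door -> mass at cells 0 and 3
belief = predict(belief, 3)         # u_2: move three cells to the right
belief = update(belief, door_lik)   # z_2: a door again -> only cell 3 fits both
print(belief.argmax())              # -> 3
```

Only the hypothesis "started at door 0, moved to door 3" is consistent with both observations, so the belief concentrates there.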
(b)
As discussed in SLAM, the MAP estimate maximizes the posterior:

$$\hat{x}_{0:t}, \hat{m} = \arg\max_{x_{0:t},\, m}\; p(x_{0:t}, m \mid z_{1:t}, u_{1:t}, c_{1:t}).$$
Step 1: To compute the errors, we go into log space, which turns products into sums and makes computations much easier. This is where we shift from maximizing the probability to minimizing a cost/penalty:

$$\hat{x}_{0:t}, \hat{m} = \arg\min_{x_{0:t},\, m}\; \bigl( -\log p(x_{0:t}, m \mid z_{1:t}, u_{1:t}, c_{1:t}) \bigr).$$

Using the factorization above:

$$-\log p(x_{0:t}, m \mid z_{1:t}, u_{1:t}, c_{1:t}) = \text{const} - \log p(x_0) - \sum_{\tau=1}^{t} \log p(x_\tau \mid x_{\tau-1}, u_\tau) - \sum_{\tau=1}^{t} \log p(z_\tau \mid x_\tau, m, c_\tau).$$
Step 2: Assume Gaussian noise for the motion and measurement models. This step converts distances into likelihoods, and the inverse of the covariance matrix is the information matrix (how much we trust each constraint):

$$p(x_\tau \mid x_{\tau-1}, u_\tau) \propto \exp\!\left( -\tfrac{1}{2} \| x_\tau - f(x_{\tau-1}, u_\tau) \|^2_{\Sigma_u} \right), \qquad p(z_\tau \mid x_\tau, m, c_\tau) \propto \exp\!\left( -\tfrac{1}{2} \| z_\tau - h(x_\tau, m) \|^2_{\Sigma_z} \right),$$

and a Gaussian prior

$$p(x_0) \propto \exp\!\left( -\tfrac{1}{2} \| x_0 - \mu_0 \|^2_{\Sigma_0} \right).$$

Reminder that $\| e \|^2_{\Sigma} := e^\top \Sigma^{-1} e$ (the squared Mahalanobis distance).
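A quick numeric sanity check of the Mahalanobis norm (the residual and covariances are arbitrary values I picked): the same residual costs much more under a confident (small-covariance) model than under a noisy one.

```python
import numpy as np

# ||e||^2_Sigma = e^T Sigma^{-1} e: large variance (high uncertainty)
# shrinks the penalty for the same residual e.
e = np.array([1.0, 2.0])
Sigma_tight = np.diag([0.1, 0.1])   # confident sensor -> big penalty
Sigma_loose = np.diag([10.0, 10.0]) # noisy sensor -> small penalty

def mahal_sq(e, Sigma):
    return float(e @ np.linalg.inv(Sigma) @ e)

print(mahal_sq(e, Sigma_tight))  # 50.0  (= 5 / 0.1)
print(mahal_sq(e, Sigma_loose))  # 0.5   (= 5 / 10)
```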
Step 3: Substitute the Gaussian forms into the negative log-posterior. Since the natural logarithm is the inverse of the exponential function, they cancel each other out.

We obtain

$$-\log p(x_{0:t}, m \mid z_{1:t}, u_{1:t}, c_{1:t}) = \text{const} + \tfrac{1}{2} \| x_0 - \mu_0 \|^2_{\Sigma_0} + \tfrac{1}{2} \sum_{\tau=1}^{t} \| x_\tau - f(x_{\tau-1}, u_\tau) \|^2_{\Sigma_u} + \tfrac{1}{2} \sum_{\tau=1}^{t} \| z_\tau - h(x_\tau, m) \|^2_{\Sigma_z}.$$

As we want to minimize these differences, we drop the constant term (and the factor $\tfrac{1}{2}$, which does not change the minimizer) and obtain what we were after:

$$\hat{x}_{0:t}, \hat{m} = \arg\min_{x_{0:t},\, m} \left[ \| x_0 - \mu_0 \|^2_{\Sigma_0} + \sum_{\tau=1}^{t} \| x_\tau - f(x_{\tau-1}, u_\tau) \|^2_{\Sigma_u} + \sum_{\tau=1}^{t} \| z_\tau - h(x_\tau, m) \|^2_{\Sigma_z} \right].$$
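This objective can be solved directly in a toy setting. The sketch below is my own 1-D example with linear models ($f(x, u) = x + u$, $h(x, m) = m - x$), so the weighted least-squares problem is solved exactly in one shot instead of needing Gauss-Newton iterations; all numbers are invented:

```python
import numpy as np

# Toy 1-D Graph-SLAM: unknowns theta = [x0, x1, x2, m] (three poses + one landmark).
mu0, sig0 = 0.0, 0.01          # prior on x0 (very confident)
u = [1.0, 1.0]                 # odometry: true motion is +1 per step
sig_u, sig_z = 0.1, 0.05       # motion / measurement std deviations
z = [5.1, 3.95, 3.02]          # noisy range to a landmark at m = 5 from x = 0, 1, 2

rows, rhs = [], []
def add(row, b, sigma):
    """One residual of the objective, as a row of A theta = b, weighted by 1/sigma."""
    rows.append(np.asarray(row, dtype=float) / sigma)
    rhs.append(b / sigma)

add([1, 0, 0, 0], mu0, sig0)      # prior:  ||x0 - mu0||^2_{Sigma_0}
add([-1, 1, 0, 0], u[0], sig_u)   # motion: ||x1 - (x0 + u1)||^2_{Sigma_u}
add([0, -1, 1, 0], u[1], sig_u)   # motion: ||x2 - (x1 + u2)||^2_{Sigma_u}
add([-1, 0, 0, 1], z[0], sig_z)   # meas:   ||z0 - (m - x0)||^2_{Sigma_z}
add([0, -1, 0, 1], z[1], sig_z)
add([0, 0, -1, 1], z[2], sig_z)

A, b = np.vstack(rows), np.asarray(rhs)
theta, *_ = np.linalg.lstsq(A, b, rcond=None)
x0, x1, x2, m = theta
print(theta)   # poses near 0, 1, 2 and landmark near 5
```

Dividing each row by its sigma is exactly the Mahalanobis weighting from Step 2: confident constraints (small sigma) dominate the solution.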