====================================================
Lecture
====================================================

.. raw:: latex

   \setcounter{figure}{0}

Prerequisites
====================================================

Ensure you have followed the simulation instructions and have
everything set up before running any code in this lecture.


.. dropdown:: Before You Start
   :open:

   - Do a ``git pull`` (native installation only).
   - Recompile Lecture 11 packages.
   - Source ``.bashrc``.
   - Check that **ROBOT_MODEL** is set to *rosbot*:

   .. code-block:: console

      echo $ROBOT_MODEL


Pose
====================================================

A **pose** describes the position and orientation of an object in 3D
space. Understanding how to represent orientation is fundamental to
robotics: every transform, every sensor reading, and every control
command depends on it.

- **Position**: where the object is located (a 3D vector
  :math:`[x, y, z]`).
- **Orientation**: how the object is rotated or tilted relative to a
  reference frame.

.. note::

   ROS 2 messages represent pose as ``geometry_msgs/msg/Pose`` which
   stores position as a ``Point`` and orientation as a ``Quaternion``.

.. admonition:: Resources
   :class: resources

   - `Robotics, Vision and Control (Corke)
     <https://link.springer.com/book/10.1007/978-3-031-07262-8>`_
   - `Modern Robotics (Lynch & Park)
     <http://hades.mech.northwestern.edu/index.php/Modern_Robotics>`_


Position
----------------------------------------------------

The position of an object is described by a 3D vector:

.. math::

   \mathbf{p} = \begin{pmatrix} x \\ y \\ z \end{pmatrix} \quad [\text{metres}]

- :math:`x`: displacement along the forward axis.
- :math:`y`: displacement along the left axis.
- :math:`z`: displacement along the up axis.

.. note::

   Position is unambiguous: three numbers, three axes, one convention.
   Orientation is far more subtle.


Orientation
----------------------------------------------------

Before publishing transforms, we need to understand how orientations are
represented. Orientation can be represented in several ways:

- Euler angles (roll, pitch, yaw)
- Quaternions
- Rotation matrices
- Axis-angle representation

**Euler Angles**

Euler angles (Leonhard Euler) are a set of three angles that describe
the orientation of a rigid body with respect to a fixed coordinate
system.

- **Three Angles**: Euler angles use three sequential rotations to
  achieve any desired 3D orientation. These rotations are performed
  about the axes of a coordinate system.
- **Different Conventions**: There is not one single definition of Euler
  angles. The most common type in robotics is **Tait-Bryan angles
  (Cardan angles)** (e.g., x-y-z, y-z-x) where all three rotations are
  about different axes. Also known as *roll*, *pitch*, and *yaw*.

.. only:: html

   .. figure:: /_static/images/L11/euler_angles_light.png
      :alt: Rotation about the x-, y-, and z-axis
      :width: 60%
      :align: center
      :class: only-light

      Rotation about the :math:`x`-, :math:`y`-, and :math:`z`-axis.

   .. figure:: /_static/images/L11/euler_angles_dark.png
      :alt: Rotation about the x-, y-, and z-axis
      :width: 60%
      :align: center
      :class: only-dark

      Rotation about the :math:`x`-, :math:`y`-, and :math:`z`-axis.

- **Roll** (:math:`\phi`): rotation about the :math:`x`-axis (tilting
  side to side).
- **Pitch** (:math:`\theta`): rotation about the :math:`y`-axis
  (tilting forward/backward).
- **Yaw** (:math:`\psi`): rotation about the :math:`z`-axis (turning
  left/right).

**The Gimbal Lock Problem**

Despite their intuitiveness, Euler angles suffer from a critical flaw
called **gimbal lock**: when two rotation axes align (e.g., pitch
reaches :math:`\pm 90°`), one degree of freedom is lost and certain
orientations become unreachable through smooth rotation.

**Why this matters in robotics:**

- A robot arm approaching certain orientations may experience erratic
  behavior as the controller tries to compensate for the singularity.
- Interpolating between two orientations using Euler angles can produce
  unexpected paths through gimbal-lock configurations.
- Small orientation changes near the singularity require enormous
  changes in the angle values, causing numerical instability.

.. note::

   This is why ROS 2, game engines, and aerospace systems use
   **quaternions** for representing orientations. Quaternions are
   singularity-free. See `Appendix A: Gimbal Lock`_ for the
   mathematical proof.


Quaternions
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^

**The Intuition**

Before the formal math, here is the key idea:

.. note::

   A quaternion encodes a rotation as **an axis** (which direction to
   rotate around) and **an angle** (how far to rotate). Think of it as
   pointing your thumb along an axis and curling your fingers by some
   angle.

Given a unit axis :math:`\mathbf{u} = (u_x, u_y, u_z)` and a rotation
angle :math:`\theta`:

.. math::

   q = \underbrace{\cos\!\left(\frac{\theta}{2}\right)}_{\text{scalar part } w}
     + \underbrace{\sin\!\left(\frac{\theta}{2}\right)}_{\text{scales the axis}}
       \left(u_x\, i + u_y\, j + u_z\, k\right)

**Reading a quaternion:**

- The **scalar part** :math:`w` tells you *how much* rotation: :math:`w = 1`
  means no rotation; :math:`w = 0` means a :math:`180°` rotation.
- The **vector part** :math:`(x, y, z)` tells you *which direction* the
  rotation axis points.
- The vector's magnitude encodes the rotation amount (along with
  :math:`w`).

**Formal Definition**

A *quaternion* (William Rowan Hamilton, 1843) extends complex numbers to
four dimensions. It consists of four components:

.. math::

   q = w + x\,i + y\,j + z\,k

Or more compactly: :math:`q = (w, x, y, z)`.

- :math:`w` is the **scalar part** (real number).
- :math:`x`, :math:`y`, :math:`z` are the components of the **vector
  part**.
- :math:`i`, :math:`j`, :math:`k` are the imaginary basis units,
  satisfying: :math:`i^2 = j^2 = k^2 = ijk = -1`.

**Unit Quaternions**

For representing rotations, we use **unit quaternions** (magnitude equal
to 1):

.. math::

   |q| = \sqrt{w^2 + x^2 + y^2 + z^2} = 1

A quaternion :math:`q = w + xi + yj + zk` is just a point in 4D space.
Most of those points have nothing to do with rotations. The constraint
:math:`|q| = 1` restricts you to the surface of a 4D unit sphere, called
:math:`S^3`. It turns out that this particular surface has exactly the
right algebraic structure to encode every possible 3D rotation, with no
redundancy except the unavoidable double cover (:math:`q` and :math:`-q`
map to the same rotation).

.. warning::

   **Double cover**: :math:`q` and :math:`-q` represent the *same*
   rotation. For example, :math:`(w, x, y, z) = (0.707, 0.707, 0, 0)`
   and :math:`(-0.707, -0.707, 0, 0)` both encode a :math:`90°`
   rotation about the :math:`x`-axis. This can cause sign-flip artifacts
   when interpolating or comparing quaternions.

**Common Quaternion Values**

Building intuition through examples:

.. list-table::
   :header-rows: 1
   :widths: 30 35 35
   :class: compact-table

   * - Rotation
     - Quaternion :math:`(w,x,y,z)`
     - Notes
   * - No rotation (identity)
     - :math:`(1,\; 0,\; 0,\; 0)`
     - :math:`w=1`, no axis needed
   * - :math:`90°` about Z (yaw left)
     - :math:`(0.707,\; 0,\; 0,\; 0.707)`
     - Only :math:`z` component
   * - :math:`180°` about Z (face backward)
     - :math:`(0,\; 0,\; 0,\; 1)`
     - :math:`w=0 \Rightarrow` half turn
   * - :math:`90°` about X (roll right)
     - :math:`(0.707,\; 0.707,\; 0,\; 0)`
     - Only :math:`x` component
   * - :math:`45°` about Y (pitch up)
     - :math:`(0.924,\; 0,\; 0.383,\; 0)`
     - Small angle :math:`\Rightarrow w` near 1

.. note::

   Notice the pattern: the vector part :math:`(x,y,z)` points along the
   rotation axis, and :math:`w` closer to 1 means a smaller rotation
   angle. When rotating about a single world axis (X, Y, or Z), only one
   vector component is non-zero. For an arbitrary axis, all three can be
   non-zero simultaneously.

**Axis-Angle to Quaternion Conversion**

Given a unit axis :math:`\mathbf{u} = (u_x, u_y, u_z)` and rotation
angle :math:`\theta` (in radians):

.. math::

   q = \cos\!\left(\frac{\theta}{2}\right)
     + \sin\!\left(\frac{\theta}{2}\right)(u_x\, i + u_y\, j + u_z\, k)

The scalar part is :math:`w = \cos(\theta/2)` and the vector part is:

.. math::

   (x, y, z) = \left(u_x \sin\!\left(\frac{\theta}{2}\right),\;
   u_y \sin\!\left(\frac{\theta}{2}\right),\;
   u_z \sin\!\left(\frac{\theta}{2}\right)\right)

.. note::

   Why :math:`\theta/2` and not :math:`\theta`? This half-angle encoding
   is what makes quaternion multiplication correctly compose rotations.
   It also ensures :math:`q` and :math:`-q` map to the same rotation
   (the double cover property).

**Rotating Vectors with Quaternions**

To rotate a 3D vector :math:`\mathbf{v}` using a unit quaternion
:math:`q`, represent the vector as a "pure" quaternion (with a scalar
part of zero):
:math:`p = 0 + v_x\, i + v_y\, j + v_z\, k`. The rotated vector
:math:`\mathbf{v}'` (as a pure quaternion :math:`p'`) is:

.. math::

   p' = q\,p\,q^{-1}

For a unit quaternion, the inverse is simply its conjugate:
:math:`q^{-1} = w - x\,i - y\,j - z\,k`.

**Composition of Rotations**

Multiple rotations can be combined by multiplying their quaternions.

- :math:`q_1` represents a rotation, :math:`q_2` represents another
  rotation.
- :math:`q_{\text{combined}} = q_2 \otimes q_1` represents the combined
  rotation (:math:`q_1` applied first, then :math:`q_2`).

.. warning::

   Quaternion multiplication is **not commutative**:
   :math:`q_1 \otimes q_2 \neq q_2 \otimes q_1`. Choosing the wrong
   order will produce an incorrect result!

**Quaternion Multiplication Order**

The order of quaternion multiplication depends on the **frame of
reference** for the rotation.

.. list-table::
   :header-rows: 1
   :widths: 20 40 40
   :class: compact-table

   * - Frame
     - Description
     - Formula
   * - World/Fixed
     - Rotate about a global axis
     - :math:`O_{\text{final}} = O_{\text{rot}} \otimes O_{\text{init}}`
   * - Body/Local
     - Rotate about object's own axis
     - :math:`O_{\text{final}} = O_{\text{init}} \otimes O_{\text{rot}}`

**World Frame Rotation**

- Axis is fixed in space.
- "Rotate about the global x-axis."
- Independent of object's current orientation.

**Body Frame Rotation**

- Axis moves with the object.
- "Roll the aircraft" (about its nose-to-tail axis).
- Depends on object's current orientation.

.. note::

   In aviation and robotics, body-frame rotations are more common. A
   pilot's roll, pitch, and yaw commands rotate about the aircraft's own
   axes, not the world axes.

.. admonition:: Exercise: Quaternion from Axis-Angle
   :class: example

   Calculate the rotation, expressed as a quaternion, resulting from a
   rotation of :math:`\alpha = \frac{\pi}{2}` radians about the
   :math:`x`-axis.

   .. only:: html

      .. figure:: /_static/images/L11/exercise_light.png
         :alt: A rotation of pi/2 radians about the x-axis
         :width: 50%
         :align: center
         :class: only-light

         A rotation of :math:`\alpha = \frac{\pi}{2}` radians about the
         :math:`x`-axis.

      .. figure:: /_static/images/L11/exercise_dark.png
         :alt: A rotation of pi/2 radians about the x-axis
         :width: 50%
         :align: center
         :class: only-dark

         A rotation of :math:`\alpha = \frac{\pi}{2}` radians about the
         :math:`x`-axis.

   1. **Identify the axis and angle**

      The rotation is about the :math:`x`-axis, so
      :math:`\mathbf{u} = (1, 0, 0)` and :math:`\alpha = \frac{\pi}{2}`.

   2. **Verify the axis is a unit vector**

      The axis-angle formula requires :math:`\|\mathbf{u}\| = 1`:

      .. math::

         \|\mathbf{u}\| = \sqrt{u_x^2 + u_y^2 + u_z^2} = \sqrt{1^2 + 0^2 + 0^2} = 1 \quad \checkmark

   3. **Apply the axis-angle formula**

      .. math::

         q = \cos\!\left(\frac{\alpha}{2}\right)
           + \sin\!\left(\frac{\alpha}{2}\right)(u_x\, i + u_y\, j + u_z\, k)
           = \cos\!\left(\frac{\pi}{4}\right)
           + \sin\!\left(\frac{\pi}{4}\right)(1 \cdot i + 0 \cdot j + 0 \cdot k)

   4. **Compute each component**

      - :math:`w = \cos\!\left(\frac{\alpha}{2}\right) = \cos\!\left(\frac{\pi}{4}\right) = \frac{\sqrt{2}}{2} \approx 0.707`
      - :math:`x = \sin\!\left(\frac{\alpha}{2}\right) \times u_x = \sin\!\left(\frac{\pi}{4}\right) \times 1 = \frac{\sqrt{2}}{2} \approx 0.707`
      - :math:`y = \sin\!\left(\frac{\alpha}{2}\right) \times u_y = \sin\!\left(\frac{\pi}{4}\right) \times 0 = 0.0`
      - :math:`z = \sin\!\left(\frac{\alpha}{2}\right) \times u_z = \sin\!\left(\frac{\pi}{4}\right) \times 0 = 0.0`

   5. **Resulting quaternion**

      .. math::

         q = (w, x, y, z) = \left(\frac{\sqrt{2}}{2},\, \frac{\sqrt{2}}{2},\, 0,\, 0\right)
           \approx (0.707,\, 0.707,\, 0.0,\, 0.0)

      **Verify**: Does this match the table from earlier? A :math:`90°`
      rotation about :math:`x` should have :math:`w \approx 0.707` and only
      the :math:`x` component non-zero.

   .. note::

      Remember the double cover: :math:`(-0.707, -0.707, 0, 0)` represents
      the *same* rotation.


Mobile Robot Control
====================================================

This course uses two Husarion ROSbot platforms, both supported by the
same `rosbot_ros <https://github.com/husarion/rosbot_ros>`_ ROS 2
driver.

.. tab-set::

    .. tab-item:: ROSbot 3

        .. only:: html

           .. figure:: /_static/images/L11/rosbot1_light.png
              :alt: ROSbot 3
              :width: 30%
              :align: center
              :class: only-light

           .. figure:: /_static/images/L11/rosbot1_dark.png
              :alt: ROSbot 3
              :width: 30%
              :align: center
              :class: only-dark

        - **Drive**: differential (4-wheel)
        - **SBC**: Raspberry Pi 5
        - **Sensors**: RPLiDAR, Luxonis OAK-D, IMU
        - **ROS 2 model**: ``rosbot``

    .. tab-item:: ROSbot XL

        .. only:: html

           .. figure:: /_static/images/L11/rosbot2_light.png
              :alt: ROSbot XL
              :width: 30%
              :align: center
              :class: only-light

           .. figure:: /_static/images/L11/rosbot2_dark.png
              :alt: ROSbot XL
              :width: 30%
              :align: center
              :class: only-dark

        - **Drive**: differential or mecanum (4-wheel)
        - **SBC**: Raspberry Pi 5 / Jetson Orin / NUC
        - **Sensors**: LiDAR, RGB-D camera, IMU
        - **ROS 2 model**: ``rosbot_xl``

.. code-block:: console

   ros2 launch rosbot_gazebo husarion_world.launch.py


Gazebo
----------------------------------------------------

**Gazebo** is an open-source 3D robot simulator. It provides a physics
engine, sensor simulation, and a world environment so that ROS 2 nodes
can be developed and tested without access to physical hardware.

.. note::

   This course uses **Gazebo Harmonic**, the version paired with ROS 2
   Jazzy.

Gazebo simulates the physical world so that ROS 2 nodes behave
identically whether they run against real hardware or a simulation.

- **Physics engine**: models rigid body dynamics, collisions, friction,
  and inertia.
- **Sensor simulation**: generates realistic LiDAR scans, camera images,
  and IMU readings with configurable noise.
- **ROS 2 bridge**: the ``ros_gz_bridge`` package forwards Gazebo topics
  to ROS 2 topics, making the simulation transparent to your nodes.
- **World files**: environments are defined in SDF format and can range
  from an empty plane to a full indoor building.

.. note::

   From your node's perspective, a simulated ROSbot and a real ROSbot
   look identical: the same topics, the same transforms, the same
   message types. The only difference is where the data originates.


RViz2
----------------------------------------------------

**RViz2** is the standard ROS 2 visualization tool. It does not simulate
anything: it subscribes to ROS 2 topics and renders the data in 3D,
giving you a live view of what the robot perceives and where it thinks
it is.

RViz2 is a **passive subscriber**: it reads data from running nodes and
renders it visually. It never publishes commands or modifies the robot's
state.

- Every visual element in RViz2 is called a **display**.
- Each display subscribes to one or more ROS 2 topics and renders the
  data in the 3D viewport.
- Displays can be added, removed, and configured at runtime without
  restarting RViz2.
- The **Fixed Frame** setting determines which coordinate frame the
  scene is rendered relative to. All other frames are drawn relative to
  it.

.. note::

   RViz2 configuration can be saved to a ``.rviz`` file and loaded at
   startup:

   .. code-block:: console

      rviz2 -d path/to/config.rviz

**Key Displays for This Course**

.. list-table::
   :header-rows: 1
   :widths: 20 30 50
   :class: compact-table

   * - Display
     - Topic
     - What it shows
   * - **TF**
     - ``/tf``, ``/tf_static``
     - All coordinate frames as colored axes.
   * - **RobotModel**
     - ``/robot_description``
     - The robot's URDF mesh rendered in 3D.
   * - **Odometry**
     - ``/odometry/filtered``
     - Robot pose history as a trail of arrows.
   * - **LaserScan**
     - ``/scan``
     - LiDAR point cloud as colored dots.
   * - **Image**
     - ``/oak/rgb/color``
     - RGB camera feed in a 2D overlay panel.
   * - **PointCloud2**
     - ``/oak/stereo/depth/points``
     - Stereo depth point cloud in 3D.
   * - **Imu**
     - ``/imu/data``
     - IMU orientation and acceleration arrows.
   * - **Axes**
     - (manual)
     - A single coordinate frame for reference.

.. note::

   Set **Fixed Frame** to ``odom`` when visualizing robot motion, and to
   ``map`` when working with a known map. If RViz2 shows a grey scene
   with no data, the Fixed Frame is likely set to a frame that does not
   yet exist in the TF tree.


Differential Drive Model
----------------------------------------------------

The differential drive model relates wheel velocities to the robot's
linear and angular velocity. Understanding this model is essential
before designing a controller.

**Linear Velocity** (:math:`v`): Rate of change of position along the
robot's heading direction.

.. math::

   v = \frac{ds}{dt} \quad [\text{m/s}]


**Angular Velocity** (:math:`\omega`): Rate of change of orientation
(heading angle).

.. math::

   \omega = \frac{d\theta}{dt} \quad [\text{rad/s}]

.. note::

   **Sign Convention**: :math:`v > 0` moves forward, :math:`v < 0`
   moves backward. :math:`\omega > 0` turns counterclockwise (left),
   :math:`\omega < 0` turns clockwise (right). Combining both produces
   curved trajectories.

**Kinematics**

Let :math:`v_L` and :math:`v_R` be the left and right wheel speeds,
:math:`L` the wheel separation, and :math:`r` the wheel radius.

From wheel speeds to body velocities:

.. math::

   v = \frac{r(v_R + v_L)}{2}, \quad
   \omega = \frac{r(v_R - v_L)}{L}

From body velocities to wheel speeds:

.. math::

   v_R = \frac{2v + \omega L}{2r}, \quad
   v_L = \frac{2v - \omega L}{2r}

- :math:`v`: linear velocity (m/s), sent as ``twist.linear.x``
- :math:`\omega`: angular velocity (rad/s), sent as ``twist.angular.z``
- Only ``linear.x`` and ``angular.z`` are used for ground robots.
- Other components (``linear.y``, ``angular.x``, ``angular.y``) are set
  to zero.

.. note::

   Although both ROSbots have four wheels, the two left wheels are
   driven together and the two right wheels are driven together. The
   kinematics therefore reduce to the standard two-variable differential
   drive model above.

**The** ``/cmd_vel`` **Topic**

Velocity commands are sent to the robot by publishing on ``/cmd_vel``.
In ROS 2 Jazzy the expected message type is
``geometry_msgs/msg/TwistStamped``.

.. code-block:: text

   std_msgs/Header header
       builtin_interfaces/Time stamp
           int32 sec
           uint32 nanosec
       string frame_id
   Twist twist
       Vector3  linear
           float64 x
           float64 y
           float64 z
       Vector3  angular
           float64 x
           float64 y
           float64 z

- For a differential drive robot, only ``twist.linear.x`` and
  ``twist.angular.z`` carry meaningful values.
- The ``header.stamp`` should be set to the current time so controllers
  can reject stale commands.
- The ``header.frame_id`` identifies the frame the twist is expressed in
  (usually ``base_link``).

**Velocity Command Reference**

.. list-table::
   :header-rows: 1
   :widths: 25 25 50
   :class: compact-table

   * - ``linear.x``
     - ``angular.z``
     - Motion
   * - :math:`> 0`
     - :math:`0`
     - Straight forward
   * - :math:`< 0`
     - :math:`0`
     - Straight backward
   * - :math:`0`
     - :math:`> 0`
     - Rotate left (counterclockwise)
   * - :math:`0`
     - :math:`< 0`
     - Rotate right (clockwise)
   * - :math:`> 0`
     - :math:`> 0`
     - Curve forward to the left
   * - :math:`> 0`
     - :math:`< 0`
     - Curve forward to the right
   * - :math:`< 0`
     - :math:`> 0`
     - Curve backward to the right
   * - :math:`< 0`
     - :math:`< 0`
     - Curve backward to the left

.. note::

   All other fields of ``geometry_msgs/Twist`` (``linear.y``,
   ``linear.z``, ``angular.x``, ``angular.y``) are always set to zero
   for a ground differential drive robot.


Odometry
----------------------------------------------------

**Odometry** is the process of estimating a robot's current pose
(position and orientation) by integrating motion data over time. It
answers the question: *where is the robot now, given where it started
and how it has moved?*

As the robot moves, its wheels rotate. Each wheel is equipped with a
**quadrature encoder** that counts the number of pulses per revolution.
By knowing the wheel radius and the number of pulses, the firmware
computes how far each wheel has traveled.

From those distances, the differential drive kinematics equations
compute:

- How far the robot has moved forward (:math:`\Delta s`).
- How much the robot has turned (:math:`\Delta \psi`).

These incremental changes are integrated over time to produce the
robot's estimated pose :math:`(x, y, \psi)` in the ``odom`` frame:

.. math::

   x(t) = x(t-1) + \Delta s \cos\psi, \quad
   y(t) = y(t-1) + \Delta s \sin\psi, \quad
   \psi(t) = \psi(t-1) + \Delta\psi

.. warning::

   Odometry **drifts** over time. Small errors in each step accumulate,
   so the estimated pose diverges from the true pose. This is why the
   ROSbot fuses wheel odometry with IMU data via an Extended Kalman
   Filter to produce ``odometry/filtered``.

**The** ``nav_msgs/msg/Odometry`` **Message**

The ``nav_msgs/msg/Odometry`` message bundles everything a consumer
needs to know about the robot's estimated motion:

- **header.frame_id**: the frame in which the pose is expressed
  (typically ``odom``).
- **child_frame_id**: the frame being described (typically
  ``base_link``).
- **pose.pose**: the estimated position :math:`(x, y, z)` and
  orientation (quaternion).
- **pose.covariance**: a :math:`6 \times 6` covariance matrix encoding
  the uncertainty in the pose estimate.
- **twist.twist**: the robot's estimated linear and angular velocity in
  the ``base_link`` frame.
- **twist.covariance**: uncertainty in the velocity estimate.

.. note::

   The ROSbot publishes two odometry topics: ``odometry/wheels``
   contains the raw wheel-encoder estimate, and ``odometry/filtered``
   contains the EKF-fused estimate that also incorporates IMU data.
   Always use ``odometry/filtered`` for control.

**Odometry in RViz2**

When you add an **Odometry** display in RViz2 and point it at
``odometry/filtered``, you will see a **red arrow** appear at the
robot's estimated position.

.. rubric:: Reading the Arrow

- **Arrow origin**: the robot's estimated :math:`(x, y)` position in
  the ``odom`` frame.
- **Arrow direction**: the robot's estimated heading :math:`\psi` (yaw).
  The arrow points in the direction the robot is facing.
- **Arrow length**: proportional to the robot's current linear speed
  :math:`v`. A stationary robot shows a very short arrow; a fast-moving
  robot shows a long one.
- **Keep last N**: RViz2 can retain the last :math:`N` arrows, leaving a
  trail that shows the robot's path over time.

.. note::

   If the arrow does not appear, check that the **Fixed Frame** is set
   to ``odom`` and that the ``odometry/filtered`` topic is being
   published. You can verify with:

   .. code-block:: console

      ros2 topic echo odometry/filtered --once

.. admonition:: Demonstration
   :class: demonstration

   - Review ``random_mover_demo.py`` from ``robot_control_demo``.
   - Start the environment:

     .. code-block:: console

        ros2 launch rosbot_gazebo empty_world.launch.py

   - Randomly move the robot:

     .. code-block:: console

        ros2 run robot_control_demo random_mover


Proportional Controller
----------------------------------------------------

A **proportional (P) controller** produces a command proportional to the
error between the desired state and the current state. It is the
simplest feedback controller and a natural first step toward more
sophisticated designs.

**The P Controller Law**

.. math::

   u = K_p \cdot e

- :math:`e` is the **error**: the difference between the goal and the
  current state.
- :math:`K_p` is the **proportional gain**: a tuning constant that
  scales the response.
- :math:`u` is the **control output**: the velocity command sent to the
  robot.

.. list-table::
   :header-rows: 1
   :widths: 30 35 35
   :class: compact-table

   * - Error
     - Meaning
     - Command
   * - Distance to goal (:math:`\rho`)
     - How far away is the goal?
     - Linear velocity :math:`v`
   * - Heading error (:math:`\alpha`)
     - How misaligned is the robot?
     - Angular velocity :math:`\omega`

Mobile Robot Application
^^^^^^^^^^^^^^^^^^^^^^^^^

.. only:: html

   .. figure:: /_static/images/L11/proportional_pythagore_light.png
      :alt: Proportional controller geometry
      :width: 50%
      :align: center
      :class: only-light

   .. figure:: /_static/images/L11/proportional_pythagore_dark.png
      :alt: Proportional controller geometry
      :width: 50%
      :align: center
      :class: only-dark

.. card:: Position Error
   :class-card: sd-border-info

   .. math::

      \begin{align*}
      e_x &= x_g - x_r \\
      e_y &= y_g - y_r
      \end{align*}

.. card:: Orientation Error
   :class-card: sd-border-info

   .. math::

      e_\theta = \theta_g - \theta_r

   where :math:`\theta_g = \text{atan2}(e_y, e_x)`.

.. card:: Proportional Control Law
   :class-card: sd-border-info

   .. math::

      v = K_{pv} \cdot \sqrt{e_x^2 + e_y^2}

   .. math::

      \omega = K_{p\omega} \cdot e_\theta


.. note::

   As the robot approaches the goal, errors decrease, velocities
   decrease, and the robot stops at the goal.

.. admonition:: Example: Reaching a Goal 5 m Away
   :class: example

   Reach the goal, which is 5 m from the current position of the robot
   (straight line).

   .. only:: html

      .. figure:: /_static/images/L11/drive_light.png
         :alt: Robot must reach a goal 5 m ahead
         :width: 50%
         :align: center
         :class: only-light

      .. figure:: /_static/images/L11/drive_dark.png
         :alt: Robot must reach a goal 5 m ahead
         :width: 50%
         :align: center
         :class: only-dark

   We will assume a value for the proportional gain, e.g.,
   :math:`K_{pv}=0.5\,\text{s}^{-1}`. The units ensure that when
   multiplied by distance in meters, we get velocity in m/s.


.. admonition:: Step-by-Step Example: Reaching a Goal 5 m Away
   :class: example

   Assume :math:`K_{pv} = 0.5\,\text{s}^{-1}`.

   .. only:: html

      .. figure:: /_static/images/L11/proportional_example_light.png
         :alt: Proportional controller step-by-step example
         :width: 80%
         :align: center
         :class: only-light

      .. figure:: /_static/images/L11/proportional_example_dark.png
         :alt: Proportional controller step-by-step example
         :width: 80%
         :align: center
         :class: only-dark

   - **At** :math:`t_0`: error :math:`= 5` m :math:`\rightarrow`
     :math:`u(t) = 0.5 \times 5 = 2.5` m/s :math:`\rightarrow` distance
     traveled :math:`\approx 2.4` m
   - **At** :math:`t_1`: error :math:`= 2.6` m :math:`\rightarrow`
     :math:`u(t) = 0.5 \times 2.6 = 1.3` m/s :math:`\rightarrow` distance
     traveled :math:`\approx 3.6` m
   - **At** :math:`t_2`: error :math:`= 1.4` m :math:`\rightarrow`
     :math:`u(t) = 0.5 \times 1.4 = 0.7` m/s :math:`\rightarrow` distance
     traveled :math:`\approx 4.3` m
   - ... continues until the error falls below the tolerance.

   As the robot approaches the goal, errors decrease, velocities
   decrease, and the robot stops at the goal. This is the essence of
   proportional control.

**Limitations of a Pure P Controller**

- **Steady-state error**: the robot may stop slightly before the goal if
  the gain is too low, or oscillate around it if the gain is too high.
- **No integral term**: persistent disturbances (e.g., wheel slip,
  uneven terrain) accumulate and are never corrected.
- **No derivative term**: the controller does not anticipate rapid
  changes in error, which can cause overshoot.
- **Heading-only correction**: this design corrects heading and distance
  independently. For curved paths, a PID or pure pursuit controller is
  more appropriate.

.. note::

   A P controller is an excellent starting point for learning
   closed-loop control. PID and model-predictive controllers build
   directly on the same error-signal concept.

**Proportional Control for Angle Reaching**

The same proportional control reasoning applies to reaching a certain
orientation (heading angle).

- Instead of distance error, use angular error:
  :math:`e_\psi = \psi_{\text{goal}} - \psi_{\text{robot}}`
- The control output is angular velocity:
  :math:`\omega = K_{p\omega} \cdot e_\psi`
- As the heading error decreases, the angular velocity decreases, and
  the robot settles at the desired orientation.

.. note::

   In the full go-to-goal controller, both linear and angular
   proportional control work simultaneously: the robot adjusts its
   heading toward the goal while moving forward, resulting in a smooth
   curved trajectory.

.. admonition:: Demonstration
   :class: demonstration

   - Review ``p_controller_demo.py`` from ``robot_control_demo``.
   - Start the environment:

     .. code-block:: console

        ros2 launch rosbot_gazebo empty_world.launch.py

   - Reach a goal:

     .. code-block:: console

        ros2 launch robot_control_demo p_controller.launch.py

   - Observe in RViz2: add an **Odometry** display and watch the robot's
     path.

.. tip::

   **Experiment**: Increase ``k_rho`` to 1.0. What changes? Now decrease
   it to 0.1. What happens near the goal? What value gives the best
   trade-off between speed and precision?


Coordinate Frames
====================================================

A **coordinate frame** (or reference frame) is a named origin with three
orthogonal axes used to express positions and orientations. In robotics,
multiple frames are needed simultaneously: the world has its own frame,
the robot body has its own frame, and each sensor has its own frame.

.. admonition:: Resources
   :class: resources

   - `About TF2
     <https://docs.ros.org/en/jazzy/Concepts/Intermediate/About-Tf2.html>`_
   - `Introduction to TF2
     <https://docs.ros.org/en/jazzy/Tutorials/Intermediate/Tf2/Introduction-To-Tf2.html>`_
   - `REP 105: Coordinate Frames for Mobile Platforms
     <https://wiki.ros.org/REP/0000?action=show&redirect=REP+105>`_


Why Multiple Frames
----------------------------------------------------

- A LiDAR mounted on a robot reports obstacle distances relative to
  itself, not relative to the world.
- A camera reports pixel coordinates that must be related to the robot
  body and then to the world to be useful.
- A goal pose provided to a planner is in the world frame; the robot
  must know its own pose in that frame to compute the required motion.

.. note::

   Without a consistent way to track how all these frames relate to each
   other, a multi-sensor robot cannot function. TF2 provides this
   bookkeeping automatically.


Right-Hand Rule
----------------------------------------------------

ROS 2 uses the right-hand rule for coordinate frames:

- **X** axis: forward
- **Y** axis: left
- **Z** axis: up

.. only:: html

   .. figure:: /_static/images/L11/right-hand-rule-light.png
      :alt: ROS uses the right-hand rule
      :width: 80%
      :align: center
      :class: only-light

      ROS uses the right-hand rule.

   .. figure:: /_static/images/L11/right-hand-rule-dark.png
      :alt: ROS uses the right-hand rule
      :width: 80%
      :align: center
      :class: only-dark

      ROS uses the right-hand rule.

.. note::

   In ROS, the axes are color coded: :math:`x` is red, :math:`y` is
   green, and :math:`z` is blue.

**Why Are Frames Important?**

Imagine telling a robot to pick up an object. You might know the
object's location relative to a camera. However, the robot's arm
controller needs to know where to move its joints relative to its own
base. This is where frames come in.

Frames provide a consistent way to represent and reason about spatial
relationships between different entities in the robot's environment.

- **Sensor Fusion**: Data from different sensors (camera, LiDAR, IMU)
  are expressed in their own sensor-specific frames. To combine and
  interpret this data, we need to transform it into a common frame of
  reference.
- **Navigation**: For a robot to navigate effectively, it needs to know
  its pose (position and orientation) relative to a global map frame.
- **Manipulation**: Precise control of robot arms requires understanding
  the transformations between the robot's base frame, the end-effector
  frame, and the object frame.


Standard Mobile Robot Frames (REP 105)
----------------------------------------------------

.. list-table::
   :header-rows: 1
   :widths: 20 80
   :class: compact-table

   * - Frame
     - Description
   * - ``world``
     - Global inertial frame. Fixed. Used when a long-term, drift-free
       reference is required.
   * - ``map``
     - The robot's best estimate of its global position. May jump when
       the localization estimate is corrected (e.g., loop closure).
   * - ``odom``
     - Locally consistent, drift-prone frame computed from wheel
       odometry or IMU. Never jumps; drifts over time.
   * - ``base_link``
     - Rigidly attached to the robot body. Typically at the geometric
       center of the base, at ground level.
   * - ``base_footprint``
     - Projection of ``base_link`` onto the ground plane. Used when 2D
       reasoning is sufficient.
   * - Sensor frames
     - Attached to each sensor (e.g., ``lidar_link``,
       ``camera_link``). Defined by the URDF.


Transforms
----------------------------------------------------

A **transform** describes how to express the pose of one coordinate
frame relative to another. It answers the question: *if I know a point's
coordinates in frame A, what are its coordinates in frame B?*

Every transform between two frames in 3D space consists of two
components:

- **Translation**: a 3D vector :math:`\mathbf{t} = (t_x, t_y, t_z)`
  describing how far and in which direction the origin of frame :math:`B`
  is displaced from the origin of frame :math:`A`.
- **Rotation**: a description of how frame :math:`B`'s axes are oriented
  relative to frame :math:`A`'s axes (expressed as a quaternion in
  ROS 2).

In ROS 2, a transform between two frames is represented by the
``geometry_msgs/TransformStamped`` message:

.. code-block:: console

   ros2 interface show geometry_msgs/msg/TransformStamped

.. code-block:: text

   # This expresses a transform from coordinate frame header.frame_id
   # to the coordinate frame child_frame_id at the time of header.stamp
   #
   # This message is mostly used by the tf2 package.
   # See its documentation for more information.
   #
   # The child_frame_id is necessary in addition to the frame_id
   # in the Header to communicate the full reference for the transform
   # in a self contained message.

   # The frame id in the header is used as the reference frame of this transform.
   std_msgs/Header header
       builtin_interfaces/Time stamp
           int32 sec
           uint32 nanosec
       string frame_id

   # The frame id of the child frame to which this transform points.
   string child_frame_id

   # Translation and rotation in 3-dimensions of child_frame_id from header.frame_id.
   Transform transform
       Vector3 translation
           float64 x
           float64 y
           float64 z
       Quaternion rotation
           float64 x 0
           float64 y 0
           float64 z 0
           float64 w 1

.. card:: Frames and Transforms Are Inseparable
   :class-card: sd-border-secondary

   A coordinate frame in isolation has no spatial meaning: knowing that a
   frame exists tells you nothing about where it is or how it is oriented
   unless you also know its transform relative to some other frame.

   The only exception is the **root frame**:

   - Every transform tree has exactly one root, typically ``world``.
   - The root frame is defined by convention as the fixed global reference;
     it has no parent and therefore requires no transform.
   - Every other frame in the system must have a transform connecting it to
     its parent, and through that parent, ultimately to ``world``.

.. card:: Transforms Are Directional
   :class-card: sd-border-secondary

   A transform :math:`T^{A}_{B}` expresses frame :math:`B` *relative to*
   frame :math:`A`. Direction matters.

   - :math:`T^{A}_{B}` tells you where :math:`B` is when you are standing
     in :math:`A`.
   - :math:`T^{B}_{A}` is the **inverse transform**: it tells you where
     :math:`A` is when you are standing in :math:`B`.
   - Inverting a transform reverses both the rotation and the translation:
     you cannot simply negate the translation vector.

.. card:: Transforming a Point
   :class-card: sd-border-secondary

   Given a point :math:`\mathbf{p}_B` expressed in frame :math:`B`, the
   same point expressed in frame :math:`A` is:

   .. math::

      \mathbf{p}_A = R^{A}_{B}\, \mathbf{p}_B + \mathbf{t}^{A}_{B}

   where :math:`R^{A}_{B}` is the :math:`3 \times 3` rotation matrix that
   rotates vectors from frame :math:`B` into frame :math:`A`, and
   :math:`\mathbf{t}^{A}_{B}` is the translation vector of the origin of
   :math:`B` expressed in :math:`A`.

.. card:: Chaining Transforms
   :class-card: sd-border-secondary

   Transforms can be **chained**: if you know :math:`T^{A}_{B}` and
   :math:`T^{B}_{C}`, you can compute :math:`T^{A}_{C}` directly using
   :math:`4 \times 4` homogeneous transformation matrices:

   .. math::

      T^{A}_{C} = T^{A}_{B} \times T^{B}_{C}

   The inner indices cancel, which is a useful consistency check: :math:`B`
   appears as a subscript on the left and a superscript on the right, and
   drops out of the result.


TF2
----------------------------------------------------

**TF2** (Transform Library, version 2) is the ROS 2 library responsible
for tracking coordinate frames over time. Rather than requiring every
node to manually compute and share transforms, TF2 provides a shared,
system-wide bookkeeping service that any node can publish to or query
from.

**What TF2 Does**

- **Stores** all transforms broadcast by any node in a time-stamped
  buffer.
- **Answers** queries of the form: "what is the transform from frame
  :math:`A` to frame :math:`B` at time :math:`t`?"
- **Interpolates** between stored transforms to give the best estimate
  at any requested timestamp.
- **Chains** intermediate transforms automatically so callers never need
  to compose them by hand.

.. note::

   TF2 is not a node you run explicitly. It is a library: any node that
   creates a ``TransformBroadcaster`` or ``TransformListener`` is
   participating in the TF2 system automatically.

**The TF2 Transform Tree**

.. only:: html

   .. figure:: /_static/images/L11/tf_tree_light.png
      :alt: TF2 transform tree
      :width: 40%
      :align: center
      :class: only-light

      TF2 transform tree.

   .. figure:: /_static/images/L11/tf_tree_dark.png
      :alt: TF2 transform tree
      :width: 40%
      :align: center
      :class: only-dark

      TF2 transform tree.

- TF2 maintains a **tree** of transforms, where each edge represents
  the transform from a parent frame to a child frame.
- The tree is **directed**: every child has exactly one parent, but a
  parent may have many children.
- To find the transform between any two frames, TF2 traverses the tree
  from one frame up to the common ancestor and back down to the other.

**TF2 CLI Tools**

.. code-block:: console

   ros2 run tf2_tools view_frames

- Saves a PDF of the current transform tree to
  ``frames_<timestamp>.pdf``.
- Useful for verifying that all expected frames are connected.

.. code-block:: console

   ros2 run tf2_ros tf2_echo <source_frame> <target_frame>

- Continuously prints the transform from ``source_frame`` to
  ``target_frame``.

.. code-block:: console

   ros2 topic echo /tf

.. code-block:: console

   ros2 topic echo /tf_static

- Raw transform messages published on these two topics by all
  broadcasters.
- ``/tf`` receives dynamic (time-varying) transforms.
- ``/tf_static`` receives static (time-invariant) transforms, published
  once and latched.

.. code-block:: console

   ros2 run rqt_tf_tree rqt_tf_tree --force-discover

Provides the TF2 tree in a graphical user interface.

.. admonition:: Demonstration
   :class: demonstration

   - Start empty environment:

     .. code-block:: console

        ros2 launch rosbot_gazebo empty_world.launch.py

   - Try each CLI tool.


Static Transforms
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^

A **static transform** describes the fixed geometric relationship
between two frames that does not change over time, for example the
offset from ``base_link`` to a rigidly mounted sensor such as a LiDAR
or camera.

.. admonition:: Resources
   :class: resources

   - `Writing a TF2 Static Broadcaster (Python)
     <https://docs.ros.org/en/jazzy/Tutorials/Intermediate/Tf2/Writing-A-Tf2-Static-Broadcaster-Py.html>`_

**CLI: static_transform_publisher**

ROS 2 ships a convenience node for publishing a single static transform
from the command line.

.. code-block:: console

   ros2 run tf2_ros static_transform_publisher --help

.. admonition:: Demonstration
   :class: demonstration

   Publish a transform from ``body_link`` to ``lidar_link``:

   - T1:

     .. code-block:: console

        ros2 launch rosbot_gazebo empty_world.launch.py

   - T2:

     .. code-block:: console

        ros2 run tf2_ros static_transform_publisher --x 0.2 --y 0.0 --z 0.15 \
          --frame-id body_link --child-frame-id lidar_link

   - T3:

     .. code-block:: console

        ros2 run tf2_ros tf2_echo body_link lidar_link

   - T4:

     .. code-block:: console

        ros2 run rqt_tf_tree rqt_tf_tree --force-discover

.. note::

   This is useful for quick debugging. For production, prefer embedding
   the transform in the robot URDF or broadcasting it from a node.

**Scenario: ArUco Marker Detection (Static)**

- **Goal:** detect an ArUco marker in a camera image, recover its 3D
  pose from the 2D corners using the camera intrinsics, and publish that
  pose *once* as a static TF from the camera optical frame to a child
  frame ``static_aruco_box``.
- **Why static?** The marker (and the overhead camera) do not move, so
  one latched transform on ``/tf_static`` is enough (no need to
  re-broadcast every frame).
- **Inputs needed:**

  - the color image (2D pixel corners of the marker),
  - ``CameraInfo`` (intrinsics :math:`K` and distortion :math:`d`),
  - the known physical marker size (meters).

- **Approach:** PnP (see `Appendix B: Perspective-n-Point (PnP)`_).
- **Implementation:** ``static_aruco_detector.py``
- **Environment:**

  .. code-block:: console

     ros2 launch rosbot_gazebo aruco_box_camera_world.launch.py

.. card:: Step 1 --- Subscribe to the camera and prepare the broadcaster
   :class-card: sd-border-secondary

   We need image pixels, intrinsics, and a latched static TF publisher.

   .. code-block:: python

      # Image pixels
      self._image_sub = self.create_subscription(
          Image, self._camera_image_topic,
          self._image_callback, qos_profile_sensor_data,
      )
      # Intrinsics
      self._info_sub = self.create_subscription(
          CameraInfo, self._camera_info_topic,
          self._camera_info_callback, qos_profile_sensor_data,
      )
      # Broadcaster
      self._static_tf_broadcaster = StaticTransformBroadcaster(self)

.. card:: Step 2 --- Cache the intrinsics once
   :class-card: sd-border-secondary

   Distortion and :math:`K` never change, so read them a single time and
   drop the subscription.

   .. code-block:: python

      def _camera_info_callback(self, msg: CameraInfo):
          self._camera_matrix = np.array(msg.k, dtype=np.float64).reshape(3, 3)
          self._dist_coeffs  = np.array(msg.d, dtype=np.float64)
          self.destroy_subscription(self._info_sub)

.. card:: Step 3 --- Describe the marker in its own frame
   :class-card: sd-border-secondary

   Four corners of a square of side ``marker_size``, centered at the
   origin, lying in the :math:`z=0` plane. These are the *object points*
   PnP will try to match against the detected pixels.

   .. code-block:: python

      half = self._marker_size / 2.0
      self._marker_object_points = np.array(
          [[-half,  half, 0.0], [ half,  half, 0.0],
           [ half, -half, 0.0], [-half, -half, 0.0]],
          dtype=np.float32,
      )

.. card:: Step 4 --- Detect the marker in the image
   :class-card: sd-border-secondary

   OpenCV's ``ArucoDetector`` returns the four pixel corners of every
   marker it finds.

   .. code-block:: python

      corners, ids, _ = self._detector.detectMarkers(cv_image)
      if ids is None or len(ids) == 0:
          return
      image_points = corners[0].reshape(-1, 2).astype(np.float32)

.. card:: Step 5 --- Solve PnP: 2D corners + 3D model -> pose
   :class-card: sd-border-secondary

   Given the known square (object points), the detected pixels (image
   points), and the intrinsics, ``solvePnP`` returns ``rvec`` (rotation as
   an axis-angle vector) and ``tvec`` (translation), i.e. the pose of the
   marker in the *camera optical frame*.

   .. code-block:: python

      success, rvec, tvec = cv2.solvePnP(
          self._marker_object_points, image_points,
          self._camera_matrix, self._dist_coeffs,
          flags=cv2.SOLVEPNP_IPPE_SQUARE,
      )

.. card:: Step 6 --- Convert rvec to a quaternion
   :class-card: sd-border-secondary

   TF expects a quaternion, not an axis-angle vector.

   .. code-block:: python

      qx, qy, qz, qw = Rotation.from_rotvec(rvec.flatten()).as_quat()

.. card:: Step 7 --- Publish once as a static TF
   :class-card: sd-border-secondary

   Parent is the camera optical frame (taken from the image header); child
   is ``static_aruco_box``. Translation is ``tvec``, rotation is the
   quaternion from ``rvec``.

   .. code-block:: python

      t = TransformStamped()
      t.header.stamp     = self.get_clock().now().to_msg()
      t.header.frame_id  = image_header.frame_id  # camera optical frame
      t.child_frame_id   = self._child_frame_id   # "static_aruco_box"
      t.transform.translation.x = float(tvec[0])
      t.transform.translation.y = float(tvec[1])
      t.transform.translation.z = float(tvec[2])
      t.transform.rotation.x = float(qx)
      t.transform.rotation.y = float(qy)
      t.transform.rotation.z = float(qz)
      t.transform.rotation.w = float(qw)
      self._static_tf_broadcaster.sendTransform(t)

.. card:: Step 8 --- One-shot behavior
   :class-card: sd-border-secondary

   After a successful publish, flip a flag and drop the image subscription
   so no more frames are processed --- the static TF stays latched on
   ``/tf_static``.

   .. code-block:: python

      self._tf_published = True
      self.destroy_subscription(self._image_sub)

.. admonition:: Demonstration
   :class: demonstration

   - T1:

     .. code-block:: console

        ros2 launch rosbot_gazebo aruco_box_camera_world.launch.py

   - T2:

     .. code-block:: console

        ros2 run rqt_tf_tree rqt_tf_tree --force-discover

   - T3:

     .. code-block:: console

        ros2 launch frame_demo static_aruco_detector.launch.py

   - T2: Refresh ``rqt_tf_tree``.


Dynamic Transforms
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^

A **dynamic transform** describes a relationship between frames that
changes over time, most commonly the pose of the robot in the world
(``odom`` -> ``base_link``), which updates as the robot moves.

.. admonition:: Resources
   :class: resources

   - `Writing a TF2 Broadcaster (Python)
     <https://docs.ros.org/en/jazzy/Tutorials/Intermediate/Tf2/Writing-A-Tf2-Broadcaster-Py.html>`_
   - `Writing a TF2 Listener (Python)
     <https://docs.ros.org/en/jazzy/Tutorials/Intermediate/Tf2/Writing-A-Tf2-Listener-Py.html>`_

**Scenario: ArUco Marker Detection (Dynamic)**

- **Goal:** detect *every* ArUco marker visible in a live camera stream,
  recover each marker's 3D pose from the 2D corners using the camera
  intrinsics, and broadcast one TF per marker on *every frame* so the
  transforms follow the markers as they (or the camera) move.
- **Why dynamic?** The robot's camera can move relative to the markers,
  so the transform must be re-published continuously on ``/tf`` (not
  latched on ``/tf_static``).
- **Per-marker frames:** child frames are named
  ``aruco_marker_<id>`` so different markers produce distinct TF frames
  automatically.
- **Inputs needed:**

  - the color image stream (2D pixel corners, every frame),
  - ``CameraInfo`` (intrinsics :math:`K` and distortion :math:`d`,
    cached once),
  - the known physical marker size (meters).

- **Approach:** PnP (see `Appendix B: Perspective-n-Point (PnP)`_),
  applied per marker per frame.
- **Implementation:** ``dynamic_detector_demo.py``
- **Environment:**

  .. code-block:: console

     ros2 launch rosbot_gazebo aruco_box_world.launch.py

.. card:: Step 1 --- Subscribe to the camera, set up the broadcaster and debug publisher
   :class-card: sd-border-secondary

   We need image pixels, intrinsics, a *dynamic* TF broadcaster, and an
   annotated debug image for ``rqt_image_view``.

   .. code-block:: python

      # Image pixels + intrinsics
      self._image_sub = self.create_subscription(...)
      self._info_sub = self.create_subscription(...)
      # Annotated debug image (marker outlines + pose axes)
      self._debug_image_pub = self.create_publisher(
          Image, "aruco_detection_image", 10
      )
      # Dynamic TF broadcaster (publishes on /tf, not /tf_static)
      self._tf_broadcaster = TransformBroadcaster(self)

.. card:: Step 2 --- Cache the intrinsics once
   :class-card: sd-border-secondary

   Same as the static case: read :math:`K` and :math:`d` a single time,
   then drop the subscription.

   .. code-block:: python

      def _camera_info_callback(self, msg: CameraInfo):
          self._camera_matrix = np.array(msg.k, dtype=np.float64).reshape(3, 3)
          self._dist_coeffs  = np.array(msg.d, dtype=np.float64)
          self.destroy_subscription(self._info_sub)

.. card:: Step 3 --- Describe a marker in its own frame
   :class-card: sd-border-secondary

   Same object-point template as the static case: a square of side
   ``marker_size``, centered at the origin, in the :math:`z=0` plane.
   Corner order matches ``detectMarkers``: top-left, top-right,
   bottom-right, bottom-left.

   .. code-block:: python

      half = self._marker_size / 2.0
      self._marker_object_points = np.array(
          [[-half,  half, 0.0], [ half,  half, 0.0],
           [ half, -half, 0.0], [-half, -half, 0.0]],
          dtype=np.float32,
      )

.. card:: Step 4 --- Detect all markers in the current frame
   :class-card: sd-border-secondary

   ``detectMarkers`` returns *all* markers it sees; we will iterate over
   them.

   .. code-block:: python

      corners, ids, _ = self._detector.detectMarkers(cv_image)
      if ids is None or len(ids) == 0:
          return  # nothing to do for this frame

.. card:: Step 5 --- Solve PnP for each detected marker
   :class-card: sd-border-secondary

   Loop over every detection; each ``solvePnP`` call gives the pose of that
   marker in the camera optical frame.

   .. code-block:: python

      for i, marker_id in enumerate(ids.flatten()):
          image_points = corners[i].reshape(-1, 2).astype(np.float32)
          success, rvec, tvec = cv2.solvePnP(
              self._marker_object_points, image_points,
              self._camera_matrix, self._dist_coeffs,
              flags=cv2.SOLVEPNP_IPPE_SQUARE,
          )
          if not success:
              continue
          self._broadcast_marker_tf(msg.header, int(marker_id), rvec, tvec)

.. card:: Step 6 --- Convert rvec to a quaternion
   :class-card: sd-border-secondary

   TF expects a quaternion, not an axis-angle vector.

   .. code-block:: python

      qx, qy, qz, qw = Rotation.from_rotvec(rvec.flatten()).as_quat()

.. card:: Step 7 --- Broadcast one TF per marker, every frame
   :class-card: sd-border-secondary

   Parent frame comes from the image header (camera optical frame); child
   is ``aruco_marker_<id>``. The timestamp is copied from the image header
   so downstream TF consumers can interpolate correctly.

   .. code-block:: python

      t = TransformStamped()
      t.header.stamp     = image_header.stamp         # image time, not now()
      t.header.frame_id  = image_header.frame_id      # camera optical frame
      t.child_frame_id   = f"aruco_marker_{marker_id}"
      t.transform.translation.x = float(tvec[0])
      t.transform.translation.y = float(tvec[1])
      t.transform.translation.z = float(tvec[2])
      t.transform.rotation.x = float(qx)
      t.transform.rotation.y = float(qy)
      t.transform.rotation.z = float(qz)
      t.transform.rotation.w = float(qw)
      self._tf_broadcaster.sendTransform(t)

   .. note::

      Unlike the static case, there is no one-shot flag and no
      ``destroy_subscription``: the callback keeps running and publishes a
      fresh TF for every image on ``/tf``.

.. card:: Step 8 --- Publish an annotated debug image
   :class-card: sd-border-secondary

   Draw a 3D axis triad on each detected marker and outline the detections,
   then republish the image so you can verify detections live in
   ``rqt_image_view``.

   .. code-block:: python

      cv2.drawFrameAxes(
          cv_image, self._camera_matrix, self._dist_coeffs,
          rvec, tvec, self._marker_size * 0.5,
      )
      cv2.aruco.drawDetectedMarkers(cv_image, corners, ids)

      debug_msg = self._bridge.cv2_to_imgmsg(cv_image, encoding="bgr8")
      debug_msg.header = msg.header
      self._debug_image_pub.publish(debug_msg)

.. admonition:: Demonstration
   :class: demonstration

   - T1:

     .. code-block:: console

        ros2 launch rosbot_gazebo aruco_box_world.launch.py

   - T2:

     .. code-block:: console

        ros2 launch frame_demo dynamic_detector_demo.launch.py

   - T3:

     .. code-block:: console

        ros2 run rqt_image_view rqt_image_view /aruco_detection_image

   - T4:

     .. code-block:: console

        ros2 run rqt_tf_tree rqt_tf_tree --force-discover


Static vs. Dynamic Comparison
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^

.. list-table::
   :header-rows: 1
   :widths: 25 40 35
   :class: compact-table

   * -
     - **StaticTransformBroadcaster**
     - **TransformBroadcaster**
   * - Topic
     - ``/tf_static``
     - ``/tf``
   * - Published
     - Once (latched)
     - Repeatedly in a loop
   * - Use case
     - Fixed sensor offsets, URDF links
     - Robot pose, PTZ camera
   * - Time-varying?
     - No
     - Yes

.. warning::

   Do not use ``StaticTransformBroadcaster`` for transforms that change
   over time. TF2 assumes static transforms are permanent and will not
   expire them.


Listening for Marker Frames
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^

A **TF listener** queries the transform tree for the pose of one frame
expressed in another. In this demo we listen for the
``aruco_marker_<id>`` frames broadcast by the dynamic detector and
report each marker's position in the ``odom`` frame.

**Scenario**

- **Goal:** for every ``aruco_marker_<id>`` frame published by
  ``dynamic_detector_demo.py``, look up its pose in the ``odom`` frame
  and log :math:`(x, y, z)`.
- **Why** ``odom`` **?** The detector publishes each marker relative to
  the camera's optical frame. Chaining through the TF tree to ``odom``
  gives a *world-fixed* position that is independent of the robot's
  motion.
- **Challenge:** the listener does not know in advance which marker IDs
  will be detected. It must *auto-discover* new marker frames as they
  appear in the TF tree.
- **Implementation:** ``aruco_marker_listener.py``
- **Environment:**

  .. code-block:: console

     ros2 launch frame_demo dynamic_detector_listener.launch.py

.. card:: Step 1 --- Set up the Buffer and TransformListener
   :class-card: sd-border-secondary

   The ``Buffer`` stores incoming transforms (default: last 10 s); the
   ``TransformListener`` subscribes to ``/tf`` and ``/tf_static`` and
   populates the buffer automatically. A timer drives the periodic lookup.

   .. code-block:: python

      from tf2_ros import Buffer, TransformListener, TransformException
      from rclpy.duration import Duration
      import rclpy, yaml

      class ArucoMarkerListener(Node):
          def __init__(self):
              super().__init__("aruco_marker_listener")
              self._target_frame  = "odom"
              self._marker_prefix = "aruco_marker_"
              self._tf_buffer   = Buffer()
              self._tf_listener = TransformListener(self._tf_buffer, self)
              self._timer = self.create_timer(1.0, self._timer_callback)

   .. note::

      Always create the ``Buffer`` **before** the ``TransformListener``:
      the listener needs a buffer to write into.

.. card:: Step 2 --- Auto-discover marker frames in the TF buffer
   :class-card: sd-border-secondary

   ``Buffer.all_frames_as_yaml()`` returns a YAML snapshot of every frame
   currently known to TF. Filter by the prefix to get the markers.

   .. code-block:: python

      def _discover_marker_frames(self):
          frames = yaml.safe_load(self._tf_buffer.all_frames_as_yaml()) or {}
          return sorted(
              name for name in frames
              if name.startswith(self._marker_prefix)
          )

   **Why auto-discovery?** The detector creates a new child frame each time
   it sees a new marker ID. Hard-coding IDs would not scale; walking the
   buffer picks up new markers as soon as they appear.

.. card:: Step 3 --- Look up each marker in the target frame
   :class-card: sd-border-secondary

   ``lookup_transform(target, source, time)`` returns the transform
   expressing ``source`` in ``target``. The ``timeout`` lets TF wait
   briefly for data that may still be in flight.

   .. code-block:: python

      def _timer_callback(self):
          for frame in self._discover_marker_frames():
              try:
                  t = self._tf_buffer.lookup_transform(
                      self._target_frame,        # "odom"
                      frame,                     # "aruco_marker_<id>"
                      rclpy.time.Time(),         # latest available
                      timeout=Duration(seconds=0.1),
                  )
              except TransformException as e:
                  self.get_logger().warn(
                      f"{frame} -> {self._target_frame}: {e}")
                  continue
              p = t.transform.translation
              self.get_logger().info(
                  f"{frame}: x={p.x:.3f}, y={p.y:.3f}, z={p.z:.3f}")

.. admonition:: Demonstration
   :class: demonstration

   One launch file starts the simulation, the dynamic detector, and the
   listener together:

   - T1:

     .. code-block:: console

        ros2 launch rosbot_gazebo aruco_markers_world.launch.py

   - T2:

     .. code-block:: console

        ros2 launch frame_demo dynamic_detector_listener.launch.py

   - T3:

     .. code-block:: console

        ros2 run rqt_tf_tree rqt_tf_tree --force-discover

   Expected output on T2 once markers are in view:

   .. code-block:: console

      [aruco_detector-1] [INFO] [1776111488.195991627] [aruco_detector]: Detected markers: [2, 4, 5]

      [aruco_marker_listener]: aruco_marker_2 in odom: x=1.984, y=-0.500, z=0.251
      [aruco_marker_listener]: aruco_marker_4 in odom: x=1.967, y=-0.003, z=0.251
      [aruco_marker_listener]: aruco_marker_5 in odom: x=1.977, y=0.491, z=0.250


Behind the Scenes
----------------------------------------------------

Understanding the roles of TF broadcasters and listeners, and the math
behind transform lookups, will help you debug common issues such as
missing frames, stale transforms, and incorrect chaining.

**TF Broadcasters**

Nodes that publish transform information are called **transform
broadcasters** (TF broadcasters). They take information about the
relative pose of two frames and broadcast it over ROS topics. Common
broadcasters include:

- **Robot state publishers** publishing joint angles and deriving link
  poses.
- **Sensor drivers** publishing the pose of the sensor relative to the
  robot.
- **SLAM algorithms** publishing the robot's pose relative to a map.
- **Static transform publishers** for fixed relationships between
  frames.

**TF Listeners**

Nodes that need to know the transform between two frames are called
**transform listeners** (TF listeners). They subscribe to the transform
information being broadcast and use the TF2 API to query for specific
transformations at specific times.

**How Is the Frame Tree Built and Maintained?**

The transform tree in ROS is built and maintained by combining the
information from both the ``/tf`` and ``/tf_static`` topics.

1. **tf2 Buffer**: The TF2 library maintains an internal buffer of
   transform information. This buffer stores the transformations between
   different coordinate frames along with their timestamps.
2. **Data Sources**: The TF2 buffer populates itself by subscribing to
   both ``/tf`` and ``/tf_static``.
3. **Building the Tree**: As TF2 receives ``TransformStamped`` messages
   from both topics, it uses this information to build and update the
   internal representation of the transform tree. Each
   ``TransformStamped`` message defines a directed edge in the tree,
   representing the transformation from the child frame to the parent
   frame at a specific time.

**Transform Lookup: The Math**

When you call
``tf_buffer.lookup_transform("odom", "marker", time)``, the listener:

1. **Traverses the TF tree** to find a path between frames.
2. **Chains transforms** via matrix multiplication.
3. **Returns** the composed transform.

.. admonition:: Example: Marker Pose in Odom Frame
   :class: example

   .. math::

      \text{odom} \xrightarrow{T_1} \text{base_link} \xrightarrow{T_2} \text{camera_optical} \xrightarrow{T_3} \text{marker}

   The listener computes:
   :math:`T_{\text{odom} \rightarrow \text{marker}} = T_1 \times T_2 \times T_3`

   Where each :math:`T` is a :math:`4 \times 4` homogeneous transformation
   matrix:

   .. math::

      T = \begin{bmatrix} R_{3\times3} & t_{3\times1} \\ 0_{1\times3} & 1 \end{bmatrix}
        = \begin{bmatrix} r_{00} & r_{01} & r_{02} & t_x \\ r_{10} & r_{11} & r_{12} & t_y \\ r_{20} & r_{21} & r_{22} & t_z \\ 0 & 0 & 0 & 1 \end{bmatrix}


KDL Frames
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^

The **KDL (Kinematics and Dynamics Library)** lets you chain coordinate
transforms *locally*, in Python, instead of relying on TF2 to compose
them implicitly. Useful when you already have the individual hops in
hand and want to control the math yourself.

**Scenario**

Reuse the overhead-camera world (one box, one marker). After the static
broadcaster runs, TF contains two hops relevant to the marker:

.. math::

   \underbrace{\text{odom}}_{\text{target}} \;\xrightarrow{T_1}\;
   \underbrace{\text{overhead_camera_optical_frame}}_{\text{middle}} \;\xrightarrow{T_2}\;
   \underbrace{\text{static_aruco_box}}_{\text{source}}

- **Goal:** look up :math:`T_1` and :math:`T_2` *separately* and
  compose them in Python with PyKDL to recover
  :math:`T_{\text{odom} \rightarrow \text{box}} = T_1 \cdot T_2`.
- **Why do this by hand?** The math is what
  ``lookup_transform(odom, static_aruco_box)`` does internally. Doing it
  yourself makes chaining explicit and lets you combine TF data with
  poses computed locally (e.g., a detector result that has not been
  broadcast yet).
- **Implementation:** ``kdl_chain_demo.py``
- **Environment:**

  .. code-block:: console

     ros2 launch frame_demo kdl_chain_demo.launch.py

**What is a KDL Frame?**

A ``PyKDL.Frame`` is a :math:`4 \times 4` homogeneous transform: a
rotation (``PyKDL.Rotation``) plus a translation (``PyKDL.Vector``).

.. math::

   T = \begin{bmatrix} R_{3\times3} & t_{3\times1} \\ 0_{1\times3} & 1 \end{bmatrix}

**Three-step workflow**

1. **Convert** each input pose/transform into a ``PyKDL.Frame``.
2. **Multiply** frames to chain transforms:

   .. math::

      T_{A \rightarrow C} = T_{A \rightarrow B} \cdot T_{B \rightarrow C}

3. **Convert** the resulting ``PyKDL.Frame`` back to a
   ``geometry_msgs/Pose`` (or publish as a TF).

.. card:: Step 1 --- Helpers: ROS <-> KDL conversions
   :class-card: sd-border-secondary

   TF returns a ``TransformStamped``; PyKDL wants a ``Frame``. One function
   each way.

   .. code-block:: python

      import PyKDL
      from geometry_msgs.msg import Pose, TransformStamped

      def transform_to_kdl(t: TransformStamped) -> PyKDL.Frame:
          p, q = t.transform.translation, t.transform.rotation
          return PyKDL.Frame(
              PyKDL.Rotation.Quaternion(q.x, q.y, q.z, q.w),
              PyKDL.Vector(p.x, p.y, p.z),
          )

      def kdl_to_pose(frame: PyKDL.Frame) -> Pose:
          pose = Pose()
          pose.position.x, pose.position.y, pose.position.z = (
              frame.p.x(), frame.p.y(), frame.p.z()
          )
          qx, qy, qz, qw = frame.M.GetQuaternion()
          pose.orientation.x, pose.orientation.y = qx, qy
          pose.orientation.z, pose.orientation.w = qz, qw
          return pose

.. card:: Step 2 --- Look up each hop separately
   :class-card: sd-border-secondary

   We do *not* ask TF to chain for us. We ask for each edge of the tree
   individually.

   .. code-block:: python

      t_target_middle = self._tf_buffer.lookup_transform(
          self._target,  self._middle,  rclpy.time.Time(),
          timeout=Duration(seconds=0.1),
      )   # odom -> overhead_camera_optical_frame
      t_middle_source = self._tf_buffer.lookup_transform(
          self._middle,  self._source,  rclpy.time.Time(),
          timeout=Duration(seconds=0.1),
      )   # overhead_camera_optical_frame -> static_aruco_box

.. card:: Step 3 --- Compose with KDL, then convert back
   :class-card: sd-border-secondary

   ``*`` is overloaded: multiplying two ``PyKDL.Frame`` objects returns
   their composition.

   .. code-block:: python

      kdl_target_middle = transform_to_kdl(t_target_middle)
      kdl_middle_source = transform_to_kdl(t_middle_source)

      kdl_target_source = kdl_target_middle * kdl_middle_source   # chain!

      pose_in_target = kdl_to_pose(kdl_target_source)

**KDL vs. TF2**

**Use KDL when:**

- You want explicit control over the math.
- Some inputs are poses computed in your node, not yet on ``/tf``.
- Combining a TF lookup with a sensor measurement for a one-off
  calculation.

**Use TF2 when:**

- Multiple nodes need the same chained transform.
- You want interpolation / time-synchronized lookups.
- You want RViz to visualize the chain.

.. note::

   KDL does not replace TF2 --- on a moving robot you still need TF to
   get the current ``odom`` -> ``camera`` transform. KDL just owns the
   multiplication step, which is otherwise hidden inside
   ``lookup_transform``.

.. admonition:: Demonstration
   :class: demonstration

   - T1:

     .. code-block:: console

        ros2 launch rosbot_gazebo aruco_box_camera_world.launch.py

   - T2:

     .. code-block:: console

        ros2 launch frame_demo kdl_chain_demo.launch.py

   - T3:

     .. code-block:: console

        ros2 run tf2_ros tf2_echo odom static_aruco_box

   T2 should print the KDL-composed pose; T3 should print the direct TF
   lookup --- the two values should match.

   .. code-block:: console

      [kdl_chain_demo]: KDL-composed static_aruco_box in odom:
          pos=(2.001, 2.001, 0.481)
          quat=(0.707, -0.707, 0.000, 0.000)


Appendix A: Gimbal Lock
====================================================

**The Gimbal Lock Problem: Mathematical Proof**

Using the Tait-Bryan ZYX convention, the full rotation matrix is
:math:`R = R_z(\psi)\,R_y(\theta)\,R_x(\phi)`:

.. math::

   R = \begin{bmatrix}
   c\psi\, c\theta & c\psi\, s\theta\, s\phi - s\psi\, c\phi & c\psi\, s\theta\, c\phi + s\psi\, s\phi \\
   s\psi\, c\theta & s\psi\, s\theta\, s\phi + c\psi\, c\phi & s\psi\, s\theta\, c\phi - c\psi\, s\phi \\
   -s\theta        & c\theta\, s\phi                          & c\theta\, c\phi
   \end{bmatrix}

Now set :math:`\theta = \pi/2`, so :math:`\cos\theta = 0` and
:math:`\sin\theta = 1`. Applying the angle subtraction identities:

.. math::

   R\big|_{\theta=\pi/2} = \begin{bmatrix}
   0  & -\sin(\psi-\phi) & \cos(\psi-\phi) \\
   0  &  \cos(\psi-\phi) & \sin(\psi-\phi) \\
   -1 & 0                & 0
   \end{bmatrix}

.. warning::

   The matrix now depends only on the **difference**
   :math:`(\psi - \phi)`, not on :math:`\psi` and :math:`\phi`
   individually. Two parameters have collapsed into one: one degree of
   freedom is lost. For example, :math:`(\psi=90°, \phi=30°)` and
   :math:`(\psi=120°, \phi=60°)` both give :math:`\psi-\phi=60°` and
   produce *identical* rotation matrices.


Appendix B: Perspective-n-Point (PnP)
====================================================

**What is PnP?**

**PnP = Perspective-n-Point.** Given :math:`n` known 3D points in an
object's frame and their matching 2D pixel projections, plus the camera
intrinsics (:math:`K`, distortion :math:`d`), recover the 6-DoF pose
:math:`(R, t)` of the object in the camera frame.

The projection model is:

.. math::

   s\,\begin{bmatrix} u \\ v \\ 1 \end{bmatrix}
   \;=\; K\,\bigl[\,R \;\mid\; t\,\bigr]\,
   \begin{bmatrix} X \\ Y \\ Z \\ 1 \end{bmatrix},

where :math:`(X,Y,Z)` is a 3D point in the object frame, :math:`(u,v)`
is its pixel, and :math:`s` is a scalar depth. PnP solves for :math:`R`
and :math:`t` so the projected points match the observed pixels.

**In the ArUco case** (:math:`n=4`):

- **3D points**: the four marker corners in the marker's own frame
  (known from ``marker_size``).
- **2D points**: the four pixel corners from ``detectMarkers``.
- **Intrinsics**: from ``CameraInfo``.
- **Output**: ``rvec``, ``tvec`` --- pose of the marker in the camera
  optical frame.