Friday 17 November 2017

The Linear Quadratic Regulator

Optimal Control and the Linear Quadratic Regulator (LQR)

Today, I will not write an introductory passage to start off my blog, because writing an introduction to Optimal Control would itself require a whole blog. However, I will add small tidbits as and when needed.
To understand the topic, we need some basic definitions.

1.
A control system can be represented in terms of State Space, as follows,

$$\dot{x}(t) = A\,x(t) + B\,u(t)$$
$$y(t) = C\,x(t) + D\,u(t)$$

In the above formulation,
$x(t)$ is the state vector; $x(t) \in \mathbb{R}^{n}$.
$y(t)$ is the output vector; $y(t) \in \mathbb{R}^{p}$.
$u(t)$ is the input vector; $u(t) \in \mathbb{R}^{m}$.
$A$ is the System Matrix; $A \in \mathbb{R}^{n \times n}$.
$B$ is the Input Matrix; $B \in \mathbb{R}^{n \times m}$.
$C$ is the Output Matrix; $C \in \mathbb{R}^{p \times n}$.
$D$ is the Feed-forward Matrix; $D \in \mathbb{R}^{p \times m}$.
Now, for a system to be controllable, we first define a matrix $\mathcal{C}$, called the controllability matrix, such that,

$$\mathcal{C} = \begin{bmatrix} B & AB & A^{2}B & \cdots & A^{n-1}B \end{bmatrix}$$

The system is controllable if $\mathcal{C}$ has full row rank (i.e. rank$(\mathcal{C}) = n$).

We will assume that we deal with Controllable systems only.
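As a quick numerical illustration of this rank test, here is a minimal Python sketch; the matrices A and B below are made-up for this example, not taken from any particular system.

```python
import numpy as np

# Made-up example system: 2 states, 1 input (for illustration only).
A = np.array([[0.0, 1.0],
              [-2.0, -3.0]])
B = np.array([[0.0],
              [1.0]])

n = A.shape[0]

# Controllability matrix [B, AB, A^2 B, ..., A^(n-1) B]
ctrb = np.hstack([np.linalg.matrix_power(A, k) @ B for k in range(n)])

# Full row rank (= n) means the pair (A, B) is controllable.
print("rank =", np.linalg.matrix_rank(ctrb), "out of", n)
```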

Usually, a single-input system's state feedback controller is designed using the eigenvalue assignment method, also known as the pole placement method.

2.
The pole placement method is the methodology of finding the control vector $u(t)$ in the state feedback form
$$u(t) = -K\,x(t)$$
So, the state space representation changes as,
$$\dot{x}(t) = (A - BK)\,x(t)$$
$K$ is found as the gain that makes the closed-loop characteristic polynomial match the desired one,
$$\det\!\left(sI - (A - BK)\right) = \Delta_{des}(s)$$
Here, $\mu_1, \mu_2, \dots, \mu_n$ are the desired pole locations. Note that $\Delta_{des}(s)$ is defined as
$$\Delta_{des}(s) = (s - \mu_1)(s - \mu_2)\cdots(s - \mu_n)$$
However, for a multi-input system the feedback gain, i.e. $K$, is not unique.
The Linear Quadratic Control strategy is used to deal with this issue.
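Before diving into LQR, here is a minimal sketch of single-input pole placement using SciPy's place_poles; the system matrices and the desired pole locations are arbitrary choices for illustration.

```python
import numpy as np
from scipy.signal import place_poles

# Same made-up single-input system as in the controllability sketch.
A = np.array([[0.0, 1.0],
              [-2.0, -3.0]])
B = np.array([[0.0],
              [1.0]])

# Arbitrarily chosen desired closed-loop pole locations.
desired_poles = [-4.0, -5.0]

K = place_poles(A, B, desired_poles).gain_matrix   # feedback law u = -K x

# Sanity check: eigenvalues of (A - B K) should sit at the desired poles.
print("K =", K)
print("closed-loop poles =", np.linalg.eigvals(A - B @ K))
```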

Now, we dive into the Linear Quadratic Regulator (LQR) formulation, for an $m$-input and $n$-state system with $u(t) \in \mathbb{R}^{m}$, $x(t) \in \mathbb{R}^{n}$. Consider a system,

$$\dot{x}(t) = A\,x(t) + B\,u(t), \qquad x(t_0) = x_0$$

Our aim is to find an open-loop control $u(t)$, for $t \in [t_0, t_f]$, such that we minimize:

$$J = \frac{1}{2}\, x(t_f)^{T} S\, x(t_f) + \frac{1}{2} \int_{t_0}^{t_f} \left( x(t)^{T} Q\, x(t) + u(t)^{T} R\, u(t) \right) dt$$

where $Q$ and $S$ are symmetric positive semi-definite matrices.
$R$ is a symmetric positive definite matrix. Note that $t_0$, $t_f$ and $x_0$ are fixed and given data.
The controller's aim is basically to keep $x(t)$ close to $0$, especially at $t_f$, which is the final time.
In $J$,
  • $Q$ works against the transient response.
  • $S$ works against the final state.
  • $R$ works against the control effort.
The above formulation can regulate the output near $0$.
Note that, if the quantity we want to regulate is the output $y(t) = C\,x(t)$, we can define $Q$ and $S$ as $Q = S = C^{T} C$, so that $x^{T} Q\, x = y^{T} y$.
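A tiny sketch of that output-weighting choice, under the assumption that we only care about the first state as the output (the C matrix here is hypothetical):

```python
import numpy as np

# Hypothetical output matrix: the output is just the first state.
C = np.array([[1.0, 0.0]])

# Weighting the output y = C x is the same as weighting the state with Q = C^T C,
# because x^T (C^T C) x = (C x)^T (C x) = y^T y.
Q = C.T @ C
print(Q)
```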
We can now have a theorem as follows,
For a system $\dot{x}(t) = A\,x(t) + B\,u(t)$ with fixed initial and final conditions, $x(t_0) = x_0$ and $t_0$, $t_f$ given; and clearly $t_f > t_0$. We define our time horizon as $[t_0, t_f]$ such that $t_f$ is finite. We find $u(t)$ such that our cost function, $J$, is minimized. $J$ is defined as,

$$J = \frac{1}{2}\, x(t_f)^{T} S\, x(t_f) + \frac{1}{2} \int_{t_0}^{t_f} \left( x^{T} Q\, x + u^{T} R\, u \right) dt$$

Here, the first term of $J$ is the final cost and the second term is the recurring cost.
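To make the final-cost/recurring-cost split concrete, here is a rough numerical sketch: it simulates a made-up system under an arbitrary (not optimal) state feedback and evaluates both parts of J. All matrices, the gain, and the horizon are assumptions for illustration only.

```python
import numpy as np
from scipy.integrate import solve_ivp, trapezoid

# Made-up system, weights, horizon, and an arbitrary stabilizing gain u = -K x.
A = np.array([[0.0, 1.0], [-2.0, -3.0]])
B = np.array([[0.0], [1.0]])
Q = np.diag([1.0, 0.1])        # weights the transient state response
R = np.array([[0.5]])          # weights the control effort
S = np.diag([5.0, 5.0])        # weights the final state
K = np.array([[1.0, 1.0]])
t0, tf = 0.0, 5.0
x0 = np.array([1.0, 0.0])

# Simulate the closed-loop system x_dot = (A - B K) x on [t0, tf].
sol = solve_ivp(lambda t, x: (A - B @ K) @ x, (t0, tf), x0,
                dense_output=True, max_step=0.01)

ts = np.linspace(t0, tf, 2001)
xs = sol.sol(ts)                                  # states, shape (2, len(ts))
us = -K @ xs                                      # inputs, shape (1, len(ts))

# Recurring (running) cost: 0.5 * integral of (x'Qx + u'Ru); final cost: 0.5 * x(tf)' S x(tf).
running = 0.5 * trapezoid(np.einsum('it,ij,jt->t', xs, Q, xs)
                          + np.einsum('it,ij,jt->t', us, R, us), ts)
final = 0.5 * xs[:, -1] @ S @ xs[:, -1]

print("final cost =", final, " recurring cost =", running, " J =", final + running)
```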


Now, we will formulate some important functions that will convert the problem above, which is a constrained optimal control problem, into an unconstrained optimal control problem. [THIS MAY NOT MAKE SENSE TO YOU YET, WHICH IS NATURAL. HOLD ON.]

Note that,
$$L(x, u, t) = \frac{1}{2}\left( x^{T} Q\, x + u^{T} R\, u \right)$$
is called the Lagrangian.
$H$ is the Hamiltonian. It is defined in terms of $L$ and a Lagrange multiplier (costate) vector $\lambda(t)$ as $H = L + \lambda^{T}(Ax + Bu)$. Or, written out, it can be defined as,

$$H(x, u, \lambda, t) = \frac{1}{2}\left( x^{T} Q\, x + u^{T} R\, u \right) + \lambda^{T}(t)\left( A\,x(t) + B\,u(t) \right)$$

The above definition uses the same weights $Q$ and $R$ as in the theorem, so we define $H$ along the same lines, just for convenience of computation.

The necessary conditions for optimality (which we justify below) can then be written as
$$\dot{x} = \frac{\partial H}{\partial \lambda} = A\,x + B\,u$$
$$\dot{\lambda} = -\frac{\partial H}{\partial x} = -Q\,x - A^{T}\lambda$$
$$\frac{\partial H}{\partial u} = R\,u + B^{T}\lambda = 0$$
These equations together form a set of differential equations (in $x$ and $\lambda$, obviously) with split boundary conditions at $t_0$ and $t_f$. Now, we can easily define $u$ in terms of $x$ and/or $\lambda$.
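As a sanity check on that last point, here is a small SymPy sketch for the scalar case (one state, one input); the symbol names are my own placeholders. It shows that $\partial H/\partial u = 0$ expresses $u$ in terms of $\lambda$, while $-\partial H/\partial x$ gives the costate dynamics.

```python
import sympy as sp

# Scalar illustration (1 state, 1 input); symbols are placeholders for this sketch.
x, u, lam, A, B, Q, R = sp.symbols('x u lam A B Q R', real=True)

# Lagrangian and Hamiltonian as defined above.
L = sp.Rational(1, 2) * (Q * x**2 + R * u**2)
H = L + lam * (A * x + B * u)

# Stationarity dH/du = 0 gives u in terms of the costate lam.
u_star = sp.solve(sp.Eq(sp.diff(H, u), 0), u)[0]
print("u* =", u_star)                 # expect -B*lam/R

# Costate equation lam_dot = -dH/dx.
print("lam_dot =", -sp.diff(H, x))    # expect -A*lam - Q*x
```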
As mentioned earlier, the solution is found by converting the constrained optimal control problem into an unconstrained optimal control problem using a Lagrange multiplier function $\lambda(t)$:

$$J_a = \frac{1}{2}\, x(t_f)^{T} S\, x(t_f) + \int_{t_0}^{t_f} \left[ \frac{1}{2}\left( x^{T} Q\, x + u^{T} R\, u \right) + \lambda^{T}\left( A\,x + B\,u - \dot{x} \right) \right] dt$$
Notice that,

$$\int_{t_0}^{t_f} \lambda^{T} \dot{x}\, dt = \left[ \lambda^{T} x \right]_{t_0}^{t_f} - \int_{t_0}^{t_f} \dot{\lambda}^{T} x\, dt$$

Therefore,

$$J_a = \frac{1}{2}\, x(t_f)^{T} S\, x(t_f) - \lambda^{T}(t_f)\, x(t_f) + \lambda^{T}(t_0)\, x(t_0) + \int_{t_0}^{t_f} \left[ \frac{1}{2}\left( x^{T} Q\, x + u^{T} R\, u \right) + \lambda^{T}\left( A\,x + B\,u \right) + \dot{\lambda}^{T} x \right] dt$$

As the Hamiltonian function is defined in the equation above, thus,

$$J_a = \frac{1}{2}\, x(t_f)^{T} S\, x(t_f) - \lambda^{T}(t_f)\, x(t_f) + \lambda^{T}(t_0)\, x(t_0) + \int_{t_0}^{t_f} \left[ H(x, u, \lambda, t) + \dot{\lambda}^{T} x \right] dt$$
The necessary condition for an optimal solution is that the variation of the modified cost $J_a$, with respect to all admissible variations of the system, vanishes at all times from $t_0$ to $t_f$.
We will derive the optimal control $u^{*}(t)$ analytically in the next post and formulate the Riccati equation, which will lay the foundation for some amazing control strategies.
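For the impatient: the derivation comes next time, but as a preview, the steady-state (infinite-horizon) version of that Riccati equation can be solved numerically with SciPy as sketched below, reusing the made-up matrices from the earlier examples.

```python
import numpy as np
from scipy.linalg import solve_continuous_are

# Same made-up system and weights as in the earlier sketches.
A = np.array([[0.0, 1.0], [-2.0, -3.0]])
B = np.array([[0.0], [1.0]])
Q = np.diag([1.0, 0.1])
R = np.array([[0.5]])

# Steady-state Riccati solution P and the corresponding LQR gain, u = -K x.
P = solve_continuous_are(A, B, Q, R)
K = np.linalg.solve(R, B.T @ P)

print("P =\n", P)
print("K =", K)
print("closed-loop poles =", np.linalg.eigvals(A - B @ K))
```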
Cheers!
