The scalars xt,yt,ut are related by the equations
xt=xt−1+ut−1,yt=xt−1+ηt−1,t=1,2,…,T,
where the initial state x0 is normally distributed with mean x^0 and variance 1 and {ηt} is a sequence of independent random variables each normally distributed with mean 0 and variance 1 . The control variable ut is to be chosen at time t on the basis of information Wt, where W0=(x^0) and
Wt=(x^0,u0,…,ut−1,y1,…,yt),t=1,2,…,T
(a) Let x^1,x^2,…,x^T be the Kalman filter estimates of x1,x2,…,xT, i.e.
x^t=x^t−1+ut−1+ht(yt−x^t−1)
where ht is chosen to minimise E((x^t−xt)2∣Wt). Calculate ht and show that, conditional on Wt,xt is normally distributed with mean x^t and variance Vt=1/(1+t).
(b) Define
F(WT)=E(xT2∣WT), and F(Wt)=ut,…,uT−1infE(xT2+j=t∑T−1uj2∣Wt),t=0,…,T−1.
Show that F(Wt)=x^t2Pt+dt, where Pt=1/(T−t+1),dT=1/(1+T) and dt−1=Vt−1VtPt+dt.
(c) Show that the minimising control ut can be expressed in the form ut=−Ktx^t and find Kt. How would the expression for Kt be altered if x0 or {ηt} had variances other than 1?