A particle follows a discrete-time trajectory on R given by
xt+1=(Axt+ut)ξt+ϵt
for t=1,2,…,T. Here T⩾2 is a fixed integer, A is a real constant, xt and ut are the position of the particle and control action at time t, respectively, and (ξt,ϵt)t=1T is a sequence of independent random vectors with
Eξt=Eϵt=0,var(ξt)=Vξ>0,var(ϵt)=Vϵ>0 and cov(ξt,ϵt)=0
Find the optimal control, i.e. the control action ut, defined as a function of (x1,…,xt;u1,…,ut−1), that minimizes
t=1∑Txt2+ct=1∑T−1ut2
where c>0 is given.
On which of Vϵ and Vξ does the optimal control depend?
Find the limiting form of the optimal control as T→∞, and the minimal average cost per unit time.