Suppose {xt}t⩾0 is a Markov chain. Consider the dynamic programming equation
Fs(x)=max{r(x),βE[Fs−1(x1)∣x0=x]},s=1,2,…,
with r(x)>0,β∈(0,1), and F0(x)=0. Prove that:
(i) Fs(x) is nondecreasing in s;
(ii) Fs(x)⩽F(x), where F(x) is the value function of an infinite-horizon problem that you should describe;
(iii) F∞(x)=lims→∞Fs(x)=F(x).
A coin lands heads with probability p. A statistician wishes to choose between: H0:p=1/3 and H1:p=2/3, one of which is true. Prior probabilities of H1 and H0 in the ratio x:1 change after one toss of the coin to ratio 2x:1 (if the toss was a head) or to ratio x:2 (if the toss was a tail). What problem is being addressed by the following dynamic programming equation?
F(x)=max{1+x1,1+xx,β[(1+x132+1+xx31)F(x/2)+(1+x131+1+xx32)F(2x)]}
Prove that G(x)=(1+x)F(x) is a convex function of x.
By sketching a graph of G, describe the form of the optimal policy.