Consider the following generalized linear model for responses y1,…,yn as a function of explanatory variables x1,…,xn, where xi=(xi1,…,xip)⊤ for i=1,…,n. The responses are modelled as observed values of independent random variables Y1,…,Yn, with
Yi∼ED(μi,σi2),g(μi)=xi⊤β,σi2=σ2ai,
Here, g is a given link function, β and σ2 are unknown parameters, and the ai are treated as known.
[Hint: recall that we write Y∼ED(μ,σ2) to mean that Y has density function of the form
f(y;μ,σ2)=a(σ2,y)exp{σ21[θ(μ)y−K(θ(μ))]}
for given functions a and θ.]
[ You may use without proof the facts that, for such a random variable Y,
E(Y)=K′(θ(μ)),var(Y)=σ2K′′(θ(μ))≡σ2V(μ).]
Show that the score vector and Fisher information matrix have entries:
Uj(β)=i=1∑nσi2V(μi)g′(μi)(yi−μi)xij,j=1,…,p
and
ijk(β)=i=1∑nσi2V(μi)(g′(μi))2xijxik,j,k=1,…,p
How do these expressions simplify when the canonical link is used?
Explain briefly how these two expressions can be used to obtain the maximum likelihood estimate β^ for β.