Consider the linear model Y=Xβ+ε, where ε∼Nn(0,σ2I) and X is an n×p matrix of full rank p<n. Suppose that the parameter β is partitioned into k sets as follows: β⊤=(β1⊤⋯βk⊤). What does it mean for a pair of sets βi,βj,i=j, to be orthogonal? What does it mean for all k sets to be mutually orthogonal?
In the model
Yi=β0+β1xi1+β2xi2+εi
where εi∼N(0,σ2) are independent and identically distributed, find necessary and sufficient conditions on x11,…,xn1,x12,…,xn2 for β0,β1 and β2 to be mutually orthogonal.
If β0,β1 and β2 are mutually orthogonal, what consequence does this have for the joint distribution of the corresponding maximum likelihood estimators β^0,β^1 and β^2 ?