Paper 4, Section II, J
Consider the normal linear model where the -vector of responses satisfies with . Here is an matrix of predictors with full column rank where and is an unknown vector of regression coefficients. For , denote the th column of by , and let be with its th column removed. Suppose where is an -vector of 1 's. Denote the maximum likelihood estimate of by . Write down the formula for involving , the orthogonal projection onto the column space of .
Consider with . By thinking about the orthogonal projection of onto , show that
[You may use standard facts about orthogonal projections including the fact that if and are subspaces of with a subspace of and and denote orthogonal projections onto and respectively, then for all .]
By considering the fitted values , explain why if, for any , a constant is added to each entry in the th column of , then will remain unchanged. Let . Why is (*) also true when all instances of and are replaced by and respectively?
The marks from mid-year statistics and mathematics tests and an end-of-year statistics exam are recorded for 100 secondary school students. The first few lines of the data are given below.
The following abbreviated output is obtained:
What are the hypothesis tests corresponding to the final column of the coefficients table? What is the hypothesis test corresponding to the final line of the output? Interpret the results when testing at the level.
How does the following sample correlation matrix for the data help to explain the relative sizes of some of the -values?