One-way ANOVA Model
Consider the balanced one-way layout model for \(y_{ij}\) a response on the \(j^{th}\) unit in the \(i^{th}\) treatment group. Suppose that there are \(a\) treatments and \(n\) units in the \(i^{th}\) treatment group, with total sample size \(N=an\). Let \(x_i\) be an indicator vector for treatment \(i\), and \(j_N\) be a vector of ones.
1. The Cell-Means Model
The cell-means model represents the response strictly in terms of the mean of its respective treatment group.
Equation and Vector Form: \[
y_{ij} = \mu_i + e_{ij}, \quad i=1,\dots,a, \quad j=1,\dots,n
\] where the \(e_{ij}\) are i.i.d. \(N(0,\sigma^2)\). In vector notation, this is: \[
y = \mu_1 x_1 + \mu_2 x_2 + \dots + \mu_a x_a + e, \quad e \sim N(0, \sigma^2 I)
\]
Matrix Formulation (\(n=2\) observations per level): Let the parameter vector be \(\boldsymbol{\mu} = (\mu_1, \mu_2, \dots, \mu_a)^T\). Arranging the response and error vectors by treatment group, the model \(\mathbf{y} = X_1\boldsymbol{\mu} + \mathbf{e}\) is written as: \[
\begin{pmatrix} y_{11} \\ y_{12} \\ y_{21} \\ y_{22} \\ \vdots \\ y_{a1} \\ y_{a2} \end{pmatrix} = \begin{pmatrix} 1 & 0 & \dots & 0 \\ 1 & 0 & \dots & 0 \\ 0 & 1 & \dots & 0 \\ 0 & 1 & \dots & 0 \\ \vdots & \vdots & \ddots & \vdots \\ 0 & 0 & \dots & 1 \\ 0 & 0 & \dots & 1 \end{pmatrix} \begin{pmatrix} \mu_1 \\ \mu_2 \\ \vdots \\ \mu_a \end{pmatrix} + \begin{pmatrix} e_{11} \\ e_{12} \\ e_{21} \\ e_{22} \\ \vdots \\ e_{a1} \\ e_{a2} \end{pmatrix}
\]
Identifiability: The \(2a \times a\) model matrix \(X_1\) is of full column rank (rank \(a\)). Therefore, the parameters \(\mu_i\) are uniquely identifiable.
2. The Effects Model
An alternative, but equivalent, linear model is the effects model, which decomposes the cell mean (\(\mu_i\)) into a baseline value (\(\mu\)) and a treatment-specific deviation (\(\alpha_i\)). The explicit relationship between the parameters of the two models is: \[
\mu_i = \mu + \alpha_i
\]
Equation and Vector Form: Substituting this relationship into the cell-means model yields: \[
y_{ij} = \mu + \alpha_i + e_{ij}, \quad i=1,\dots,a, \quad j=1,\dots,n
\] with the same assumptions on the errors. In vector notation, this is: \[
y = \mu j_N + \alpha_1 x_1 + \alpha_2 x_2 + \dots + \alpha_a x_a + e, \quad e \sim N(0, \sigma^2 I)
\]
Matrix Formulation (\(n=2\) observations per level): The parameter vector is expanded to include the baseline/grand mean \(\mu\), so \(\boldsymbol{\beta} = (\mu, \alpha_1, \alpha_2, \dots, \alpha_a)^T\). The model \(\mathbf{y} = X_2\boldsymbol{\beta} + \mathbf{e}\) is written as: \[
\begin{pmatrix} y_{11} \\ y_{12} \\ y_{21} \\ y_{22} \\ \vdots \\ y_{a1} \\ y_{a2} \end{pmatrix} = \begin{pmatrix} 1 & 1 & 0 & \dots & 0 \\ 1 & 1 & 0 & \dots & 0 \\ 1 & 0 & 1 & \dots & 0 \\ 1 & 0 & 1 & \dots & 0 \\ \vdots & \vdots & \vdots & \ddots & \vdots \\ 1 & 0 & 0 & \dots & 1 \\ 1 & 0 & 0 & \dots & 1 \end{pmatrix} \begin{pmatrix} \mu \\ \alpha_1 \\ \alpha_2 \\ \vdots \\ \alpha_a \end{pmatrix} + \begin{pmatrix} e_{11} \\ e_{12} \\ e_{21} \\ e_{22} \\ \vdots \\ e_{a1} \\ e_{a2} \end{pmatrix}
\]
Non-identifiability and Constraints: The effects model has the same model matrix as the cell-means model, but with one extra column of ones in the first position (\(j_N\)). Notice that \(\sum_i x_i = j_N\) and \(\text{Col}(X_1) = \text{Col}(X_2)\). Therefore, the columns of the model matrix \(X_2\) are linearly dependent. In this \(2a \times (a+1)\) matrix, the first column is exactly equal to the sum of the remaining \(a\) columns.
Because \(X_2\) is not full rank (it has rank \(a\), but \(a+1\) columns), the parameters are not uniquely identified. \(\mu\) is in no sense the grand mean; it is just an arbitrary baseline value. However, subject to the constraint \(\sum_i \alpha_i = 0\), the parameters gain specific interpretations:
- Grand Mean (\(\mu\)): The grand mean response across all treatments.
- Treatment Effect (\(\alpha_i\)): The deviation from the grand mean.
Adding the constraint \(\sum_i \alpha_i = 0\) essentially reparameterizes the overparameterized (non-full rank) effects model to a just-parameterized (full rank) model equivalent to the cell means model.
Definition 9.1 (Equivalent Linear Models) In general, two linear models \(y = X_1 \beta_1 + e_1\) and \(y = X_2 \beta_2 + e_2\) with the same assumptions on \(e_1\) and \(e_2\) are equivalent linear models if \(\text{Col}(X_1) = \text{Col}(X_2)\).
Different Strategies for Non-Full Rank Models
When faced with a non-full rank model, we have three ways to proceed:
- Reparameterize to a full rank model (e.g., cell-means model).
- Add constraints (side-conditions) to the model parameters (e.g., \(\sum \alpha_i = 0\)).
- Analyze the model as a non-full rank model, limiting inference to estimable functions.