Median polish

An exploratory data analysis (EDA) procedure that creates an additive model to describe data in a two-way table. This additive model represents each data value in the table as a sum of four components: a common effect, a row effect, a column effect, and a residual. This format isolates the effects of the row and column factors to reveal effects that are not readily apparent in the original data, and to help you determine whether these effects justify formal statistical inference.

For example, suppose a manufacturing facility employs three shifts of welders who can choose between two different welding methods. Inspectors measure weld strength and create a two-way table of the observations, with rows corresponding to the shift during which each weld was made, and columns corresponding to the welding method. Although weld strength is influenced by both welding method and shift, the raw data itself do not indicate how each individual factor contributes to weld strength. The median polish procedure, however, helps ascertain this information.

The results of the median polish procedure are represented below. Each component of the additive model is separated from the others and labeled in bold.  Interpretations of each component follow.

 

 

Weld Method A

Weld Method B

 

 

Residuals:

Row effects:

Shift 1

10

0

2

Shift 2

-1

1

0

Shift 3

0

-1

-2

Column effects:

-5

5

10 = Common effect

 

Common Effect

The common effect is 10. The common effect represents a typical measure of weld strength against which the effects of the two factors welding method and shift will be compared for relative strength.

Row Effects

Median polish produces a column of row effects that indicate the influence of each level of the row factor on the observed data. The row effects above lead inspectors to conclude the following:

·  Median weld strength for the first shift is 2 units greater than the common effect.

·  Median weld strength for the second shift is equal to the common effect.

·  Median weld strength for the third shift is 2 units less than the common effect.

In other words, a pattern emerges in which weld strength decreases as shifts progress. This EDA procedure suggests that the inspectors should formally analyze the effect of work shifts on weld strength.

Column Effects

Median polish also produces a row of column effects that indicate the influence of each level of the column factor on the observed data. The column effects above lead the inspectors to conclude the following:

·  Median weld strength for Weld Method A is 5 units less than the common effect.

·  Median weld strength for Weld Method B is 5 units greater than the common effect.

Again, the EDA reveals a pattern, and the inspectors have sufficient justification to analyze the effect of welding method on weld strength with formal statistical inference.

Residuals

Lastly, the residuals remain in the body of the table. These residuals reveal extraordinary values by indicating how each observation differs from its expected value (the sum of the common, row, and column effects). For example, the only exceptional residual value is 10 for "First Shift, Weld Method A". Inspectors conclude that, although the column effects suggest that Weld Method A produces a lower median weld strength, first shift welders using Weld Method A produce welds uniquely greater than expected. This effect is worth investigating further.