Power is an important consideration for equivalence tests. The Power and Sample Size Overview describes power and other related concepts. These concepts apply to equivalence tests, but remember that the hypotheses for an equivalence test are different from the hypotheses for a typical test of population means.
Consider the difference between a 2-sample t-test and a 2-sample equivalence test. You use a 2-sample t-test to test whether the means of two populations are different. The hypotheses for the test are as follows:
If the p-value for the test is less than alpha, then you reject the null hypothesis and conclude that the means are different.
In contrast, you use a 2-sample equivalence test to test whether the means of two populations are equivalent, within a range of values that you specify (the equivalence interval). The hypotheses for the test are as follows:
If the p-value for the test is less than alpha, then you reject the null hypothesis and conclude that the means are equivalent.
Thus, power for an equivalence test is the likelihood that you will conclude that the difference of interest is within your equivalence limits, when this is in fact true. If your test has low power, you may mistakenly conclude that the difference is not within your equivalence limits when it actually is. The following factors affect the power of your test:
Minitab provides power, sample size, and difference calculations for the following equivalence tests:
Equivalence Test with Paired Data
Equivalence Test for 2x2 Crossover Design