Statistics [15]: Analysis of Variance - F test

4 minute read

Published:

In statistics, analysis of variance (ANOVA) is a collection of statistical models, and their associated procedures, in which the observed variance is partitioned into components due to different explanatory variables. In its simplest form ANOVA gives a statistical test of whether the means of several groups are all equal, and therefore generalizes Student’s two-sample test to more than two groups.


One-Way ANOVA

The one-way ANOVA compares the means between the groups we are interested in and determines whether any of those means are statistically significantly different from each other. Specifically, it tests the null hypothesis:

where .

ANOVA Model

The independent variable (grouping variable) that we are interested is called factor, and each factor may have two or more levels. In the table below, are levels of factor ; is number of independent tests under each level; is the test results.

level \ test number

Assume

Further, let

We have

The hypothesis could be

ANOVA Problem

The hypothesis test problem:

The test statistics:

Variance Analysis

where

Within group sum of square:

Between group sum of square:

For ,

For ,

When hypothesis is true,

Hence,

The hypothesis is thus rejected for large values of .

Example 1

Comparison of pesticides.

 
879056559275
858862489972
8087  9581
 94  91 

ANOVA

 Sum of SquaresdfMean SquareFSig.
Between Groups3794.5005758.90051.162.000
Within Groups178.0001214.833  
Total3972.50017   

The significance is obvious.

Example 2

Lifetime of lightbulbs.

 12345678
1600161016501680170017001780 
15001640140017001750   
16401550160016201640160017401800
151015201530157016401680  

ANOVA

 Sum of SquaresdfMean SquareFSig.
Between Groups39776.456313258.8191.638.209
Within Groups178088.929228089.951  
Total217865.38525   

The significance is not obvious.


Two-Way ANOVA

A two-way ANOVA is used to estimate how the mean of a quantitative variable changes according to the levels of two categorical variables. Use a two-way ANOVA when we want to know how two independent variables, in combination, affect a dependent variable.

ANOVA Model

Assume there are two factors and with levels and respectively. Then, there will be different combinations . Perform independent tests under each combination, tests in total would be necessary.

\

Assume

Let

We have

where reflects the total effects of on the experiment results, is called interactive effect and and are called main effects.

Further,

In summary,

and all are independent.

Hypothesis

Variance Analysis

where

Let

Then,

The expectation would be

Summary,

SourceSum of SquareDOFMean Square value
 
  

Example 3

Influence of fertilizer and seeds on the production.

seeds \ production \ fertilizer
173,172174,176177,179172,173
175,173178,177174,175170,171
177,175174,174174,173169,169

ANOVA

 Sum of SquaresdfMean SquareFSig.
Seeds8.08324.0424.409.037
Fertilizer90.833330.27833.030.000
Seeds * Fertilizer51.91768.6539.439.001
Error11.00012.917  
Total161.83323   

The significance of fertilizer and (seeds * fertilizer) is obvious, however, the significance of seeds is not as significance as fertilizer.


Table of Contents

Comments