ANALYSIS OF VARIANCE (ANOVA)



Meaning of ANOVA.
          ANOVA is a statistical technique used for portioning the variation in an observed data into it’s different sources. In other words, it is a technique employed to breakdown the total variation in an experiment to it’s additive components. With this technique or method therefore, we can break down the total variation, occurring in a dependent variable, into various separate factors causing the variation.
          ANOVA is a statistical method developed by R.A Fisher in 1948 to handle research problems in agriculture and biology. ANOVA can also be called F-ratio. It can be used for two comparative samples but it is called student t-ratio, or t-test. Fisher defined ANOVA as the separation of the variance due to other groups the overall approach in analysis of variance is to partition the variation in the observed set of data into a number of components. This technique does not stipulate any functional difference between the dependent and the independent variables.
Terms used in ANOVA
i.        Factor:
          It is an independent variable to be studied in an experiment. It is a variable whose effect on another variable is to be studied. Example in the study of the effect of fertilizer on the yield/growth rate of cassava, the factor on understudy is fertilizer.
ii.       Factor Level:
          It is sources of a particular factor. For instance, in the study of the effect of fertilizer on the growth rate/yield of cassava, the different sources of fertilizer such as N.P.K, Nitrogen phosphorus and potassium are called factor levels.
iii.      Treatment:
          A treatment is equivalent to a facto in a single-factor analysis. In a multi-factor experiment or analysis, a treatment is equivalent to a combination of factor levels.
iv.      ANOVA Table:  It is a table that shows in summary form, the computations for analysis of variance. This table enables us to have a quick and convenient assessment of the sources of variation, their respective sum of squares and degrees of freedom which are the results of ANOVA. The ANOVA table.
1.       The degree of freedom written as 9d.f) to obtained the degree of freedom for variation within treatments is a-1 while the d.f for total variation sis ab-1 and this is always equals to the sum of the degrees of freedom for the between-treatments and within treatments.
2.       Mean sum of squares. This is also called the variance (S2). It is calculated as sum of squares divided by the corresponding degrees of freedom i.e.
= summation or additive sign
:. 2  = sum of squares (SS) of deviation from the mean.
N       =       1 = Degrees of freedom (df).
Another formular for calculating variance is
         
The first formular is called deviation formular second is called the machine formular.
3.       F-test or F-calculated (fisher test). This is obtained by dividing the mean square of treatment by the mean square of the experimental or residual error.

          In testing the significance of the F-test, if F-tabulated given the same degrees of freedom then the differences between the treatments are significant i.e the treatments given in the experiment have difference which must by carefully monitored. But if the F-Cal is smaller than the F-tab, the it means differences between the treatments given are similar.

THE ESSENCE OF ANOVA
          Analysis of variance is a very important analytical tool developed by R.A fisher for the analysis of experimental data. Hence it is employed in such fields of study as agriculture, medicine, biology, engineering, economics and other social researchers to analyse data and achieve logical conclusions.
          Besides determining the various factors which cause variation of the dependent variable, ANOVA could also be useful in following areas:
i.        testing the overall significance of the regression,
ii.       testing the significance of the  improvement in fit obtained by the introduction of additional explanatory variables in the function.
iii.      testing the equality of coefficient s obtained from different samples.
iv.      testing the stability of coefficients of regression.
v.       testing restrictions imposed on coefficient of a function .
          Basically, analysis of variance technique is used to test hypothesis concerning population means; and it is employed in regression analysis to conduct various tests of significance. Hence, it is an essential component of any regression analysis.
Types of ANOVA.
ANOVA is classified into two basic types: one factor classification and two-factor classification the classification is based on the number of factors to be studied.
A.      One-factor classification
          The analysis based on one factor classification is used to find out the effect of a single independent variable (a factor) on the dependent variable. For instance, in the study of the level of cassava yield using different sources of nitrogen, we are actually considering the effect of nitrogen on the yield of maize is dependent on it. To ascertain if the variation in the cassava yield is due to the treatment (that is, nitrogen application or due to natural) THE ONE-FACTOR ANALYSIS IS ADOPTED.
          However, to adopt and apply this one-factor analysis of variance technique, the following assumptions must be made:-
1.       The treatment effect is fixed
2.       The total effect of the treatment is equal to zero.
3.       The expected value of the error effect is equal to zero.
4.       The error is normally and independently distributed with mean zero and variance, 2.
b.      Two-factor ANOVA
  Unlike the one-factor ANOVA, the two-factor classification contains more than two variable. It is made up of one dependent variable and two independent variables, variable, (factors) on the dependent variable. This technique therefore, enables us to estimate not only the separate effects of the factors (independent variables) but also the join effect (Interaction effect) of these factors on the dependent variable.
          Consider the study of the effects of fertilizer and soil type on the yield of cassava. There are two independent variables or factors namely; fertilizer and soil type, and one dependent variable (cassava yield). Hence the effects of these two factors-fertilizer and soil type- can be analyzed using the two factor analysis of variance technique.
          By using this technique. The separate effect of the two factors on the yield of cassava will be determined. Also, the factors on the yield of cassava will be determined. Also, the combined effect (interaction effect) of the two factors on cassava yield can be shown. However, when the two factors do not have any interaction effect, we say that they have additive effect is on the dependent variable (cassava yield).
          In applying the two-factor analysis of variance technique, the same assumptions holding in a one factor technique still hold as follows;-
1.       The treatment effect is fixed that is Tr is fixed.
2.       The total effect of the treatment is equal to zero; that is
3.      The sum expected value of the error effect is equal to zero i.e    
4.      The error is normally and independently distributed with mean zero and variance.

READ RECENT UPDATES HERE