|
We continue our exploration of a simple multivariate anova by looking at different effect sizes of the factors and different correlations between the variables. We have seen that that two apparently insignificant univariate anovas can be shown by a multivariate anova to have masked differences which are in fact quite significant. This is because the multivariate analysis takes the correlation between the variables into account, something that multiple univariate analyses cannot do. In this particular case, the positive correlation between the variables sets an expectation that any differences between the variable means should also show a positive relationship, and when the data shows that the differences have a negative relationship it is detected as significant. Our next enquiry concerns the effect size which does yield a significant univariate result when the differences between the variable means show the same relationship as would be expected from the correlation between the variables.
Consistent centroid trend (r = .71), partially significant univariate, significant multivariateWe take the data of Table 1 from the page introducing the Multivariate Anova and change the effect size of our elixir to increase the mean Confidence of the Treatment group from 1 to 3, as per Table 1 below. Table 1. Data and descriptive statistics
The Treatment group has a higher mean Confidence and higher mean Test score than the Control. The effect sizes are shown in Table 2. Table 2. Effect sizes
While the Test score difference is half a standard deviation in size, the Confidence difference is 1.2 standard deviations in size. The scattergram for this data is shown in Figure 1. Figure 1. Scattergram of the data of Table 1
Visually, Figure 1 paints a picture of considerable separation of the Treatment group markers from the Control group markers, mainly along the Confidence axis. Notice that the centroid trend line is similarly positive and consistent with the trend lines of the group data, and that the centoids are further apart than seen in Figure 3 of the earlier Multivariate Anova discussion. Note that the error ovals (see Multivariate Error Oval) do not overlap. The multivariate test is shown in Table 3. Table 3. Multivariate test of Table 1 data
This is significant. There is a statistically significant Treatment effect when considering Confidence and Test score together, which allows us to examine the elixir Treatment effect on Confidence and on Test score considered separately. This is shown in Table 4. Table 4. Univariate tests of Table 1 data
There are no surprises here and the interpretation of the results is straightforward — the statistically significant Treatment effect when considering Confidence and Test score together is shown to be mainly due to a statistically significant Treatment effect on Confidence (effect size = 1.20), p = .01; the Treatment effect on Test score is not significant (effect size = 0.51), p = .30.
Consistent centroid trend (r = .71), significant univariate, insignificant multivariateOur next enquiry concerns the situation where there are significant univariate results yet the multivariate analysis stubbornly refuses to declare a significant result. As before, we reengineer our data to have "large" effect sizes while keeping the same pattern of correlation for both the data points and the mean data centroids. We skip the raw data table and show the effect size table only, as in Table 5, and then show the scattergram in Figure 2. Table 5. Effect sizes
Figure 2. Scattergram of the data summarised in Table 5 as derived from the data of Table 1
Notice the centroid trend line, which is giving us a visual representation of the two-variable "distance" between the Treatment and Control groups. In particular, notice that the length of this trend line is very similar to the length of the Figure 1 centroid trend line, where the "distance" between Treatment and Control groups was shown to be significant by the multivariate analysis of Table 3. The Confidence and Test score differences are a little under one standard deviation in size, said by Cohen to be "large". Two large effect sizes should give us two significant univariate results, but do they yield a significant multivariate result? We can anticipate by inspection of the error ovals, which seem to have a slight overlap. Table 6 shows the multivariate test, and Table 7 shows the univariate tests. Table 6. Multivariate test of data illustrated in Figure 2.
Table 7. Univariate tests following Table 6 test
Although we have the expected two significant univariate tests, p = 0.05 each, the multivariate test fails to reach significance, p = 0.12. The strong positive correlation between the two variables means that, taken together, their centroids are not significantly different even though the differences between their individual means are significantly different. Another way of saying this is that the two groups of data, Treatment and Control, are judged to have come from the same population (are not significantly different) when the measures, Confidence and Test score, are considered together.
Consistent centroid trend (r = .93), significant univariate, insignificant multivariateThis enquiry examines the situation with a very high correlation between the measures, r = 0.93. The data is reengineered to have particularly large effect sizes around 1, giving significant univariate results but an insignificant multivariate test. The effect size table is shown in Table 8, and the scattergram in Figure 3. Table 8. Effect sizes
Figure 3. Scattergram of data with r = 0.93 and effect sizes summarised in Table 8
We can anticipate the multivariate test result from inspection of the error ovals, which seem to have a slight overlap. Table 9 shows the multivariate test, and Table 10 shows the univariate tests. Table 9. Multivariate test of data illustrated in Figure 3.
Table 10. Univariate tests of data illustrated in Figure 3.
Despite the two univariate tests being significant, p = .02 each, the multivariate test declares that the Treatment and Control groups cannot be shown to have come from different populations when the measures are considered together, p = .06. As before, we can attribute this result to the exceptional correlation between the measures being matched by the trend of the centroid trend line which is exactly consistent with the correlation.
When r = 0Our next enquiry concerns the situation when there is negligible correlation between the measures. We reengineer our data to have "large" effect sizes. Figure 3 shows the scattergram. Figure 4. Scattergram of data with error ovals for r=0
For this data, the multivariate and univariate tests are shown in Tables 11 and 12.
Table 11. Multivariate test of Figure 4 data The multivariate test shows that the two group centroids are significantly different, p = .03, and indeed the error ovals show no overlap. For interest, the univariate tests are shown in Table 12, where the differences in mean Confidence and mean Test score are also significant, and to a similar degree, p = 0.05 and 0.05, as that seen for the multivariate test of the centroids.
Table 12. Univariate tests of Figure 4 data
When r is negative, inconsistent centroid trendWe check our original finding (Figure 2 in Multivariate Anova) that the multivariate analysis is a sensitive test of a centroid trend that is contrary to the correlation seen in the group data, even when the correlation is negative. The example data has been reengineered to yield the scattergram of Figure 5, an approximately mirror image of the earlier scattergram. Figure 5. Scattergram with group r ≈ –.72, effect sizes 0.50
The results of the multivariate and univariate tests are given in Tables 13 and 14. Table 13. Multivariate test of Figure 5 data
As before, there is a statistically significant Treatment effect considering Confidence and Test score together, p = .04. Table 14. Univariate tests of Figure 5 data
The elixir has no significant effects on Confidence or Test score when these measures are considered separately, p = .27 and .31 respectively. The multivariate analysis tests and flags a result that cannot be tested or flagged by univariate tests — whether the Treatment effect is consistent with, or the opposite of, what would be expected given the correlation between the measures.
Inconsistent group correlationFinally, we explore the most unlikely of data, where one group shows a positive correlation between the measures, and the other shows the opposite. Our data is engineered to have r = 0.71 in the Control group data as may be seen in Figure 2 and most of the other scattergrams, and r = –0.74 in the Treatment group as seen above in Figure 5. The effect sizes are 0.5. The scattergram is shown in Figure 6. Figure 6. Scattergram with Control group r = .71, Treatment group r = – .74
The overlap of the error bars suggests the multivariate test will be insignificant. It is shown in Table 15 with the univariate tests in Table 16. Table 15. Multivariate test of the data in Figure 6.
As anticipated, the multivariate test is not significant; the centroids are not significantly different, p = 0.37. Table 16. Univariate tests of the data in Figure 6.
We have seen this data with these effect sizes before, and they routinely yield insignificant univariate tests with each p in the region of 0.30 (slightly different values elsewhere due to rounding errors). We can see that the multivariate test p and the univariate test p's are similar in magnitude. To round off our exploration of multivariate testing, it is interesting to run an analysis where r = 0 for both groups and the effect sizes are 0.50 as seen before. Unsurprisingly, the multivariate test returns p = 0.37, and the univariate tests return p = 0.30 and p = 0.31. Wait. We have just seen these more or less identical results when the r for Control was .71 and the r for Treatment was –.74. What is interesting is that the multivariate analysis treats conflicting correlation in the groups as cancelling each other out to give an effective "overall" r of 0. Indeed, when the raw data for the two groups are combined, overall r = 0.07, and when the raw data is de-trended (§1), r = 0.00. (§1) In general, the correlation seen in any two groups will be different. To identify the correlation between the measures, the multivariate analysis de-trends and then combines the data from the different groups to give an "overall" estimate. The "de-trend" eliminates the group mean differences from the data and then sets their standard deviations equal, a normalization which gives group means of 0 and standard deviations of 1.
Previously, Multivariate Anova and Multivariate Error Oval.
|
©2025 Lester Gilbert |