Department of Industrial Psychology, Stellenbosch University
Published
January 21, 2025
Introduction
In this tutorial I demonstrate how to examine the similarity of factor solutions across different groups with Tucker’s phi coefficient (aka Burt’s coefficient of congruence). Here I examine the similarity of the big five personality factors (as measured by the Big Five Inventory) of men and women with the fa(), fa.congruence(), and target.rot() functions of the psych package (Revelle, 2024a) in R. I use the bfi data set that is included in the psychtools (Revelle, 2024b) package. This data set contains the responses of 2800 people to the items of the Big Five Inventory. There are 919 men (coded as 1) and 1881 women (coded as 2).
Lorenzo-Seva and ten Berge’s (2006) excellent article describes the meaning of the coefficient of congruence and how it can be used to evaluate the similarity of factors across groups. They suggested that coefficients of congruence in the range 0.85 to 0.94 suggest fair similarity, while values above 0.95 suggest good similarity.
Descriptive statistics of the Big Five Inventory items
Select the persons that represent the reference and focal groups
Here we choose the men (coded as 1) as the reference group and the women (coded as 2) as the focal group. We store the data of the reference group as personsR and those of the focal group as personsF.
By default the factors were extracted using unweighted least squares (aka minimum residual or minres) and obliquely rotated according to the direct oblimin criterion. Inspection of the rotated factor pattern matrix shows that the first factor, labeled MR1, has high loadings on the Extroversion items (items E1 to E5), the second factor, labeled MR2, has high loadings on the Neuroticism items (items N1 to N5), the third factor, labeled MR5, has high loadings on four Agreeableness items (items A2 to A5 – the loading of item A1 “unexpectedly” was relatively low), the fourth factor, labeled MR3, has high loadings on the Conscientiousness items (items C1 to C5), and the fifth factor, labeled MR4, has high loadings on the Openness items (items O1 to O5). It appears safe to label the five factors of the reference group (the men) as Extroversion (MR1), Neuroticism (MR2), Agreeableness (MR5), Conscientiousness (MR3), and Openness (MR4).
Inspection of the rotated factor pattern matrix of the focal group shows that the first factor, labeled MR2, has high loadings on the Neuroticism items (items N1 to N5), the second factor, labeled MR1, has high loadings on the Extroversion items (items E1 to E5), the third factor, labeled MR3, has high loadings on the Conscientiousness items (items C1 to C5), the fourth factor, labeled MR5, has high loadings on the Agreeableness items (items A1 to A5), and the fifth factor, labeled MR4, has high loadings on the Openness items (items O1 to O5). It appears safe to label the five factors of the focal group (the women) as Neuroticism (MR2), Extroversion (MR1), Conscientiousness (MR3), Agreeableness (MR5), and Openness (MR4).
Whereas the order of the factors are different across the two groups, each of the big five factors were found for both the reference and the focal group. What remains to be seen is how similar the patterns of high and low loadings are across the two groups.
Factor Analysis using method = minres
Call: fa(r = personsF[, items], nfactors = nfactors, rotate = rotation)
Standardized loadings (pattern matrix) based upon correlation matrix
MR2 MR1 MR3 MR5 MR4 h2 u2 com
A1 0.22 0.22 0.10 -0.44 -0.09 0.24 0.76 2.3
A2 -0.01 0.00 0.07 0.63 0.05 0.43 0.57 1.0
A3 -0.02 0.15 0.05 0.62 0.01 0.50 0.50 1.1
A4 -0.06 0.08 0.16 0.42 -0.18 0.26 0.74 1.8
A5 -0.10 0.26 0.00 0.48 0.05 0.43 0.57 1.7
C1 0.06 0.01 0.54 -0.06 0.13 0.32 0.68 1.2
C2 0.17 -0.02 0.68 0.05 0.00 0.46 0.54 1.1
C3 0.01 -0.09 0.56 0.10 -0.04 0.32 0.68 1.1
C4 0.21 0.06 -0.61 0.01 -0.06 0.46 0.54 1.3
C5 0.23 -0.08 -0.54 -0.02 0.08 0.41 0.59 1.5
E1 -0.04 -0.53 0.10 -0.06 -0.10 0.31 0.69 1.2
E2 0.16 -0.65 0.02 -0.04 -0.07 0.52 0.48 1.2
E3 0.10 0.45 0.01 0.23 0.25 0.44 0.56 2.2
E4 -0.01 0.61 0.03 0.25 -0.10 0.52 0.48 1.4
E5 0.12 0.39 0.27 0.01 0.24 0.37 0.63 2.7
N1 0.80 0.09 -0.01 -0.09 -0.04 0.65 0.35 1.1
N2 0.77 0.03 0.02 -0.10 0.02 0.60 0.40 1.0
N3 0.73 -0.06 -0.02 0.03 0.03 0.55 0.45 1.0
N4 0.51 -0.37 -0.12 0.09 0.06 0.49 0.51 2.1
N5 0.53 -0.16 -0.01 0.17 -0.15 0.36 0.64 1.6
O1 0.05 0.17 0.11 0.03 0.46 0.31 0.69 1.4
O2 0.21 0.09 -0.07 0.11 -0.48 0.28 0.72 1.6
O3 0.06 0.22 0.03 0.08 0.58 0.46 0.54 1.3
O4 0.19 -0.28 0.00 0.18 0.35 0.24 0.76 3.1
O5 0.10 0.11 0.01 -0.02 -0.56 0.31 0.69 1.2
MR2 MR1 MR3 MR5 MR4
SS loadings 2.73 2.17 1.99 1.80 1.55
Proportion Var 0.11 0.09 0.08 0.07 0.06
Cumulative Var 0.11 0.20 0.28 0.35 0.41
Proportion Explained 0.27 0.21 0.19 0.18 0.15
Cumulative Proportion 0.27 0.48 0.67 0.85 1.00
With factor correlations of
MR2 MR1 MR3 MR5 MR4
MR2 1.00 -0.19 -0.17 -0.11 -0.02
MR1 -0.19 1.00 0.20 0.32 0.18
MR3 -0.17 0.20 1.00 0.20 0.19
MR5 -0.11 0.32 0.20 1.00 0.19
MR4 -0.02 0.18 0.19 0.19 1.00
Mean item complexity = 1.5
Test of the hypothesis that 5 factors are sufficient.
df null model = 300 with the objective function = 7.12 with Chi Square = 13313
df of the model are 185 and the objective function was 0.68
The root mean square of the residuals (RMSR) is 0.03
The df corrected root mean square of the residuals is 0.04
The harmonic n.obs is 1854 with the empirical chi square 931 with prob < 8.5e-100
The total n.obs was 1881 with Likelihood Chi Square = 1262 with prob < 1.6e-159
Tucker Lewis Index of factoring reliability = 0.866
RMSEA index = 0.056 and the 90 % confidence intervals are 0.053 0.059
BIC = -133
Fit based upon off diagonal values = 0.98
Measures of factor score adequacy
MR2 MR1 MR3 MR5 MR4
Correlation of (regression) scores with factors 0.93 0.89 0.87 0.86 0.84
Multiple R square of scores with factors 0.86 0.78 0.77 0.74 0.70
Minimum correlation of possible factor scores 0.71 0.57 0.53 0.49 0.40
Factor congruence coefficients
The rows of the table below represent the factors of the reference and the columns the factors of the focal group. The same number of factors were extracted in both groups and the same oblique rotation criterion was applied (by default direct oblimin).
Working our way across the rows of the matrix we see that factor MR1 (Extroversion) of the men has a coefficient of congruence of -0.92 with factor MR1 (Extroversion) of the women. The negative sign indicates that the poles of the factor of the men are opposite of that of the women. The direction of the poles is arbitrary and one could simply change the signs of all the factor loadings of factor MR1 of the women, which would yield a positive coefficient of congruence of 0.92. The remaining coefficients in the first row are all relatively small, which simply indicates that the Extroversion factor of the men is very dissimilar to the Neuroticism, Agreeableness, Consicientiousness, and Openness factors of the women. This is a good sign.
The second row shows that factor M2 of the men (Neuroticism) and factor M2 of the women (Neuroticism) has a coefficient of congruence of 0.98. This signifies that the pattern of high and low loadings on this factor is very similar across the two groups.
The third row shows that factor M5 of the men (Agreeableness) and factor M5 of the women (Agreeableness) has a coefficient of congruence of 0.93. This signifies that the pattern of high and low loadings on this factor is fairly similar across the two groups.
The fourth row shows that factor M3 of the men (Concientiousness) and factor M3 of the women (Conscientiousness) has a coefficient of congruence of 0.99. This signifies that the pattern of high and low loadings on this factor is very similar across the two groups.
Finally, the fifth row shows that factor M4 of the men (Openness) and factor MR4 of the women (Openness) has a coefficient of congruence of 0.98. This signifies that the pattern of high and low loadings on this factor is very similar across the two groups.
In summary, we see that three factors manifested very good similarity across the two groups, namely Neuroticism, Conscientiousness, and Openness. The Extroversion and Agreeableness factors manifested fairly similarly across the men and women.
Factor congruence coefficients after target rotation
The factors of the reference group (the men), myfactorsR, were obliquely rotated to simple structure (by default this is according to the direct oblimin criterion). The factors of the focal group (the women), myfactorsF, were rotated to be as similar as possible to the factors of the reference group with the target.rot() function of the psych package. This function takes as first argument the factor matrix that should be rotated to the target, and as second argument an arbitrary target matrix. A common application would be to build a target matrix using -1, 0 an 1 as the weights. However, in this example, the rotated factor matrix of the reference group is specified as the target. The target rotated factor matrix of the focal group is stored as myFactorsFt.
The rows of the factor congruence matrix represents the factors of the reference group and the columns those of the focal group. Inspection of the factor solution of the reference group (personsR) shows that the first row (MR1) represents Extroversion, the second row (MR2) represents Neuroticism, the third row (MR5) represents Agreeableness, the fourth row (MR3) represents Conscientiousness, and the fifth row (MR4) represents Openness.
Results show that the coefficients of congruence of corresponding factors were 0.94 (Extroversion), 0.99 (Neuroticism), 0.96 (Agreeableness), 0.99 (Conscientiousness), and 0.98 (Openness).
### Here we rotate the factors of the focal group to be as similar as possible to the factors of the men. We then examine the congruence of the target rotated factors of the women to those of the men.myfactorsFt <-target.rot(myfactorsF$loadings, myfactorsR$loadings)fa.congruence(myfactorsR$loadings, myfactorsFt)
Following the interpretation guidelines of Lorenzo-Seva and ten Berge (2006), the Agreeableness, Neuroticism, Conscientiousness and Openness factors have good similarity across the men and women. The Extroversion factor has fair similarity, but the congruence coefficient of 0.94 falls right on the border of fair versus good similarity.
The coefficient of congruence is insensitive to differences in the absolute sizes of the factor loadings. A very high coefficient can be obtained as long as the pattern of high and low loadings is proportionally similar across the two groups. This means that a perfect coefficient of congruence can be found if the factor loadings in one group differ by a constant from those of the second group.
In this respect it is useful to plot the factor loadings of corresponding factors across the two groups and to compare it to an identity line (that represents perfect correspondence). An advantage of the plot is that it can highlight particular variables that have different factor loadings.
Scatter plots of corresponding factors
In the plots below we see that the factors manifested very similarly across the two groups. It is noticeable that the factor loadings for Extroversion show greater dispersion around the identity line than the remaining factors.
The root mean square difference of factor loadings
The overall similarities of the factors can also be expressed with reference to differences in the size of the corresponding factor loadings across the two groups. In this respect the root of the mean squared difference (RMSD) of the corresponding factor loadings reflects the average size of the differences.
In our example the RMSDs of the Extroversion and Agreeableness factors were slightly higher than those of the remaining factors. These were also the factors that had the lowest coefficients of congruence.
Lorenzo-Seva, U. & ten Berge, J. M. F. (2006). Methodology, 2(2), 57:64. doi: 10.1027/1614-1881.2.2.57
Revelle, W. (2024a). psych: Procedures for psychological, psychometric, and personality research. Northwestern University, Evanston, Illinois. R package version 2.4.6, https://CRAN.R-project.org/package=psych.
Revelle, W. (2024b). psychTools: Tools to accompany the ‘psych’ package for psychological research. Northwestern University, Evanston, Illinois. R package version 2.4.3, https://CRAN.R-project.org/package=psychTools.