Confirmatory factor analysis

In statistics, confirmatory factor analysis (CFA) is a special form of factor analysis, most commonly used in social research.^[1] It is used to test whether measures of a construct are consistent with a researcher's understanding of the nature of that construct (or factor). As such, the objective of confirmatory factor analysis is to test whether the data fit a hypothesized measurement model. This hypothesized model is based on theory and/or previous analytic research.^[2] CFA was first developed by Jöreskog^[3] and has built upon and replaced older methods of analyzing construct validity, such as the multitrait-multimethod (MTMM) matrix described by Campbell & Fiske (1959).^[4]

In confirmatory factor analysis, the researcher first develops a hypothesis about which factors they believe underlie the measures used (e.g., "Depression" being the factor underlying the Beck Depression Inventory and the Hamilton Rating Scale for Depression) and may impose constraints on the model based on these a priori hypotheses. By imposing these constraints, the researcher forces the model to be consistent with their theory. For example, if it is posited that two factors account for the covariance in the measures, and that these factors are unrelated to one another, the researcher can specify a model in which the correlation between factor A and factor B is constrained to zero. Model fit measures can then be obtained to assess how well the proposed model captures the covariance among all the items or measures in the model. If the constraints imposed on the model are inconsistent with the sample data, statistical tests of model fit will indicate a poor fit, and the model will be rejected. Poor fit may arise because some items measure multiple factors, or because some items within a factor are more strongly related to each other than to the remaining items.
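As a rough illustration of the two-factor example above, the following sketch builds the covariance matrix implied by a CFA model, Sigma = Lambda Phi Lambda' + Theta, in plain Python. The loadings, residual variances, and helper names are invented for illustration and are not drawn from any cited source; setting the off-diagonal element of the factor covariance matrix to zero encodes the hypothesis that the two factors are uncorrelated.

```python
def matmul(a, b):
    """Multiply two matrices stored as lists of rows."""
    return [[sum(a[i][k] * b[k][j] for k in range(len(b)))
             for j in range(len(b[0]))] for i in range(len(a))]


def transpose(a):
    """Return the transpose of a matrix stored as a list of rows."""
    return [list(row) for row in zip(*a)]


def implied_covariance(loadings, factor_cov, residual_var):
    """Return Sigma = Lambda * Phi * Lambda' + Theta, with Theta diagonal."""
    sigma = matmul(matmul(loadings, factor_cov), transpose(loadings))
    for i, theta in enumerate(residual_var):
        sigma[i][i] += theta
    return sigma


# Hypothetical values: items 1-2 load on factor A, items 3-4 on factor B.
# The zeros in LOADINGS are the "zero loading" constraints.
LOADINGS = [[0.8, 0.0],
            [0.7, 0.0],
            [0.0, 0.9],
            [0.0, 0.6]]
# Factor variances fixed to 1; factor correlation constrained to 0.
FACTOR_COV = [[1.0, 0.0],
              [0.0, 1.0]]
RESIDUAL_VAR = [0.36, 0.51, 0.19, 0.64]

SIGMA = implied_covariance(LOADINGS, FACTOR_COV, RESIDUAL_VAR)
for row in SIGMA:
    print([round(v, 2) for v in row])
```

Under this specification, items on different factors have an implied covariance of zero, so sizeable sample covariances between them would appear as misfit.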

For some applications, the requirement of "zero loadings" (for indicators not supposed to load on a certain factor) has been regarded as too strict. A more recently developed method, "exploratory structural equation modeling", specifies hypotheses about the relation between observed indicators and their supposed primary latent factors while also allowing loadings on other latent factors to be estimated.^[5]

Confirmatory factor analysis and exploratory factor analysis

Both exploratory factor analysis (EFA) and confirmatory factor analysis (CFA) are employed to understand shared variance of measured variables that is believed to be attributable to a factor or latent construct. Despite this similarity, however, EFA and CFA are conceptually and statistically distinct analyses.

The goal of EFA is to identify factors based on data and to maximize the amount of variance explained.^[6] The researcher is not required to have any specific hypotheses about how many factors will emerge, or which items or variables these factors will comprise. If such hypotheses exist, they are not incorporated into and do not affect the results of the statistical analyses. By contrast, CFA evaluates a priori hypotheses and is largely driven by theory. CFA requires the researcher to hypothesize, in advance, the number of factors, whether or not these factors are correlated, and which items/measures load onto and reflect which factors.^[7] As such, in contrast to exploratory factor analysis, where all loadings are free to vary, CFA allows certain loadings to be explicitly constrained to zero.
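The contrast in loading patterns can be sketched as follows. This is a simplified illustration with invented names (`FREE`, `efa_pattern`, `cfa_pattern`) and a hypothetical item-to-factor assignment; it ignores the rotational constraints that are needed to identify an EFA solution.

```python
FREE = None  # marker for a loading that is estimated from the data


def efa_pattern(n_items, n_factors):
    """EFA: every item is allowed to load on every factor."""
    return [[FREE] * n_factors for _ in range(n_items)]


def cfa_pattern(item_to_factor, n_factors):
    """CFA: each item loads only on its hypothesized factor; other loadings are fixed to 0."""
    return [[FREE if f == target else 0.0 for f in range(n_factors)]
            for target in item_to_factor]


def count_free(pattern):
    """Count the loadings that are left free to be estimated."""
    return sum(cell is FREE for row in pattern for cell in row)


# Hypothetical assignment: items 0-2 measure factor 0, items 3-5 measure factor 1.
ASSIGNMENT = [0, 0, 0, 1, 1, 1]
EFA = efa_pattern(len(ASSIGNMENT), 2)
CFA = cfa_pattern(ASSIGNMENT, 2)
print(count_free(EFA), count_free(CFA))
```

With six items and two factors, the exploratory pattern leaves twelve loadings free, whereas the confirmatory pattern leaves six free and fixes the other six to zero.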

EFA is sometimes reported in research when CFA would be a better statistical approach.^[8] It has been argued that CFA can be restrictive and inappropriate when used in an exploratory fashion.^[9] However, the idea that CFA is solely a “confirmatory” analysis may sometimes be misleading, as the modification indices used in CFA are somewhat exploratory in nature. Modification indices show the improvement in model fit that would result if a particular constrained coefficient were freed.^[10] Likewise, EFA and CFA do not have to be mutually exclusive analyses; EFA has been argued to be a reasonable follow-up to a poor-fitting CFA model.^[11]

Confirmatory factor analysis and structural equation modeling

Structural equation modeling (SEM) software is typically used to perform confirmatory factor analysis. LISREL,^[12] EQS,^[13] AMOS,^[14] and Mplus^[15] are popular programs. CFA is also frequently used as a first step to assess the proposed measurement model in a structural equation model. Many of the rules of interpretation regarding assessment of model fit and model modification in SEM apply equally to CFA. CFA is distinguished from SEM by the absence of directed arrows between latent factors. In other words, whereas in CFA factors are not presumed to directly cause one another, SEM often specifies particular factors and variables as causal. In the context of SEM, the CFA is often called 'the measurement model', while the relations between the latent variables (with directed arrows) are called 'the structural model'.

Evaluating model fit

Most statistical methods require only one statistical test to determine the significance of the analyses. In CFA, however, several statistical tests are used to determine how well the model fits the data.^[6] Note that a good fit between the model and the data does not mean that the model is “correct”, or even that it explains a large proportion of the covariance. A “good model fit” only indicates that the model is plausible.^[16] When reporting the results of a confirmatory factor analysis, one is urged to report: a) the proposed models, b) any modifications made, c) which measures identify each latent variable, d) correlations between latent variables, e) any other pertinent information, such as whether constraints are used.^[17] With regard to selecting model fit statistics to report, one should not simply report the statistics that suggest the best fit, though this may be tempting. Although opinions vary, Kline (2010) recommends reporting the chi-squared test, the RMSEA, the CFI, and the SRMR.^[1]

Absolute fit indices

Absolute fit indices determine how well the a priori model fits, or reproduces, the data.^[18] Absolute fit indices include, but are not limited to, the chi-squared test, RMSEA, GFI, AGFI, RMR, and SRMR.^[19]

Chi-squared test

The chi-squared test indicates the difference between the observed and expected covariance matrices. Values closer to zero indicate a better fit, that is, a smaller difference between the expected and observed covariance matrices.^[10] Chi-squared statistics can also be used to directly compare the fit of nested models to the data. One difficulty with the chi-squared test of model fit, however, is that researchers may fail to reject an inappropriate model in small samples and reject an appropriate model in large samples.^[10] As a result, other measures of fit have been developed.
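For reference, under maximum likelihood estimation the test statistic is commonly written as below. The notation is an assumption made here rather than taken from the cited sources: S is the sample covariance matrix, \Sigma(\hat\theta) the model-implied covariance matrix, p the number of observed variables, q the number of free parameters, and N the sample size. Some programs multiply by N rather than N - 1.

```latex
F_{\mathrm{ML}} = \ln\lvert\Sigma(\hat\theta)\rvert - \ln\lvert S\rvert
                + \operatorname{tr}\!\left(S\,\Sigma(\hat\theta)^{-1}\right) - p,
\qquad
\chi^2 = (N-1)\,F_{\mathrm{ML}},
\qquad
df = \frac{p(p+1)}{2} - q
```

Because the statistic grows in proportion to the sample size for a fixed discrepancy F_ML, even trivial misfit can become statistically significant in large samples, while substantial misfit may go undetected in small ones.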

Root mean square error of approximation

The root mean square error of approximation (RMSEA) reduces the dependence on sample size by estimating the discrepancy between the hypothesized model, with optimally chosen parameter estimates, and the population covariance matrix.^[19] The RMSEA has a lower bound of 0 and in practice rarely exceeds 1, with smaller values indicating better model fit. A value of .06 or less is commonly taken to indicate acceptable model fit.^[20]
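A minimal sketch of the usual point estimate, computed from the model chi-square, its degrees of freedom, and the sample size, is given below. The function name and the input values are hypothetical, and some software uses N in place of N - 1 in the denominator.

```python
import math


def rmsea(chi2, df, n):
    """Point estimate of the RMSEA from a model chi-square, its df, and sample size n."""
    if df <= 0 or n <= 1:
        raise ValueError("df must be positive and n must exceed 1")
    # Misfit beyond what is expected by chance (chi2 - df), floored at zero,
    # expressed per degree of freedom and per observation.
    return math.sqrt(max(chi2 - df, 0.0) / (df * (n - 1)))


# Hypothetical fit results, not taken from any real study.
print(round(rmsea(chi2=85.0, df=40, n=301), 3))
```

When the chi-square is smaller than its degrees of freedom, the estimate is set to zero.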

Root mean square residual and standardized root mean square residual

The root mean square residual (RMR) and standardized root mean square residual (SRMR) are the square root of the average squared discrepancy between the sample covariance matrix and the model covariance matrix.^[19] The RMR can be difficult to interpret, however, because its range depends on the scales of the indicators in the model (a problem when indicators are measured on different scales; e.g., two questionnaires, one on a 0-10 scale, the other on a 1-3 scale).^[1] The standardized root mean square residual removes this difficulty in interpretation, and ranges from 0 to 1, with a value of .08 or less being indicative of an acceptable model.^[20]
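The standardization can be sketched as follows. This shows one common variant, in which each residual is divided by the product of the corresponding sample standard deviations; programs differ in the details. The function name and the matrices are hypothetical.

```python
import math


def srmr(sample_cov, implied_cov):
    """Standardized root mean square residual over the lower triangle, diagonal included."""
    p = len(sample_cov)
    total = 0.0
    for i in range(p):
        for j in range(i + 1):
            # Rescale each residual by the sample standard deviations of both variables.
            scale = math.sqrt(sample_cov[i][i] * sample_cov[j][j])
            total += ((sample_cov[i][j] - implied_cov[i][j]) / scale) ** 2
    return math.sqrt(total / (p * (p + 1) / 2))


# Hypothetical 3-variable example with variables measured on different scales.
S = [[4.0, 1.2, 0.9],
     [1.2, 1.0, 0.5],
     [0.9, 0.5, 9.0]]
SIGMA = [[4.0, 1.0, 0.6],
         [1.0, 1.0, 0.5],
         [0.6, 0.5, 9.0]]
print(round(srmr(S, SIGMA), 4))
```

Dividing by the standard deviations puts every residual on a correlation-like metric, which is why the SRMR does not depend on the original measurement scales.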

Goodness of fit index and adjusted goodness of fit index

The goodness of fit index (GFI) is a measure of fit between the hypothesized model and the observed covariance matrix. The adjusted goodness of fit index (AGFI) corrects the GFI, which is affected by the number of indicators of each latent variable. The GFI and AGFI range between 0 and 1, with a value of over .9 generally indicating acceptable model fit.^[21]
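For maximum likelihood estimation, the two indices are often written as below. The notation is assumed here rather than taken from the cited source: S is the sample covariance matrix, \hat\Sigma the model-implied covariance matrix, I the identity matrix, p the number of observed variables, and df the model degrees of freedom.

```latex
\mathrm{GFI} = 1 - \frac{\operatorname{tr}\!\left[\left(\hat\Sigma^{-1}S - I\right)^{2}\right]}
                        {\operatorname{tr}\!\left[\left(\hat\Sigma^{-1}S\right)^{2}\right]},
\qquad
\mathrm{AGFI} = 1 - \frac{p(p+1)}{2\,df}\,\bigl(1 - \mathrm{GFI}\bigr)
```

In this form the AGFI penalizes the GFI according to the ratio of available variances and covariances to degrees of freedom, so more heavily parameterized models receive a larger downward adjustment.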

Relative fit indices

Relative fit indices (also called “incremental fit indices”^[22] and “comparative fit indices”^[23]) compare the chi-square for the hypothesized model to that of a “null”, or “baseline”, model.^[18] The null model is almost always one in which all of the variables are uncorrelated and, as a result, has a very large chi-square (indicating poor fit).^[19] Relative fit indices include the normed fit index and the comparative fit index.

Normed fit index and non-normed fit index

The normed fit index (NFI) analyzes the discrepancy between the chi-squared value of the hypothesized model and the chi-squared value of the null model.^[24] However, the NFI tends to be negatively biased.^[25] The non-normed fit index (NNFI; also known as the Tucker-Lewis index, as it was built on an index proposed by Tucker and Lewis in 1973^[26]) resolves some of the issues of negative bias, though NNFI values may sometimes fall outside the 0 to 1 range.^[23] The NFI lies between 0 and 1, and NNFI values usually do as well; for both indices, a value of .95 or greater is taken to indicate good model fit.^[20]
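Both indices can be computed from the chi-square values and degrees of freedom of the hypothesized and null models. The sketch below uses the standard formulas with hypothetical function names and made-up inputs.

```python
def nfi(chi2_model, chi2_null):
    """Normed fit index: proportional reduction in chi-square relative to the null model."""
    return (chi2_null - chi2_model) / chi2_null


def nnfi(chi2_model, df_model, chi2_null, df_null):
    """Non-normed fit index (Tucker-Lewis index), based on chi-square/df ratios."""
    ratio_null = chi2_null / df_null
    ratio_model = chi2_model / df_model
    return (ratio_null - ratio_model) / (ratio_null - 1.0)


# Hypothetical fit results, not taken from any real study.
print(round(nfi(85.0, 900.0), 3))
print(round(nnfi(85.0, 40, 900.0, 45), 3))
# A model chi-square below its df pushes the NNFI above 1.
print(round(nnfi(30.0, 40, 900.0, 45), 3))
```

Because the NNFI compares chi-square to degrees-of-freedom ratios instead of raw chi-squares, it is not bounded by 1, which the last call illustrates.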

Comparative fit index

The comparative fit index (CFI) analyzes model fit by examining the discrepancy between the data and the hypothesized model, while adjusting for the sample-size issues inherent in the chi-squared test of model fit^[10] and in the normed fit index.^[23] CFI values range from 0 to 1, with larger values indicating better fit. A CFI of .90 or larger has conventionally been considered to indicate acceptable model fit, although Hu and Bentler suggest a value close to .95.^[20]
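A minimal sketch of the usual computation follows, in which each chi-square is reduced by its degrees of freedom before the hypothesized and null models are compared. The function name and input values are hypothetical.

```python
def cfi(chi2_model, df_model, chi2_null, df_null):
    """Comparative fit index from model and null-model chi-squares and their df."""
    # Excess of each chi-square over its expected value under correct fit, floored at 0.
    d_model = max(chi2_model - df_model, 0.0)
    d_null = max(chi2_null - df_null, d_model)
    if d_null == 0.0:
        return 1.0  # neither model shows misfit beyond chance
    return 1.0 - d_model / d_null


# Hypothetical fit results, not taken from any real study.
print(round(cfi(85.0, 40, 900.0, 45), 3))
```

The flooring at zero is what keeps the index within the 0 to 1 range.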

Identification and underidentification

To estimate the parameters of a model, the model must be properly identified. A necessary condition is that the number of estimated (unknown) parameters (q) be less than or equal to the number of unique variances and covariances among the p measured variables, that is, q ≤ p(p + 1)/2. This inequality is known as the “t rule”. If there is too little information available on which to base the parameter estimates, the model is said to be underidentified, and its parameters cannot be estimated appropriately.^[27]
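The counting involved can be sketched as follows for a single-factor model. The helper names are hypothetical, and the parameter count assumes that the factor variance is fixed to 1, so that only the loadings and residual variances are free. Passing the count is not by itself enough to guarantee identification.

```python
def unique_moments(n_observed):
    """Number of distinct variances and covariances among the observed variables."""
    return n_observed * (n_observed + 1) // 2


def t_rule_df(n_free_params, n_observed):
    """Degrees of freedom implied by the counting rule; negative means the rule fails."""
    return unique_moments(n_observed) - n_free_params


def one_factor_free_params(n_indicators):
    """Assumed parameterization: one loading and one residual variance per indicator."""
    return 2 * n_indicators


for p in (2, 3, 4):
    q = one_factor_free_params(p)
    df = t_rule_df(q, p)
    status = "fails the t rule" if df < 0 else "passes the t rule"
    print(p, q, unique_moments(p), df, status)
```

Under these assumptions a single factor with two indicators has four free parameters but only three unique variances and covariances, so it fails the count; three indicators give zero degrees of freedom, and four give two.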

References

↑ 1.0 1.1 1.2 Kline, R. B. (2010). Principles and practice of structural equation modeling (3rd ed.). New York, New York: Guilford Press.
↑ Preedy, V. R., & Watson, R. R. (2009) Handbook of Disease Burdens and Quality of Life Measures. New York: Springer.
↑ Jöreskog, K. G. (1969). A general approach to confirmatory maximum likelihood factor analysis. Psychometrika, 34(2), 183-202.
↑ Campbell, D. T. & Fiske, D. W. (1959). Convergent and discriminant validation by the multitrait-multimethod matrix. Psychological Bulletin, 56, 81-105.
↑ Asparouhov, T. & Muthén, B. (2009). Exploratory structural equation modeling. Structural Equation Modeling, 16, 397-438.
↑ 6.0 6.1 Suhr, D. D. (2006). Exploratory or confirmatory factor analysis? Statistics and Data Analysis, 31. Retrieved April 20, 2012, from http://www2.sas.com/proceedings/sugi31/200-31.pdf
↑ Thompson, B. (2004). Exploratory and confirmatory factor analysis: Understanding concepts and applications. Washington, DC, US: American Psychological Association.
↑ Levine, T. R. (2005). Confirmatory factor analysis and scale validation in communication research. Communication Research Reports, 22(4), 335-338.
↑ Browne, M. W. (2001). An overview of analytic rotation in exploratory factor analysis. Multivariate Behavioral Research, 36, 111-150.
↑ 10.0 10.1 10.2 10.3 Gatignon, H. (2010). Confirmatory factor analysis. In Statistical analysis of management data. DOI: 10.1007/978-1-4419-1270-1_4
↑ Schmitt, T. A. (2011). Current methodological considerations in exploratory and confirmatory factor analysis. Journal of Psychoeducational Assessment, 29(4), 304-321.
↑ CFA with LISREL
↑ Byrne, B. M. (2006). Structural equation modeling with EQS: Basic concepts, application, and programming. New Jersey: Lawrence Erlbaum Associates.
↑ CFA using AMOS
↑ Mplus homepage
↑ Schermelleh-Engel, K., Moosbrugger, H., & Müller, H. (2003). Evaluating the fit of structural equation models: Tests of significance and descriptive goodness-of-fit measures. Methods of Psychological Research Online, 8(2), 23-74.
↑ Jackson, D. L., Gillaspy, J. A., & Purc-Stephenson, R. (2009). Reporting practices in confirmatory factor analysis: An overview and some recommendations. Psychological Methods, 14(1), 6-23.
↑ 18.0 18.1 McDonald, R. P., & Ho, M. H. R. (2002). Principles and practice in reporting structural equation analyses. Psychological Methods, 7(1), 64-82.
↑ 19.0 19.1 19.2 19.3 Hooper, D., Coughlan, J., & Mullen, M. R. (2008). Structural equation modelling: Guidelines for determining model fit. Electronic Journal of Business Research Methods, 6, 53–60.
↑ 20.0 20.1 20.2 20.3 Hu, L., & Bentler, P. M. (1999). Cutoff criteria for fit indexes in covariance structure analysis: Conventional criteria versus new alternatives. Structural Equation Modeling, 6(1), 1-55.
↑ Baumgartner, H., & Homburg, C. (1996). Applications of structural equation modeling in marketing and consumer research: A review. International Journal of Research in Marketing, 13, 139-161.
↑ Tanaka, J. S. (1993). Multifaceted conceptions of fit in structural equation models. In K. A. Bollen & J. S. Long (Eds.), Testing structural equation models (pp. 136-162). Newbury Park, CA: Sage.
↑ 23.0 23.1 23.2 Bentler, P. M. (1990). Comparative fit indexes in structural models. Psychological Bulletin, 107(2), 238-46.
↑ Bentler, P. M., & Bonett, D. G. (1980). Significance tests and goodness of fit in the analysis of covariance structures. Psychological Bulletin, 88, 588-606.
↑ Bentler, P. M. (1990). Comparative fit indexes in structural models. Psychological Bulletin, 107(2), 238-46.
↑ Tucker, L. R., & Lewis, C. (1973). A reliability coefficient for maximum likelihood factor analysis. Psychometrika, 38, 1-10.
↑ Babyak, M. A., & Green, S. B. (2010). Confirmatory factor analysis: An introduction for psychosomatic medicine researchers. Psychosomatic Medicine, 72, 587-597.

External sources

Center for Statistical and Mathematical Computing at Indiana University