Abstract
This paper refers to the exponential family of probability distributions and the conditional maximum likelihood (CML) theory. It is concerned with the determination of the sample size for three groups of tests of linear hypotheses, known as the fundamental trinity of Wald, score, and likelihood ratio tests. The main practical purpose refers to the special case of tests of the class of Rasch models. The theoretical background is discussed and the formal framework for sample size calculations is provided, given a predetermined deviation from the model to be tested and the probabilities of the errors of the first and second kinds.
Similar content being viewed by others
References
Agresti, A. (2002). Categorical data analysis (2nd ed.). New York: Wiley.
Aitchison, J., & Silvey, S. D. (1958). Maximum likelihood estimation of parameters subject to restraints. The Annals of Mathematical Statistics, 29, 813–828.
Andersen, E. B. (1970). Asymptotic properties of conditional maximum likelihood estimators. Journal of the Royal Statistical Society, Series B, 32, 283–301.
Andersen, E. B. (1973). A goodness of fit test for the Rasch model. Psychometrika, 38, 123–140.
Andersen, E. B. (1977). Sufficient statistics and latent trait models. Psychometrika, 42, 69–81.
Andersen, E. B. (1980). Discrete statistical models with social science applications. Amsterdam: North-Holland.
Andrich, D. (1978). A rating formulation for ordered response categories. Psychometrika, 43, 561–573.
Bahadur, R. R. (1954). Sufficiency and statistical decision functions. Annals of the Institute of Statistical Mathematics, 25, 423–462.
Barndorff-Nielsen, O. (1978). Information and exponential families in statistical theory. New York: Wiley.
Cohen, J. (1988). Statistical power analyses for the behavioral sciences. New York: Erlbaum.
Davidson, R. R., & Lever, E. L. (1967). The limiting distribution of the likelihood ratio statistic under a class of local alternatives. Florida State University Statistics Report M126, Tallahassee.
Diamond, E. L. (1963). The limiting power of categorical data chi-square tests analogous to normal analysis of variance. Annals of Mathematical Statistics, 34, 1432–1441.
Draxler, C. (2010). Sample size determination for Rasch model tests. Psychometrika, 75, 708–724.
Dynkin, E. B. (1951). Necessary and sufficient statistics for a family of probability distributions. Uspekhi Matematicheskikh Nauk, 6, 68–90.
Feder, P. I. (1968). On the distribution of the log likelihood ratio test statistic when the true parameter is near the boundaries of the hypothesis regions. The Annals of Mathematical Statistics, 39, 2044–2055.
Fischer, G. H. (1981). On the existence and uniqueness of maximum-likelihood estimates in the Rasch model. Psychometrika, 46, 59–77.
Fischer, G. H., & Molenaar, I. W. (1995). Rasch models-foundations, recent developments and applications. New York: Springer.
Fleiss, J. L. (1981). Statistical methods for rates and proportions (2nd ed.). New York: Wiley.
Gaffke, N., Steyer, R., & von Davier, A. A. (1999). On the asymptotic null-distribution of the Wald statistic at singular parameter points. Statistics & Decisions, 17, 339–358.
Glas, C. A. W. (1988). The derivation of some tests for the Rasch model from the multinomial distribution. Psychometrika, 53, 525–546.
Glas, C. A. W., & Verhelst, N. D. (1989). Extensions of the partial credit model. Psychometrika, 54, 635–659.
Glas, C. A. W., & Verhelst, N. D. (1995a). Testing the Rasch model. In G. H. Fischer & I. W. Molenaar (Eds.), Rasch models-foundations, recent developments and applications (pp. 69–95). New York: Springer.
Glas, C. A. W., & Verhelst, N. D. (1995b). Tests of fit for polytomous Rasch models. In G. H. Fischer & I. W. Molenaar (Eds.), Rasch models- foundations, recent developments and applications (pp. 325–352). New York: Springer.
Glas, C. A. W. (2006). Testing generalized Rasch models. In M. von Davier & C. H. Carstensen (Eds.), Multivariate and mixture distribution rasch models: Extensions and applications (pp. 37–56). New York: Springer.
Haberman, S. J. (1974). The analysis of frequency data. Chicago: University of Chicago Press.
Haberman, S. J. (1981). Tests for independence in two-way contingency tables based on canonical correlation and on linear-by-linear interaction. The Annals of Statistics, 9, 1178–1186.
Kelderman, H. (1984). Log linear Rasch model tests. Psychometrika, 49, 223–245.
Kelderman, H. (1989). Item bias detection using log linear IRT. Psychometrika, 54, 681–697.
Martin- Löf, P. (1973). Statistiska Modeller. (Statistical models. Notes from seminars 1969–1970 by Rolf Sundberg.) Stockholm.
Masters, G. N. (1982). A Rasch model for partial credit scoring. Psychometrika, 47, 149–174.
Maydeu-Olivares, A., & Montano, R. (2013). How should we assess the fit of Rasch-type models? Approximating the power of goodness-of-fit statistics in categorical data analysis. Psychometrika, 78, 116–133.
Mitra, S. K. (1958). On the limiting power function of the frequency chi-square test. Annals of Statistics, 29, 1221–1233.
Müller, H. (1987). A Rasch model for continuous ratings. Psychometrika, 52, 165–181.
Neyman, J., & Pearson, E. S. (1928). On the use and interpretation of certain test criteria for purposes of statistical inference. Biometrika, 20 A, 263–294.
Neyman, J., & Scott, E. L. (1948). Consistent estimates based on partially consistent observations. Econometrica, 16, 1–32.
Pearson, K. (1900). On the criterion that a given system of deviations from the probable in the case of a correlated system of variables is such that it can be reasonably supposed to have arisen from random sampling. Philosophical Magazine Series, 5(50), 157–175.
Pfanzagl, J. (1993). On the consistency of conditional maximum likelihood estimators. Annals of the Institute of Statistical Mathematics, 45, 703–719.
Rao, C. R. (1948). Large sample tests of statistical hypotheses concerning several parameters with applications to problems of estimation. Proceedings of the Cambridge Philosophical Society, 44, 50–57.
Rasch, G. (1960). Probabilistic models for some intelligence and attainment tests. Copenhagen: The Danish Institute of Education Research (Expanded Edition, 1980. Chicago: University of Chicago Press).
Rasch, G. (1961). On general laws and the meaning of measurement in psychology. Berkeley: University of California Press.
Satorra, A., & Saris, W. E. (1985). The power of the likelihood ratio test in covariance structure analysis. Psychometrika, 50, 83–90.
Silvey, S. D. (1959). The Lagrangian multiplier test. The Annals of Mathematical Statistics, 30, 389–407.
Stroud, T. W. F. (1972). Fixed alternatives and Wald’s formulation of the noncentral asymptotic behavior of the likelihood ratio statistic. The Annals of Mathematical Statistics, 43, 447–454.
van den Wollenberg, A. (1982). Two new test statistics for the Rasch model. Psychometrika, 47, 123–140.
Verhelst, N. D., & Glas, A. W. (1995). The one parameter logistic model. In G. H. Fischer & I. W. Molenaar (Eds.), Rasch models- foundations, recent developments and applications (pp. 215–237). New York: Springer.
Verhelst, N. D., Glas, C. A. W., & Verstralen, H. H. F. M. (1994). OPLM: Computer program and manual. Arnhem: CITO.
von Davier, A. A. (2003). Large sample tests for comparing regression coefficients in models with normally distributed variables. Research Report RR-03-29. Princeton, NJ: Educational Testing Service.
Wald, A. (1943). Test of statistical hypotheses concerning several parameters when the number of observations is large. Transactions of the American Mathematical Society, 54, 426–482.
Wilks, S. S. (1938). The large sample distribution of the likelihood ratio for testing composite hypotheses. The Annals of Mathematical Statistics, 9, 60–62.
Wilson, M., & Masters, G. N. (1993). The partial credit model and null categories. Psychometrika, 58, 87–99.
Author information
Authors and Affiliations
Corresponding author
Appendix
Appendix
This appendix provides a discussion of the fundamentals of the asymptotic properties of the CML estimator with reference to the concept of a sequence of local alternative hypotheses and a fixed alternative. It refers to the result that a joint limiting distribution for Wald, score, and likelihood ratio test statistics does not exist under a fixed alterative whereas it does exist under a sequence of local alternative hypotheses.
Let \(\varvec{\beta }_{0} \) denote a vector satisfying (4) and assume that \(\varvec{\beta }=\varvec{\beta }_{0} \) holds. Then, according to Andersen’s (Andersen, 1970) results,
for \(n\rightarrow \infty \), the Fisher information matrix \(\varvec{{I}}\left( {\varvec{\beta }_{0}} \right) \) being evaluated using \(\varvec{\beta }_{0} \). The covariance \(n\varvec{{I}}\left( {\varvec{\beta }_{0}} \right) ^{-1}\) of \(\sqrt{n} \left( {{\hat{\varvec{{\beta }}}-\varvec{\beta } }_{0}} \right) \) remains bounded as \(n\rightarrow \infty \) since the Fisher information is additive and may be rewritten as
i.e., the information contained in the conditional distribution of \(\varvec{Y}_{1} \left| {\varvec{T}_{1} =\varvec{t}_{1}} \right. ,\ldots ,\varvec{Y}_{n} \left| {\varvec{T}_{n} =\varvec{t}_{n}} \right. \) is obtained by summation of the information contained in the conditional distribution of each \(\varvec{Y}\left| {\varvec{T}=\varvec{t}} \right. \).
Let \(\varvec{\beta }_{a} =\varvec{\beta }_{0} +\varvec{\delta }, \varvec{\delta }\ne \varvec{0}\), be a vector not satisfying (4), i.e., a fixed alternative, a fixed deviation independent of n. If \(\varvec{\beta }=\varvec{\beta }_{a} \) holds, a limiting distribution of \(\sqrt{n} \left( {\hat{\varvec{{\beta }}}-\varvec{\beta }_{0}} \right) \) will not exist since, according to Andersen’s (1970) and Pfanzagl’s (1993) consistency theorems, \(\hat{\varvec{{\beta }}}\) will converge to \(\varvec{\beta }_{a} \) so that
will grow unboundedly with \(n\rightarrow \infty \), i.e., the mean vector amounts to \(\sqrt{n} \varvec{\delta }\). Hence, a limiting distribution for the three test statistics Wald, score, and likelihood ratio will also not exist if the fixed alternative \(\varvec{\beta }=\varvec{\beta }_{a} \) holds.
Let the assumption of a sequence of local alternative hypotheses \(\varvec{\beta }_{an} =\varvec{\beta }_{0} +\varvec{\delta }n^{-0.5}\) be introduced so that \(\varvec{T\beta }_{an} \rightarrow \varvec{c}\) as \(n\rightarrow \infty \). Then, one obtains
as \(n\rightarrow \infty \), since both \(\hat{\varvec{{\beta }}}\rightarrow \varvec{\beta }_{0} \) and \(\varvec{\beta }_{an} \rightarrow \varvec{\beta }_{0} \). With this and given technical regularity conditions, the results regarding the joint limiting distribution of Wald, score, and likelihood ratio test statistics derived in the papers cited in 3.2 apply. The non-central \(\chi ^{2}\) distribution obviously follows from the multivariate normal distribution of \(\sqrt{n} \left( {\hat{\varvec{{\beta }}}-\varvec{\beta }_{0}} \right) \) with mean vector \(\varvec{\delta }\ne \varvec{0}\). It might also be interesting to note that the statistic divided by n has a positive limit which may provide a measure of model error comparable to the well-known effect measures suggested by Cohen (1988).
In 4.1 it is assumed that the sequence of nuisance parameters \(\varvec{\theta }_{1} ,\ldots ,\varvec{\theta }_{n} \) is independently and identically distributed with joint probability density function \(\varphi \left( {\varvec{\theta }} \right) \) so that the common probability distribution of the sequence \(\varvec{T}_{1} ,\ldots ,\varvec{T}_{n} \) is given by (9). Under this assumption it holds for the asymptotic covariance matrix of \(\sqrt{n} \left( {\hat{\varvec{{\beta }}}-\varvec{\beta }_{0}} \right) \) that, according to (10),
with \({\varvec{\varGamma }}\left( {\varvec{\beta }_{0}} \right) \) given by the second factor (the integral) of (10) times \(-1\) evaluated at \(\varvec{\beta }_{0} \). Hence, for \(n\rightarrow \infty \), if \(\varvec{\beta }=\varvec{\beta }_{0} \) holds
and under the sequence of local alternative hypotheses
Rights and permissions
About this article
Cite this article
Draxler, C., Alexandrowicz, R.W. Sample Size Determination Within the Scope of Conditional Maximum Likelihood Estimation with Special Focus on Testing the Rasch Model. Psychometrika 80, 897–919 (2015). https://doi.org/10.1007/s11336-015-9472-y
Received:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11336-015-9472-y