Components Identification in Finite Mixture Model
Abstract
Finite Mixture Model (FMM) are widely utilized in statistical analysis to model unobserved heterogeneity within complex datasets. A key challenge in their implementation is selecting the optimal number of components, as this directly affects the model’s interpretability, stability, and forecasting power. This study focuses on component selection strategies in FMMs, with particular emphasis on two widely used information criteria: the Akaike Information Criterion (AIC) and the Bayesian Information Criterion (BIC). While AIC favors model flexibility, BIC emphasizes parsimony and often performs better in finite samples. We apply the FMM framework to monthly data on China’s rice trade, investigating the relationship among exchange rates, import/export prices, and trade volumes. Empirical findings suggest that a two-component model offers the best fit according to the BIC, revealing distinct market regimes likely shaped by structural conditions and policy interventions.
Keywords:
Finite Mixture Model;components;Akaike Information Criterion;Bayesian InformationCopyright Notice & License:
All articles published in Global Review of Humanities, Arts, and Society (GRHAS) are licensed under the Creative Commons Attribution 4.0 International License (CC BY 4.0).
This license allows anyone to share, copy, distribute, transmit, and adapt the work, including for commercial purposes, provided that proper attribution is given to the original author(s) and the source.

This work is licensed under a Creative Commons Attribution 4.0 International License.
References
econometric models. Journal of Spatial Econometrics, 4(1), 2.
Akaike, H. (1974). A new look at the statistical model identification. IEEE transactions on automatic control, 19(6), 716-723.
Arevalo, F., Cid, A., & Moya, J. (2017). AIC and BIC for cosmological interacting scenarios. The European Physical Journal C, 77, 1-13.
Bainter, S. A. (2015). Advantages of a Bayesian approach for examining class structure in finite mixture models. Multivariate behavioral research, 50(1), 127-127.
Biernacki, C., Celeux, G., & Govaert, G. (2002). Assessing a mixture model for clustering with the integrated completed likelihood. IEEE Transactions on pattern analysis and machine intelligence, 22(7), 719-725.
Brock, W. A. (2018). Nonlinearity and complex dynamics in economics and finance. In The economy as an evolving complex system (pp. 77-97). CRC Press.
Chakrabarti, A., & Ghosh, J. K. (2011). AIC, BIC and recent advances in model selection. Philosophy of statistics, 583-605.
Dew-Becker, I., Giglio, S., & Kelly, B. (2021). Hedging macroeconomic and financial uncertainty and volatility. Journal of Financial Economics, 142(1), 23-45.
Fei, X., Lou, Z., Lv, X., Ren, Z., & Xiao, R. (2023). Pollution threshold assessment and risk area delineation of heavy metals in soils through the finite mixture distribution model and Bayesian maximum entropy theory. Journal of Hazardous Materials, 452, 131231.
Feng, Z. D., & McCulloch, C. E. (1996). Using bootstrap likelihood ratios in finite mixture models. Journal of the Royal Statistical Society: Series B (Methodological), 58(3), 609-617.
Fraley, C., & Raftery, A. E. (2002). Model-based clustering, discriminant analysis, and density estimation. Journal of the american statistical association, 97(458), 611-631.
Frühwirth-Schnatter, S., Malsiner-Walli, G., & Grün, B. (2021). Generalized mixtures of finite mixtures and telescoping sampling. Bayesian Analysis, 16(4), 1279-1307.
Haji Aghabozorgi, H., & Eskandari, F. (2024). Clustering and estimation of finite mixture models under bivariate ranked set sampling with application to a breast cancer study. Statistical Papers, 65(2), 705-736.
Jenkins, S. P., & Rios-Avila, F. (2023). Finite mixture models for linked survey and administrative data: Estimation and postestimation. The Stata Journal, 23(1), 53-85.
Jochmans, K., Henry, M., & Salanié, B. (2017). Inference on two-component mixtures under tail restrictions. Econometric Theory, 33(3), 610-635.
Kuha, J. (2004). AIC and BIC: Comparisons of assumptions and performance. Sociological methods & research, 33(2), 188-229.
Liu, Q., Charleston, M. A., Richards, S. A., & Holland, B. R. (2023). Performance of akaike information criterion and Bayesian information criterion in selecting partition models and mixture models. Systematic Biology, 72(1), 92-105.
Manole, T., & Khalili, A. (2021). Estimating the number of components in finite mixture models via the Group-Sort-Fuse procedure. The annals of statistics, 49(6), 3043-3069.
McLachlan, G. J., & Basford, K. E. (1988). Mixture models: Inference and applications to clustering (Vol. 38). M. Dekker New York.
McLachlan, G. J., & Peel, D. (2000). Finite mixture models (Vol. 299). John Wiley & Sons.
Mortazavi, R., & Lundberg, M. (2020). Expenditure-based segmentation of tourists taking into account unobserved heterogeneity: The case of Venice. Tourism Economics, 26(3), 475-499.
Newcomb, S. (1886). A generalized theory of the combination of observations so as to obtain the best result. American journal of Mathematics, 343-366.
Pearson, K. (1894). Contributions to the mathematical theory of evolution. Philosophical Transactions of the Royal Society of London. A, 185, 71-110.
Sawadogo, R., Sawadogo, P. N., & Kinda, R. S. (2024). Financial Inclusion, Globalization and Structural Transformation in Developing Countries. Journal of Economic Integration, 39(2), 420-453. https://doi.org/10.11130/jei.2024013
Schlattmann, P. (2003). Estimating the number of components in a finite mixture model: the special case of homogeneity. Computational statistics & data analysis, 41(3-4), 441-451.
Schneider, M. P., & Scharfenaker, E. (2020). Mixing it up: The case for finite mixture models to study the distribution of income. The European Physical Journal Special Topics, 229(9), 1685-1704. https://doi.org/10.1140/epjst/e2020-900130-4
Schwarz, G. (1978). Estimating the dimension of a model. The annals of statistics, 461-464.
Smyth, P. (2000). Model selection for probabilistic clustering using cross-validated likelihood. Statistics and computing, 10(1), 63-72.
Suk, Y., Kim, J.-S., & Kang, H. (2021). Hybridizing machine learning methods and finite mixture models for estimating heterogeneous treatment effects in latent classes. Journal of Educational and Behavioral Statistics, 46(3), 323-347.
Vrieze, S. I. (2012). Model selection and psychological theory: a discussion of the differences between the Akaike information criterion (AIC) and the Bayesian information criterion (BIC). Psychological methods, 17(2), 228.
Wang, C., & Yang, Y. (2024). Estimating the Number of Components in Finite Mixture Models via Variational Approximation. arXiv preprint arXiv:2404.16746.

