Methodology of the countries’ economic development data analysis
DOI:
https://doi.org/10.20535/SRIT.2308-8893.2023.4.02Keywords:
machine learning, digital development, fuzzy clustering, radial basis neural networks, logistic regression, analysis of variables informativenessAbstract
The paper examines the issue of improving the methods of identification of economic objects and their analysis using algorithms of intelligent data processing. The use of the developed methodology in the economic analysis allows for improvement in the quality of management. It can be the basis for creating decision support systems to prevent potentially dangerous changes in the economic status of the research object. In this work, an improved method of c-means data clustering with agent-oriented modification is proposed, and a radial-basis neural network and its extension are proposed to determine whether the obtained clusters are relevant and to analyze the informativeness of state variables and obtain a subset of informative variables. The effect of applying data compression using an autoencoder on the accuracy of the methods is also considered. According to the results of testing of the developed methodology, it was proved that the probability of incorrect determination of the state was reduced when identifying the states of economic systems, and a reduced value of the error of the third kind was obtained when classifying the states of objects.
References
Mei Yang, Ming K. Lim, Yingchi Qu, Du Ni, and Zhi Xiao, “Supply chain risk management with machine learning technology: A literature review and future research directions,” Computers & Industrial Engineering, vol. 175, January 2023, 108859. Available: https://doi.org/10.1016/j.cie.2022.108859
Benjamin Decardi-Nelson and Jinfeng Liu, “Robust Economic Model Predictive Control with Zone Control,” IFAC-PapersOnLine, vol. 54, issue 3, pp. 237–242, 2021. Available: https://doi.org/10.1016/j.ifacol.2021.08.248
M. Schlesinger and V. Hlavac, Ten lectures on statistical and structural pattern recognition. Springer, Dordrecht, 2002. doi: 10.1007/978-94-017-3217-8.
Data clustering: algorithms and applications, Charu C. Aggarwal and Chandan, K. Reddy (ed.). CRC Press, Taylor & Francis Group, 2014.
N. Bakumenko, V. Strilets, and M. Ugryumov, “Application of the C-Means Fuzzy Clustering Method for the Patient’s State Recognition Problems in the Medicine Monitoring Systems,” CEUR Workshop Proceedings of 3rd International Conference on Computational Linguistics and Intelligent Systems, COLINS 2019, vol. I, pp. 218–227, 2019, Available: https://www.researchgate.net/publication/338819685
R. Winkler, F. Klawonn, and R. Kruse, “Problems of Fuzzy c-Means Clustering and Similar Algorithms with High Dimensional Data Sets,” Challenges at the Interface of Data Analysis, Computer Science and Optimization, pp. 79–87, 2012. doi: 10.1007/978-3-642-24466-7_9.
Christopher D. Prabhakar Raghavan and Hinrich Schütze, Introduction to information retrieval. Cambridge University Press, 2008.
S. Askari, “Fuzzy C-Means clustering algorithm for data with unequal cluster sizes and contaminated with noise and outliers: Review and development,” Expert Systems with Applications, vol. 165, article no. 113856, 2020. doi: 10.1016/j.eswa.2020.113856.
Xuemei Zhao, Yu Li, and Quanhua Zhao, “Mahalanobis distance based on fuzzy clustering algorithm for image segmentation,” Digital Signal Processing, vol. 43, pp. 8–16, Aug 2015. Available: https://doi.org/10.1016/j.dsp.2015.04.009
Zarinbala M. Zarandia, M.H. Fazel, and I.B. Turksen, “Relative entropy fuzzy c means clustering,” Information Sciences, vol. 260, pp. 74–97, 2014. doi: 10.1016/j.ins.2013.11.004.
V. Strilets, V. Donets, M. Ugryumov, R. Zelenskyi, and T. Goncharova, “Agent-Oriented data clustering for medical monitoring,” Radioelectronic and Computer Systems, no. 1, pp. 103–114, 2022. Available: https://doi.org/10.32620/reks.2022.1.08
Meng Xing, Yanbo Zhang, Hongmei Yu, Zhenhuan Yang, and Xueling Li, “Predict DLBCL patients’ recurrence within two years with Gaussian mixture model cluster oversampling and multi-kernel learning,” Computer Methods Programs in Biomedicine, vol. 226, 107103, 2022. Available: https://doi.org/10.1016/j.cmpb.2022.107103
Lynne A. Kvapil, Mark W. Kimpel, Rasitha R. Jayasekare, and Kim Shelton, “Using Gaussian mixture model clustering to explore morphology and standardized production of ceramic vessels: A case study of pottery from Late Bronze Age Greece,” Journal of Archaeological Science: Reports, vol. 45, 103543, 2022. Available: https://doi.org/10.1016/j.jasrep.2022.103543
Meng Yinfeng, Jiye Liang, Fuyuan Cao and Yijun He, “A new distance with derivative information for functional k-means clustering algorithm,” Information Sciences, vol. 463–464, pp. 166–185, 2018. Available: https://doi.org/10.1016/j.ins.2018.06.035
Xinmin Tao, Ruotong Wang, Rui Chang, and Chenxi Li, “Density-sensitive fuzzy kernel maximum entropy clustering algorithm,” Knowledge-Based Systems, vol. 166, pp. 42–57, 2019. Available: https://doi.org/10.1016/j.knosys.2018.12.007.
K. Møllersen, S. Dhar and F. Godtliebsen, “On Data-Independent Properties for Density-Based Dissimilarity Measures in Hybrid Clustering,” Applied Mathematics, vol. 7, no. 15, pp. 1674–1706, 2016. doi: 10.4236/am.2016.715143.
Ian Goodfellow, Yoshua Bengio, and Aaron Courville, Softmax Units for Multinoulli Output Distributions. Deep Learning. MIT Press, 2016.
V.E. Strilets et al., Methods of machine learning in the problems of system analysis and decision making: monograph. Karazin Kharkiv National University, 2020, 195 p.
Farbod Farhangi, “Investigating the role of data preprocessing, hyperparameters tuning, and type of machine learning algorithm in the improvement of drowsy EEG signal modeling,” Intelligent Systems with Applications, vol. 15, 200100, September 2022. Available: https://doi.org/10.1016/j.iswa.2022.200100
Arthur Zimek and Peter Filzmoser, “There and back again: Outlier detection between statistical reasoning and data mining algorithms,” Wiley Interdisciplinary Reviews: Data Mining and Knowledge Discovery, 8(6), 2018. doi: 10.1002/widm.1280.
Fei Tony Liu, Kai Ming Ting, and Zhi-Hua Zhou, “Isolation-Based Anomaly Detection,” ACM Transactions on Knowledge Discovery from Data, 6(1), pp. 1–39, 2012. doi:10.1145/2133360.2133363.
O.Yu. Lykhach, M.L. Ugryumov, D.O. Shevchenko, and S.I. Shmatkov, “Methods of detecting emissions in test samples during process control in state-based systems,” Bulletin of Karazin Kharkiv National University, ser. “Mathematical modeling. Information Technology. Automated control systems”, no. 53. pp. 21–40, 2022.
L.J.P van der Maaten and G.E. Hinton, “Visualizing Data Using t-SNE,” Journal of Machine Learning Research, 9, pp. 2579–2605, 2008.
Ian T. Jolliffe and Jorge Cadima, “Principal component analysis: a review and recent developments. Philosophical Transactions of the Royal Society A,” Mathematical, Physical and Engineering Sciences, 374(2065), 20150202, 2016. doi: 10.1098/rsta.2015.0202.
L. Chagovets, N. Chernova, T. Klebanova, O. Dorokhov, and A. Didenko, “Selective Adaptive Model for Forecasting of Regional Development Unevenness Indexes,” Proceedings of the Workshop on the XII International Scientific Practical Conference Modern problems of social and economic systems modelling (MPSESM-W 2020) Kharkiv, Ukraine, June 25, 2020, pp. 58–76.
L.O. Chagovets, S.V. Prokopovych, S.M. Vozniuk, and V.V. Chahovets, “Conceptual basis of modeling telecommunication development of regions by methods of system analysis,” Municipal economy of cities, vol. 1, no. 161, pp. 230–240, 2021.
Computer program “Nonlinear estimation methods in the multicriterion problems of system’s robust optimal designing and diagnosing under parametric apriority uncertainty (methodology, methods and computer decision support and making system)” (“ROD&IDS”): Copyright registration certificate no. 82875 / M.L. Ugryumov, Y.S. Meniaylov, S.V. Chernysh, K.M. Ugryumova (Ukraine). Copyright and related rights. Official bulletin. Ministry of Economic Development and Trade of Ukraine. 2018, no. 51, p. 403.