The solution of the classification problem in e-learning based on the method parallel construction of decision trees

Olena Tolstoluzka
Bogdan Parshencev

Abstract

Topicality. In recent years, developed countries have paid increasing attention to machine learning. This is driven, on the one hand, by rapidly growing requirements for future specialists and, on the other, by the rapid development of information technology and Internet communications. Classification is one of the main tasks in e-learning, and the mathematical apparatus of decision trees is well suited to solving it. However, as the volume of input data grows, reducing tree construction time becomes important. Parallel computing systems and parallel programming technologies can help, but they require new methods for constructing decision trees.

Results. The article describes the main stages of a parallel decision tree construction method for solving the classification problem in e-learning. Unlike existing methods, it takes into account the architecture and the organization of parallel processes in computing systems with shared and distributed memory. The method also allows performance indicators to be evaluated for decision tree construction and for the parallel algorithms. Obtaining these indicators at each iteration of the method helps to select a rational number of parallel processors in the computing system, which further reduces tree construction time. A simulation using MPI parallel programming technology and the Python programming language on a DM-MIMD architecture confirms the reliability of the results. An example of the organization of input data is given, and a Python program for building a decision tree is presented.

Conclusion. The developed visualization of the estimated performance indicators allows the user to select the required configuration of the computing system.
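The abstract does not reproduce the authors' program, so the following is only a minimal sketch of the general idea, under stated assumptions. It evaluates candidate splits for each attribute concurrently and then reduces them to the best one, using Gini impurity as the split criterion (an assumption; the abstract does not name one). A thread pool stands in for the MPI processes of a DM-MIMD system. The function names, the toy student data, the timings, and the efficiency threshold used to pick a processor count are all hypothetical and serve illustration only.

```python
from collections import Counter
from concurrent.futures import ThreadPoolExecutor


def gini(labels):
    """Gini impurity of a list of class labels."""
    n = len(labels)
    if n == 0:
        return 0.0
    return 1.0 - sum((c / n) ** 2 for c in Counter(labels).values())


def best_split_for_attribute(rows, labels, attr):
    """Scan one attribute and return (weighted_gini, attr, threshold)."""
    best = (float("inf"), attr, None)
    for threshold in sorted({row[attr] for row in rows}):
        left = [y for row, y in zip(rows, labels) if row[attr] <= threshold]
        right = [y for row, y in zip(rows, labels) if row[attr] > threshold]
        if not left or not right:
            continue  # a split must send records to both branches
        score = (len(left) * gini(left) + len(right) * gini(right)) / len(labels)
        if score < best[0]:
            best = (score, attr, threshold)
    return best


def parallel_best_split(rows, labels, workers):
    """Evaluate attributes concurrently, then reduce to the best split.

    Assumption: each worker plays the role of one parallel process;
    with MPI the per-attribute results would be gathered instead.
    """
    def evaluate(attr):
        return best_split_for_attribute(rows, labels, attr)

    with ThreadPoolExecutor(max_workers=workers) as pool:
        candidates = list(pool.map(evaluate, range(len(rows[0]))))
    return min(candidates, key=lambda c: c[0])


def speedup_and_efficiency(t1, tp, p):
    """Speedup S = T1 / Tp and efficiency E = S / p for p processors."""
    s = t1 / tp
    return s, s / p


def rational_processor_count(timings, min_efficiency):
    """Largest p whose efficiency stays at or above the threshold.

    Assumption: this selection rule is illustrative, not the article's.
    """
    t1 = timings[1]
    ok = [p for p, tp in timings.items()
          if speedup_and_efficiency(t1, tp, p)[1] >= min_efficiency]
    return max(ok)


# Hypothetical input data: (test score, hours online) per student.
rows = [(55, 2), (60, 9), (72, 3), (80, 8), (91, 4), (95, 7)]
labels = ["fail", "fail", "fail", "pass", "pass", "pass"]
split = parallel_best_split(rows, labels, workers=2)

# Hypothetical construction times in seconds, keyed by processor count.
timings = {1: 12.0, 2: 6.5, 4: 4.0, 8: 3.6}
chosen = rational_processor_count(timings, min_efficiency=0.7)
```

On this toy data the test score separates the two classes cleanly at a threshold of 72, and four processors is the largest count that keeps efficiency at or above 0.7. Splitting the work by attribute is only one possible decomposition; partitioning the records among processes is another, and which suits a given system depends on its memory architecture.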

Article Details

How to Cite
Tolstoluzka, O., & Parshencev, B. (2018). The solution of the classification problem in e-learning based on the method parallel construction of decision trees. Advanced Information Systems, 2(2), 5–9. https://doi.org/10.20998/2522-9052.2018.2.01
Section
Identification problems in information systems
Author Biographies

Olena Tolstoluzka, V. N. Karazin Kharkiv National University, Kharkiv

Doctor of Technical Sciences, Senior Research Fellow, Professor of the Theoretical and Applied Systems Engineering Department

Bogdan Parshencev, V. N. Karazin Kharkiv National University, Kharkiv

PhD student of the Theoretical and Applied Systems Engineering Department
