MACHINE LEARNING METHODS APPLICATION FOR SOLVING THE PROBLEM OF BIOLOGICAL DATA ANALYSIS

Main Article Content

Olena Akhiiezer
https://orcid.org/0000-0002-7087-9749
Olha Dunaievska
https://orcid.org/0000-0003-0286-5991
Iryna Serdiuk
https://orcid.org/0000-0002-4511-4189
Semen Spivak

Abstract

According to statistics, every fifth married couple is faced with the inability to conceive a child. Male germ cells are very vulnerable, and the growing number of cases of male infertility confirms that in today's world there are many factors that affect the activity of spermatozoa and their number. But the important thing is not so much their quantity, but quality. The spermogram is an objective method of laboratory diagnosis, which allows  to accurately assess the man’s ability to fertilize by analyzing ejaculate for a number of key parameters. Only a spermogram can answer the question of a possible male infertility and the presence of urological diseases. When constructing spermograms, it is important to determine not only the number of good spermatozoa, but also their morphology and mobility. Therefore, research and improvement of some stages of spermogramm is the purpose of the study. This article addresses the problem of classification of spermatozoa in good and bad ones, taking into account their mobility and morphology, using methods of machine learning. In order to implement the first stage of machine learning (with a teacher) in the graphic editor, educational specimens (training sample) were created. The training was implemented by three methods: the method of support vector machine, the logistic regression and the method of K - the nearest neighbors. As a result of testing, the method K - the nearest neighbors is chosen. At the testing stage, a sample of 15 different spermatozoa was used in different variations of rotation around their axis. The test sample did not contain specimens from the training sample and was formed taking into account the morphological characteristics of the spermatozoa, but did not copy them from the training sample. At the final stage of study, the program's functioning was tested on real data.

Article Details

How to Cite
Akhiiezer, O., Dunaievska, O., Serdiuk, I., & Spivak, S. (2018). MACHINE LEARNING METHODS APPLICATION FOR SOLVING THE PROBLEM OF BIOLOGICAL DATA ANALYSIS. Advanced Information Systems, 2(3), 5–9. https://doi.org/10.20998/2522-9052.2018.3.01
Section
Identification problems in information systems
Author Biographies

Olena Akhiiezer, National Technical University «Kharkiv Polytechnic Institute», Kharkiv

Candidate of Technical Sciences, Associate Professor, Professor of Computer Mathematics and Data Analysis Department

Olha Dunaievska, National Technical University «Kharkiv Polytechnic Institute», Kharkiv

Candidate of Technical Sciences, Associate Professor of Computer Mathematics and Data Analysis Department

Iryna Serdiuk, National Technical University «Kharkiv Polytechnic Institute», Kharkiv

Associate Professor of Computer Mathematics and Data Analysis Department

Semen Spivak, National Technical University «Kharkiv Polytechnic Institute», Kharkiv

Student of Computer Mathematics and Data Analysis Department

References

Danilov, V.V. and Lelchuk, S.A. (2013), “Quantitative evaluation of spermograms in men based on interval scale”, Andrology and Genital Surgery, No.1, pp. 44–48.

Kruger, T.F., Acosta, A.A. and Simmons, K.F. (1987), “New method of evaluating sperm morphology with predictive value for human in vitro fertilization”, Urology, 30, pp. 248–251.

Kruger, T.F., Menkveld R and Stanger F.S.H. (1986), “Sperm morphologic features as a prognostic factor in in vitro fertilization”, Fertil Steril, 46, pp. 1118–1123.

World Health Organization, Department of Reproductive Health and Research (2010), WHO laboratory manual for the examination and processing of human semen, 5th edition, 234 p.

World Health Organization (2001), WHO guidelines on laboratory research of human ejaculate and interaction of spermatozoa with cervical mucus, 4th edition, Publishing house "MedPress", Moscow, 144 p.

Theodoridis, Sergios and Koutroumbas, Konstantinos (2009), Pattern Recognition, 4th Edition, Academic Press, 984 p.

Poreva A.S. and Karpliuk, Е.S. (2017), “Methods of machine learning for the study of lung sounds”, Biomedical instruments and systems, No. 22 (6), pp. 41–46.

Talarczyk-Desole, J., Berger, A., Taszarek-Hauke, G., Hauke, J., Pawelczyk, L. and Jedrzejczak P. (2017), “Manual vs. computer-assisted sperm analysis: can CASA replace manual assessment of human semen in clinical practice?”, Ginekologia Polska, No. 88 (2), pp. 56–60.

(2014), Systems M-AD. SCA - Sperm Class Analyzer v. 4.1, polish version, Microptic - Automatic Diagnostic Systems.