- Research
- Open access
- Published:
The prediction of semen quality based on lifestyle behaviours by the machine learning based models
Reproductive Biology and Endocrinology volume 22, Article number: 112 (2024)
Abstract
Purpose
To find the machine learning (ML) method that has the highest accuracy in predicting the semen quality of men based on basic questionnaire data about lifestyle behavior.
Methods
The medical records of men whose semen was analyzed for any reason were collected. Those who had data about their lifestyle behaviors were included in the study. All semen analyses of the men included were evaluated according to the WHO 2021 guideline. All semen analyses were categorized as normozoospermia, oligozoospermia, teratozoospermia, and asthenozoospermia. The Extra Trees Classifier, Average (AVG) Blender, Light Gradient Boosting Machine (LGBM) Classifier, eXtreme Gradient Boosting (XGB) Classifier, Logistic Regression, and Random Forest Classifier techniques were used as ML algorithms.
Results
Seven hundred thirty-four men who met the inclusion criteria and had data about lifestyle behavior were included in the study. 356 men (48.5%) had abnormal semen results, 204 (27.7%) showed the presence of oligozoospermia, 193 (26.2%) asthenozoospermia, and 265 (36.1%) teratozoospermia according to the WHO 2021. The AVG Blender model had the highest accuracy and AUC for predicting normozoospermia and teratozoospermia. The Extra Trees Classifier and Random Forest Classifier models achieved the best performance for predicting oligozoospermia and asthenozoospermia, respectively.
Conclusion
The ML models have the potential to predict semen quality based on lifestyles.
Introduction
Semen analysis is used to detect the fertility capacity of men in andrological practice and it is the first recommended test for an infertility diagnostic work-up [1]. It is recommended that the semen sample is given close to the laboratory [1] but giving a sample at the hospital could sometimes cause embarrassment and some men may hesitate to provide a sample to learn their fertility capacity. As a result, home semen analysis kits and smartphone-based semen analyzers have started to be used by men [2, 3] to minimize the embarrassment, but these methods have not been used worldwide and their accuracy and cost-effectiveness are have not been fully discussed [2, 3].
According to our knowledge, there have not been any nomograms or predictors using conventional statistical methods to detect semen quality, but it is well known that lifestyle behaviors could affect semen quality [4]. With the increasing use of artificial intelligence in medicine, a limited number of studies have recently been published that aimed to predict semen quality based on lifestyle behaviors using machine learning algorithms [5,6,7,8,9,10,11,12,13,14,15,16].
In this study, we aimed to find the machine learning (ML) method that has the highest accuracy in predicting the semen quality of men based on basic questionnaire data about lifestyle behavior.
Material-methods
This retrospective-designed study was conducted after ethical approval was obtained (Eskisehir City Hospital, Non-Interventional Clinical Research Ethics Committee; Date: 16/02/2024; Decision Number: ESH/GOEK 2024/77). The medical records of the men whose semen had been analyzed for any reason between August 2021 and January 2023 at the Eskisehir City Hospital were collected. The exclusion criteria were: Aged < 18 or ˃50 years, diagnosis of azoospermia, low semen volume (less than 1.5 mL), abnormal genetics, history of any type of testicular or genitourinary tract or pelvic surgery, recurrent or subclinical varicocele, cryptorchidism, small-sized testis (normal testicular volume is 12.5–19 cc), treated cancer, vascular problem, hematologic illness, systemic disease, genitourinary system infection, or hormonal problems. After the application of the exclusion criteria, the men who also had data about their lifestyle behavior on file were included in the study. This data included details of their Body Mass Index (BMI), smoking and alcohol consumption, coffee intake, physical activity, sauna usage, cell phone usage, and the wearing of tight-fitting underwear as described previously [17, 18]. To ensure strict selection, ex-alcoholics and ex-smokers, passive smokers, and those who only participate in the other lifestyle factors irregularly were also excluded from the study. The lifestyle factors were coded ‘1’ if the BMI was ≥ 25, he smoked every day, drank any amount of alcohol, drank more than 3 cups of coffee a day, did not do any type of exercise regularly, regularly wore Tight underwear, went to a sauna-Turkish Bath regularly, or had a mobile phone ≥ 10 years during the 3-month window before semen collection. If a man who had a BMI of < 25, did not smoke, did not drink alcohol, did not drink more than 3 cups of coffee a day, exercised regularly, did not wear tight underwear, did not go to a sauna, or had used a cell phone < 10 years, the lifestyle factors were coded ‘0’. After collecting the data about the lifestyle behaviors, all semen analyses of the men included in the study were evaluated according to the WHO 2021 guideline. All semen analyses were categorized as normozoospermia (normal semen characteristic value) or abnormal. If oligozoospermia (sperm concentration < 16 × 106/ml of semen) and/or asthenozoospermia (motility < 30% spermatozoa with progressive motility), and/or teratozoospermia (morphologically normal spermatozoa < 4%) [1] had been detected in a semen sample, these results were categorized abnormal. All results were then grouped as normozoospermia, oligozoospermia (sperm concentration < 16 × 106/ml of semen), asthenozoospermia (motility < 30% spermatozoa with progressive motility), or teratozoospermia (morphologically normal spermatozoa < 4%). The 4 groups were analyzed separately by statistical methods and the ML algorithms were applied to each group.
The Shapiro–Wilk test was used to test the normality of data distribution. Continuous variables were expressed as mean ± standard deviation, median (minimum–maximum), and categorical variables were expressed as counts (percentages). Comparisons of normally distributed continuous variables between the materials/groups were performed using the student’s t-test. Comparisons of non-normally distributed continuous variables between the groups were performed using the Mann–Whitney U Test. Comparisons of categorical variables between the groups were performed using the Yates Chi-Square test and the Monte Carlo Chi-Square test. A two-sided P value < 0.05 was considered statistically significant.
The study was designed according to the principles of ML. The Extra Trees Classifier, Average (AVG) Blender, Light Gradient Boosting Machine (LGBM) Classifier, eXtreme Gradient Boosting (XGB) Classifier, Logistic Regression, and Random Forest Classifier techniques were used as ML algorithms. 70% of the data was used for training and the remaining 30% for testing. In the tests conducted with these models, the model success rates were determined based on accuracy, sensitivity, and specificity values with confusion matrix metrics and the area under curve (AUC) graph in the receiver operating characteristic (ROC) curve analysis. A confusion matrix, which contains information on actual and predicted classifications performed by a classification system and the performance of such systems, is generally assessed using the data in the matrix. Independent variables that significantly affect each group’s dependent variable were selected by the permutation feature importance method, which is based on a decrease in the model score when a single variable value is randomly shuffled (1).
Results
Seven hundred thirty-four men who met the inclusion criteria and had data about lifestyle behavior were included in the study. As seen in Table 1, 356 men (48.5%) had abnormal semen results, 204 (27.7%) showed the presence of oligozoospermia, 193 (26.2%) asthenozoospermia, and 265 (36.1%) teratozoospermia according to the WHO 2021. Smoking, regularly wearing tight underwear, and regularly going to a sauna / Turkish Bath were statistically significant between having normal and abnormal semen results (p = 0.001, 0.003 and 0.038, respectively). While asthenozoospermic males had a statistically significance difference in the age parameter (p = 0.013), teratozoospermic males had a statistically significance difference in age, smoking, and alcohol use parameters (p = 0.025, 0.001, and 0.034, respectively) (Table 2).
Among the six models, the AVG Blender model had the highest accuracy (61.2%) and AUC (58.4%) for predicting normozoospermia. The Extra Trees Classifier, Random Forest Classifier, and AVG Blender model achieved the best performance for predicting oligozoospermia, asthenozoospermia, and teratozoospermia with an accuracy of 75.5%, 69.6%, and 64.4%, respectively with an AUC of 80%, 74%, and 69.2%, respectively (Table 3 and Fig. 1). Age and smoking were the most significant featured factors for all-predictive models. Table 4 shows the confusion matrices of the algorithms, detailing the number of true positive, false positive, true negative, and false negative cases in the predicted results.
Discussion
Infertility is a major cause of stress for couples because the diagnosis and treatment are often thought to be very complex and the failure to find the cause of infertility makes the situation even worse. For this reason, healthcare professionals have started to use clinical tools to make accurate decisions for diagnosis and/or treatment to minimize the uncertainty for couples. With the increasing use of artificial intelligence in medicine, machine or deep learning-based tools have become increasingly used in reproductive medicine.
A semen analysis is the first laboratory method used for assessing male reproductive health. However, many men feel embarrassed about giving a semen sample, even though it is the most important tool. Attempts have been made to use artificial intelligence-based clinical decision-making tools to at least have an idea of semen quality instead of going to an andrology clinic. The ML-based algorithms used for this purpose have been used with questionnaire-based information about lifestyle behavior to predict semen results [5,6,7,8,9,10,11,12,13,14,15,16]. The authors of this study chose these factors to develop a prediction model because the separate and cumulative effects of lifestyle on semen quality have long been known [17,18,19]. GhoshRoy et al. reviewed the studies about lifestyle and environmental factor-based analyses for determining semen quality [20]. In this review, all the classifiers and performance parameters of these algorithms that have been used were summarized.
In our study, we collected the lifestyle behavior data from the questionnaire form that had been prepared previously [17]. All semen analyses were evaluated with the recently published WHO 2021 guideline. All the responses from the questionnaire forms and semen analyses were gathered as a dataset. After balancing and validating this dataset, we tried to find the ML model that could predict male fertility capacity with the highest accuracy and AUC. We found that AVG Blender had the highest performance in predicting normozoospermia and the LGBM Classifier could be used for predicting the oligozoospermic, teratozoospermic, or asthenozoospermic semen analysis results.
There are limited studies on the prediction of semen parameters based on modifiable lifestyle factors by AI methods. However, various non-validated questionnaire forms were used to collect the data. When these forms were analyzed, it could be seen that different parameters were identified as risky and/or modifiable lifestyle behavior without sufficient evidence. The second major limitation of these studies is that WHO guidelines for semen analysis have been updated. The reference limits of semen parameters were revised in the 6th version of the WHO guidelines, therefore the results of these studies have become invalid and the algorithms recommended by these previous analyses should be reconducted. As with the previous reports in the literature, our outputs will become invalid and the algorithms will need to be re-run if the reference limits of semen parameters are updated by later versions of the WHO guidelines. Another limitation of this current study is that our data was obtained from a single center, and we did not validate the ML models with data from external infertility clinics. Another limitation of the prediction of semen quality based on lifestyle behaviors is that various AI methods have been employed and the reports lack information about the development of the models, the various parameters that have been used to find the highest performance to predict the semen quality, and that the health professionals related to infertility may not have the knowledge to fully understand the process and the results.
Conclusion
The ML models used in this study have the potential to predict semen quality based on lifestyles. Studies with larger training datasets obtained from standardized and validated questionnaire forms about lifestyle behavior should be designed and the AI methods should be developed with a wide range of performance parameters. Furthermore, extensive information should be reported about the construction methods of the models to enable clinicians and couples to easily understand the results.
Availability of data and materials
No datasets were generated or analysed during the current study.
References
WHO laboratory manual for the examination and processing of human semen. 6th ed. Geneva: World Health Organization; 2021. https://iris.who.int/bitstream/handle/10665/343208/9789240030787-eng.pdf?sequence=1.
Park MJ, Lim MY, Park HJ, Park NC. Accuracy comparison study of new smartphone-based semen analyzer versus laboratory sperm quality analyzer. Investig Clin Urol. 2021;62:672–80. https://doi.org/10.4111/icu.20210266.
Lai JD, Fantus RJ, Meza JA, Hudnall MT, Pham M, Brannigan RE, et al. Cost-effectiveness of early screening home semen analysis in couples attempting to conceive. Urology. 2022;170:104–10. https://doi.org/10.1016/j.urology.2022.06.053.
Greeson KW, Crow KMS, Edenfield RC, Easley CA. Inheritance of paternal lifestyles and exposures through sperm DNA methylation. Nat Rev Urol. 2023;20:356–70. https://doi.org/10.1038/s41585-022-00708-9.
Badura A, Marzec-Wroblewska U, Kaminski P, Lakota P, Ludwikowski G, Szymanski M, et al. Prediction of semen quality using artificial neural network. J Appl Biomed. 2019;17:167–74. https://doi.org/10.32725/jab.2019.015.
Huang HH, Hsieh SJ, Chen MS, Jhou MJ, Liu TC, Shen HL, et al. Machine learning predictive models for evaluating risk factors affecting sperm count: predictions based on health screening indicators. J Clin Med. 2023;12:12. https://doi.org/10.3390/jcm12031220.
Girela JL, Gil D, Johnsson M, Gomez-Torres MJ, De Juan J. Semen parameters can be predicted from environmental factors and lifestyle using artificial intelligence methods. Biol Reprod. 2013;88:99. https://doi.org/10.1095/biolreprod.112.104653.
Zhou M, Yao T, Li J, Hui H, Fan W, Guan Y, et al. Preliminary prediction of semen quality based on modifiable lifestyle factors by using the XGBoost algorithm. Front Med (Lausanne). 2022;9:811890. https://doi.org/10.3389/fmed.2022.811890.
GhoshRoy D, Alvi PA, Santosh KC. Unboxing industry-standard AI models for male fertility prediction with SHAP. Healthcare (Basel). 2023;11:11. https://doi.org/10.3390/healthcare11070929.
Gil D, Girela JL, De Juan J, Gomez-Torres MJ, Johnsson M. Predicting seminal quality with artificial intelligence methods. Expert Syst Appl. 2012;39:12564–73. https://doi.org/10.1016/j.eswa.2012.05.028.
Huang HH, Lu CJ, Jhou MJ, Liu TC, Yang CT, Hsieh SJ, et al. Using a decision tree algorithm predictive model for sperm count assessment and risk factors in health screening population. Risk Manag Healthc Policy. 2023;16:2469–78. https://doi.org/10.2147/RMHP.S433193.
Sahoo AJ, Kumar Y. Seminal quality prediction using data mining methods. Technol Health Care. 2014;22:531–45. https://doi.org/10.3233/THC-140816.
Simfukwe M, Kunda D, Chembe C. Comparing naive bayes method and artificial neural network for semen quality categorization. Int J Innov Sci Eng Technol. 2015;2:689–94.
Dash SR, Ray R. Predicting seminal quality and its dependence on life style factors through ensemble learning. Int J E-Health Med Commun (IJEHMC). 2020;11:78–95.
Yibre AM, Koçer B. Semen quality predictive model using feed forwarded neural network trained by learning-based artificial algae algorithm. Eng Sci Technol Int J. 2021;24:310–8.
GhoshRoy D, Alvi PA, Santosh K. Explainable AI to Predict Male Fertility Using Extreme Gradient Boosting Algorithm with SMOTE. Electronics. 2023;12:15. https://www.mdpi.com/about/announcements/784. https://www.mdpi.com/2079-9292/12/1/15.
Kaya C, Aykac A, Kaya Y, Tas M. The effect of modifiable lifestyle factors on semen quality. Rev Int Androl. 2020;18:151–8. https://doi.org/10.1016/j.androl.2019.09.001.
Wogatzky J, Wirleitner B, Stecher A, Vanderzwalmen P, Neyer A, Spitzer D, et al. The combination matters–distinct impact of lifestyle factors on sperm quality: a study on semen analysis of 1683 patients according to MSOME criteria. Reprod Biol Endocrinol. 2012;10:115. https://doi.org/10.1186/1477-7827-10-115.
Lyons HE, Gyawali P, Mathews N, Castleton P, Mutuku SM, McPherson NO. The influence of lifestyle and biological factors on semen variability. J Assist Reprod Genet. 2024. https://doi.org/10.1007/s10815-024-03030-y.
GhoshRoy D, Alvi PA, Santosh KC. AI tools for assessing human fertility using risk factors: a state-of-the-art review. J Med Syst. 2023;47:91. https://doi.org/10.1007/s10916-023-01983-8.
Funding
The authors did not receive support from any organization for the submitted work. The authors have no relevant financial or non-financial interests to disclose.
Author information
Authors and Affiliations
Contributions
AA and CK wrote the main manuscript text and MEA prepared figures 1. CK and ÖÇ designed of the work. CK and MEA interpretation of data. AA and MS have drafted the work or substantively revised it. All authors reviewed the manuscript.
Corresponding author
Ethics declarations
Competing interests
The authors declare no competing interests.
Additional information
Publisher’s Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License, which permits any non-commercial use, sharing, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if you modified the licensed material. You do not have permission under this licence to share adapted material derived from this article or parts of it. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by-nc-nd/4.0/.
About this article
Cite this article
Aykaç, A., Kaya, C., Çelik, Ö. et al. The prediction of semen quality based on lifestyle behaviours by the machine learning based models. Reprod Biol Endocrinol 22, 112 (2024). https://doi.org/10.1186/s12958-024-01268-w
Received:
Accepted:
Published:
DOI: https://doi.org/10.1186/s12958-024-01268-w