Machine learning based predictors for COVID-19 disease severity

Sci Rep. 2021 Feb 25;11(1):4673. doi: 10.1038/s41598-021-83967-7.

Abstract

Predictors of the need for intensive care and mechanical ventilation can help healthcare systems in planning for surge capacity for COVID-19. We used socio-demographic data, clinical data, and blood panel profile data at the time of initial presentation to develop machine learning algorithms for predicting the need for intensive care and mechanical ventilation. Among the algorithms considered, the Random Forest classifier performed the best with [Formula: see text] for predicting ICU need and [Formula: see text] for predicting the need for mechanical ventilation. We also determined the most influential features in making this prediction, and concluded that all three categories of data are important. We determined the relative importance of blood panel profile data and noted that the AUC dropped by 0.12 units when this data was not included, thus indicating that it provided valuable information in predicting disease severity. Finally, we generated RF predictors with a reduced set of five features that retained the performance of the predictors trained on all features. These predictors, which rely only on quantitative data, are less prone to errors and subjectivity.

MeSH terms

  • Adolescent
  • Adult
  • Aged
  • Aged, 80 and over
  • COVID-19 / blood
  • COVID-19 / diagnosis*
  • COVID-19 / epidemiology
  • Cohort Studies
  • Female
  • Humans
  • Intensive Care Units
  • Machine Learning*
  • Male
  • Middle Aged
  • Prognosis
  • Risk Factors
  • SARS-CoV-2 / isolation & purification
  • Severity of Illness Index
  • Young Adult