Abstract
Free full text
AI and Machine Learning for Understanding Biological Processes
Artificial intelligence and machine learning to fight COVID-19
Coronavirus disease 2019 (COVID-19), caused by severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) (13), has become an unprecedented public health crisis. Coronavirus Resource Center at Johns Hopkins University of Medicine has reported a total of 23,638 deaths as worldwide COVID-19 infections surpass 500,000 (as of 5 PM EST on March 26, 2020). On March 16, 2020, the White House, collaborating with research institutes and tech companies, issued a call to action for global artificial intelligence (AI) researchers for developing novel text and data-mining techniques to assist COVID-19-related research. The Allen Institute for AI in partnership with leading research groups issued an open-source, weekly updated COVID-19 Open Research Dataset (2), which continuously documents COVID-19-related scholarly articles to accelerate novel research projects urgently requiring real-time data. The large-scale data of COVID-19 patients can be integrated and analyzed by advanced machine learning algorithms to better understand the pattern of viral spread, further improve diagnostic speed and accuracy, develop novel effective therapeutic approaches, and potentially identify the most susceptible people based on personalized genetic and physiological characteristics. Inspirationally, within a short period of time since COVID-19 outbreak, advanced machine learning techniques have been used in taxonomic classification of COVID-19 genomes (8), CRISPR-based COVID-19 detection assay (6), survival prediction of severe COVID-19 patients (11), and discovering potential drug candidates against COVID-19 (4).
Personalized protective strategies can greatly benefit from precise classifications of the population based on categorized COVID-19 susceptibility. The earlier observation that elderly people have a higher risk to COVID-19 is challenged by a recent finding that more and more young adults suffer from severe COVID-19 symptoms, indicating an urgent need of a comprehensive risk evaluation based on personalized genetic and physiological characteristics. Human angiotensin-converting enzyme 2 (ACE2), expressed in epithelial cells of lung, small intestines, heart, and kidneys, is an entry receptor for SARS-CoV-2 spike glycoprotein (3, 13). Fang et al. (3) hypothesized that increased expression of ACE2, from using ACE2-stimulating drugs to treat hypertension and diabetes, could actually worsen clinical outcomes of COVID-19 infection. Indeed, this hypothesis should be further tested with strict experimental designs and long-term clinical observations. Therefore, biochemistry (e.g., ACE2 expression level) and clinical data (e.g., age, respiratory pattern, viral load, and survival) of COVID-19 patients with underlying medical conditions can be analyzed by machine learning approaches to not only identify any reliable features (e.g., ACE2) for risk prediction, but also further perform risk classification and prediction for a balanced preparation of ongoing disease treatment and COVID-19 defense (Fig. 1). ACE2 genetic polymorphism, represented by diverse genetic variants in human genome, has been shown to affect virus-binding activity (1), suggesting a possible genetic predisposition to COVID-19 infection. Therefore, machine learning analysis of genetic variants from asymptomatic, mild or severe COVID-19 patients can be performed to classify and predict people based on their vulnerability or resistance to potential COVID-19 infection, by which the machine learning model can also return those prioritized genetic variants, such as ACE2 polymorphism, in their decision-making process as important features for functional and mechanistic studies (Fig. 1).
Currently, ongoing efforts have been made to develop novel diagnostic approaches using machine learning algorithms. For example, machine learning-based screening of SARS-CoV-2 assay designs using a CRISPR-based virus detection system was demonstrated with high sensitivity and speed (6). Neural network classifiers were developed for large-scale screening of COVID-19 patients based on their distinct respiratory pattern (10). Similarly, a deep learning-based analysis system of thoracic CT images was constructed for automated detection and monitoring of COVID-19 patients over time (5). Rapid development of automated diagnostic systems based on artificial intelligence and machine learning not only can contribute to increased diagnostic accuracy and speed but will also protect healthcare workers by decreasing their contacts with COVID-19 patients (Fig. 1).
An effective therapeutic strategy is urgently needed to treat rapidly growing COVID-19 patients worldwide. As there is no effective drug proven to treat COVID-19 patients, it is critical to develop efficient approaches to repurpose clinically approved drugs or design new drugs against SARS-CoV-2. A machine learning-based repositioning and repurposing framework was developed to prioritize existing drug candidates against SARS-CoV-2 for clinical trials (4). Additionally, a deep learning-based drug discovery pipeline has been used to design and generate novel drug-like compounds against SARS-CoV-2 (12). AlphaFold (9), which is a deep learning system developed by Google DeepMind, has released predicted protein structures associated with COVID-19, which can take months by traditional experimental approaches, serving as valuable information for COVID-19 vaccine formulation. Moreover, COVID-19 vaccine candidates were proposed by a newly developed Vaxign reverse vaccinology tool integrated with machine learning (7). The tremendous amount of COVID-19 treatment data in worldwide hospitals also require advanced machine learning methods for analyzing personalized therapeutic effects for evaluating new patients, such as hospitalization prediction, which can not only provide better care for each patient but also contribute to local hospital arrangement and operation (Fig. 1).
As artificial intelligence and machine learning scientists have been eagerly searching and waiting for real-time data generated by this pandemic around the world, timely delivery of COVID-19 patient data, such as physiological characteristics and therapeutic outcome of COVID-19 patients, followed by subsequent data transformation for easy access, is extremely important, but challenging. Figure 1 is a schematic representation of the workflow, but there are several steps in the process that currently limit the application of machine learning and artificial intelligence to combat COVID-19. Availability of COVID-19-related clinical data, which can be managed and processed into easily accessible databases, is a key current barrier. Therefore, development of cyber-infrastructure to fuel world-wide collaborations is important. To this end, the US federal agencies are already promoting the formations of consortia and funding opportunities (https://www.nsf.gov/pubs/2020/nsf20055/nsf20055.jsp). In addition to these initiatives, integrating COVID-19-related clinical data with existing biobanks, such as the UK Biobank, with pre-existing data of those patients (if already in biobanks), such as their genotype and physiological characteristics, could maximize our efforts toward a faster and feasible approach for meaningful data-mining by bioinformaticians and computational scientists. A centralized collection of worldwide COVID-19 patient data will be beneficial for future artificial intelligence and machine learning research to develop predictive, diagnostic, and therapeutic strategies against COVID-19 and similar pandemics in future.
GRANTS
X. Cheng acknowledges funding support from the Dean’s Postdoctoral to Faculty Fellowship from the University of Toledo College of Medicine and Life Sciences and P30 Core Center Pilot Grant from NIDA Center of Excellence in Omics, Systems Genetics, and the Addictome. B. Joe acknowledges support from the National Heart Lung and Blood Institute Grant HL-143082.
DISCLOSURES
No conflicts of interest, financial or otherwise, are declared by the authors.
AUTHOR CONTRIBUTIONS
A.A., S.A., I.M., and X.C. prepared figure; A.A., S.A., I.M., and X.C. drafted manuscript; A.A., S.A., I.M., P.B.M., B.J., and X.C. edited and revised manuscript; A.A., S.A., I.M., P.B.M., B.J., and X.C. approved final version of manuscript.
REFERENCES
Articles from Physiological Genomics are provided here courtesy of American Physiological Society
Full text links
Read article at publisher's site: https://doi.org/10.1152/physiolgenomics.00029.2020
Read article for free, from open access legal sources, via Unpaywall: https://europepmc.org/articles/pmc7191426
Citations & impact
Impact metrics
Citations of article over time
Alternative metrics
Smart citations by scite.ai
Explore citation contexts and check if this article has been
supported or disputed.
https://scite.ai/reports/10.1152/physiolgenomics.00029.2020
Article citations
Detection of COVID-19 using edge devices by a light-weight convolutional neural network from chest X-ray images.
BMC Med Imaging, 24(1):1, 02 Jan 2024
Cited by: 0 articles | PMID: 38166813 | PMCID: PMC10759384
COVID-19 and beyond: leveraging artificial intelligence for enhanced outbreak control.
Front Artif Intell, 6:1266560, 08 Nov 2023
Cited by: 0 articles | PMID: 38028660 | PMCID: PMC10663297
Review Free full text in Europe PMC
Engineering Materials and Devices for the Prevention, Diagnosis, and Treatment of COVID-19 and Infectious Diseases.
Nanomaterials (Basel), 13(17):2455, 30 Aug 2023
Cited by: 0 articles | PMID: 37686965 | PMCID: PMC10490511
Review Free full text in Europe PMC
Prediction of COVID-19 Using a WOA-BILSTM Model.
Bioengineering (Basel), 10(8):883, 25 Jul 2023
Cited by: 0 articles | PMID: 37627768 | PMCID: PMC10451184
Reviewing methods of deep learning for diagnosing COVID-19, its variants and synergistic medicine combinations.
Comput Biol Med, 163:107191, 20 Jun 2023
Cited by: 4 articles | PMID: 37354819 | PMCID: PMC10281043
Review Free full text in Europe PMC
Go to all (187) article citations
Similar Articles
To arrive at the top five similar articles we use a word-weighted algorithm to compare words from the Title and Abstract of each citation.
How could artificial intelligence aid in the fight against coronavirus?
Expert Rev Anti Infect Ther, 18(6):493-497, 29 Mar 2020
Cited by: 6 articles | PMID: 32223349
The COVID-19 pandemic.
J Pak Med Assoc, 70(Suppl 3)(5):S4-S6, 01 May 2020
Cited by: 2 articles | PMID: 32515365
COVID-19: How has the scientific community risen to the challenge?
Biotechniques, 68(5):232-234, 22 Apr 2020
Cited by: 1 article | PMID: 32319301
Towards effective COVID‑19 vaccines: Updates, perspectives and challenges (Review).
Int J Mol Med, 46(1):3-16, 06 May 2020
Cited by: 183 articles | PMID: 32377694 | PMCID: PMC7255458
Review Free full text in Europe PMC
COVID-19: Main therapeutic options.
Tunis Med, 98(4):299-303, 01 Apr 2020
Cited by: 8 articles | PMID: 32395792
Review
Funding
Funders who supported this work.
HHS | NIH | National Heart, Lung, and Blood Institute (1)
Grant ID: HL143082
HHS | NIH | National Institute on Drug Abuse (1)
Grant ID: P30 Core Center Pilot Grant
NHLBI NIH HHS (1)
Grant ID: R01 HL143082
University of Toledo College of Medicine and Life Sciences (1)
Grant ID: Dean's Postdoctoral to Faculty Fellowship