Photo by Riizz on Unsplash
Photo by Riizz on Unsplash

Could artificial intelligence predict the next pandemic?

Embargoed until: Publicly released:
Peer-reviewed: This work was reviewed and scrutinised by relevant independent experts.

Simulation/modelling: This type of study uses a computer simulation or mathematical model to predict an outcome. The original values put into the model may have come from real-world measurements (eg: past spread of a disease used to model its future spread).

Artificial intelligence could detect animal viruses with the potential to cause pandemics in humans, according to international research. The researchers developed a machine learning model using a dataset of nearly 900 virus species which can determine the probability of an animal virus spreading to humans based on its genetic makeup and similarity to other viruses. The researchers say this model could help identify diseases that need more research and monitoring before they spread to humans and potentially become a pandemic.

Journal/conference: PLOS Biology

Link to research (DOI): 10.1371/journal.pbio.3001390

Organisation/s: University of Glasgow, UK

Funder: D.G.S. and N.M. were supported by a Wellcome Senior Research Fellowship (217221/Z/19/Z). Additional funding was provided by the Medical Research Council through program grants MC_UU_12014/8 and MC_UU_12014/12. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

Media release

From: PLOS

AI may predict the next virus to jump from animals to humans

Most emerging infectious diseases of humans (like COVID-19) are zoonotic – caused by viruses originating from other animal species. Identifying high-risk viruses earlier can improve research and surveillance priorities. A study publishing in PLOS Biology on September 28th by Nardus Mollentze, Simon Babayan, and Daniel Streicker at University of Glasgow, United Kingdom suggests that machine learning (a type of artifical intelligence) using viral genomes may predict the likelihood that any animal-infecting virus will infect humans, given biologically relevant exposure.

Identifying zoonotic diseases prior to emergence is a major challenge because only a small minority of the estimated 1.67 million animal viruses are able to infect humans. To develop machine learning models using viral genome sequences, the researchers first compiled a dataset of 861 virus species from 36 families. They then built machine learning models, which assigned a probability of human infection based on virus taxonomy and/or relatedness to known human-infecting viruses. The authors then applied the best-performing model to analyze patterns in the predicted zoonotic potential of additional virus genomes sampled from a range of species.

The researchers found that viral genomes may have generalizable features that are independent of virus taxonomic relationships and may preadapt viruses to infect humans. They were able to develop machine learning models capable of identifying candidate zoonoses using viral genomes. These models have limitations, as computer models are only a preliminary step of identifying zoonotic viruses with potential to infect humans. Viruses flagged by the models will require confirmatory laboratory testing before pursuing major additional research investments. Further, while these models predict whether viruses might be able to infect humans, the ability to infect is just one part of broader zoonotic risk, which is also influenced by the virus’ virulence in humans, ability to transmit between humans, and the ecological conditions at the time of human exposure.

According to the authors, “Our findings show that the zoonotic potential of viruses can be inferred to a surprisingly large extent from their genome sequence. By highlighting viruses with the greatest potential to become zoonotic, genome-based ranking allows further ecological and virological characterisation to be targeted more effectively.”

“These findings add a crucial piece to the already surprising amount of information that we can extract from the genetic sequence of viruses using AI techniques,” Babayan adds. “A genomic sequence is typically the first, and often only, information we have on newly-discovered viruses, and the more information we can extract from it, the sooner we might identify the virus’ origins and the zoonotic risk it may pose. As more viruses are characterized, the more effective our machine learning models will become at identifying the rare viruses that ought to be closely monitored and prioritized for preemptive vaccine development.”

Attachments:

Note: Not all attachments are visible to the general public

  • PLOS
    Web page
    The URL will go live after the embargo ends

News for:

International

Media contact details for this story are only visible to registered journalists.