SARS-CoV-2 may have diverged from bat viruses approximately 40–70 years ago

Embargoed until: Publicly released:
Peer-reviewed: This work was reviewed and scrutinised by relevant independent experts.

Experimental study: At least one thing in the experiment was changed to see if it had an impact on the subjects (often people or animals) – eg: changing the amount of time mice spend on an exercise wheel to find out what impact it has on weight loss.

Observational study: A study in which the subject is observed to see if there is a relationship between two or more things (eg: the consumption of diet drinks and obesity). Observational studies cannot prove that one thing causes another, only that they are linked.

SARS-CoV-2, the virus behind COVID-19, is likely to have diverged from the most closely related bat viruses approximately 40–70 years ago, reports international research. The authors suggest RaTG13 (the bat virus identified as the most closely related virus to SARS-CoV-2) and SARS-CoV-2 share a single ancestral lineage and estimate that SARS-CoV-2 genetically diverged from related bat viruses in 1948, 1969 and 1982, respectively.

Journal/conference: Nature Microbiology

DOI: 10.1038/s41564-020-0771-4

Organisation/s: Pennsylvania State University, USA

Funder: We thank all authors who have kindly deposited and shared genome data on GISAID. We thank T. Bedford for providing M.F.B. with an alignment on which an initial recombination analysis was done. We thank A. Chan and A. Irving for helpful comments on the manuscript. The research leading to these results received funding (to A.R. and P.L.) from the European Research Council under the European Union’s Horizon 2020 research and innovation programme (grant agreement no. 725422-ReservoirDOCS). D.L.R. is funded by the MRC (no. MC_UU_1201412). The Artic Network receives funding from the Wellcome Trust through project no. 206298/Z/17/Z. P.L. acknowledges support by the Research Foundation—Flanders (‘Fonds voor Wetenschappelijk Onderzoek—Vlaanderen’ (nos. G066215N, G0D5117N and G0B9317N)) and by the European Union’s Horizon 2020 project MOOD (no. 874850). T.L. is funded by The National Natural Science Foundation of China Excellent Young Scientists Fund (Hong Kong and Macau; no. 31922087). We thank originating laboratories at South China Agricultural University (Y. Shen, L. Xiao and W. Chen; no. EPI_ISL_410721) and Beijing Institute of Microbiology and Epidemiology (W.-C. Cao, T.T.-Y.L., N. Jia, Y.-W. Zhang, J.-F. Jiang and B.-G. Jiang, nos. EPI_ISL_410538, EPI_ISL_410539, EPI_ISL_410540, EPI_ISL_410541 and EPI_ISL_410542) for the use of sequence data via the GISAID platform.

Media release

From: Springer Nature

SARS-CoV-2 is likely to have diverged from the most closely related bat viruses approximately 40–70 years ago, reports a paper in Nature Microbiology. The findings suggest that the lineage giving rise to SARS-CoV-2 may have been circulating in bats for decades.

Understanding the evolutionary history of SARS-CoV-2 has been difficult because coronaviruses are known to recombine (exchange genetic material between different viruses) and small genomic subregions of the virus may have different ancestries. The bat virus RaTG13 has been identified as the most closely related virus to SARS-CoV-2, suggesting that a bat origin for the COVID-19 outbreak is likely. However, research has also identified similar coronaviruses in pangolins (notably, a pangolin virus sampled in Guangdong in 2019; Pangolin-2019) and it has been proposed that they may have been an intermediary host.

Maciej Boni and colleagues analysed the evolutionary history of SARS-CoV-2 using genomic data on sarbecoviruses (the subgenus to which SARS-CoV-2 belongs). They employed three approaches to identify regions in the virus that had not undergone recombination and that could be used to reconstruct its evolution. The approaches all suggest that RaTG13 and SARS-CoV-2 share a single ancestral lineage and estimate that SARS-CoV-2 genetically diverged from related bat sarbecoviruses in 1948, 1969 and 1982, respectively. The authors also examined the receptor-binding domain (RBD) of the virus spike protein, which enables it to use the human ACE2 receptor to enter cells. Although this has been shown to be genetically more similar to pangolin viruses than RaTG13, the authors found that the spike protein did not show evidence of recombination taking place between the lineage leading to SARS-CoV-2 and other known sarbecoviruses. Based on this finding, they propose this protein and its RBD are an ancestral trait of the lineage leading to SARS-CoV-2, RaTG13 and Pangolin-2019. The authors conclude that although SARS-CoV-2 and pangolin viruses share a common ancestor, and although pangolins may have played a role in the transmission to humans, they are unlikely to be an intermediate host for the virus.

The authors argue that the long divergence period of SARS-CoV-2 indicates there may be unsampled virus lineages in bats that may have zoonotic potential due to the ancestral position of human-adapted contact residues on the SARS-CoV-2 RBD, but state that better sampling is needed to assess this. They conclude that existing diversity and the dynamic process of recombination amongst lineages in the bat reservoir demonstrate how difficult it will be to identify viruses with the potential to cause significant human outbreaks before they emerge, underscoring the need for real-time human disease surveillance systems that can rapidly identify and classify pathogens.


  • Springer Nature
    Web page
  • Science Media Centre UK
    Web page
    UK SMC Expert Reaction

News for:


Media contact details for this story are only visible to registered journalists.