On January 22, 2020, the National Genomics Science Data Center officially released the 2019 new coronavirus resource library.
Academic literature, news, popular science articles. At the same time, mutation analysis and display of the genome sequence of different coronavirus strains.
New Coronavirus (2019-nCoV) Resource Library
The 2019 New Coronavirus Resource Library contains meta information of the 2019 New Coronavirus (2019-nCoV) virus strains from the NCBI GenBank database and GISAID database, including the strain name, sampling date, sampling location, sample providing unit, data submission unit, etc.
Through this resource library, you can also access the Coronaviridae genome and protein sequences collected from the public database of the National Genome Science Data Center Genome Database GWH.
Users can base on Accession number, species name, genus name, sampling date, sampling location, host, separation source, release date and other meta-information screen the corona virus strains of interest, and personalize the selected sequence to download to carry out relevant scientific research.
Coronavirus Resource Library will continue to update information
The 2019 new coronavirus resource library will continue to update meta information and genomic sequence data, and monitor the academic documents of 2019 new coronavirus and other coronaviruses published in NCBI's PubMed database in real time, and the news released by China News and Xinhuanet to update the world simultaneously.
The popular science introduction issued by WHO and the Chinese Center for Disease Control and Prevention provides resources and windows for users to carry out academic research, master scientific research progress, understand news and scientific knowledge.
The 2019 new coronavirus resource library carried out 2019-nCoV virus strain genome variation analysis based on different reference genome sequences, and statistically and visually displayed the results.
Through the comparison of genome-wide sequence similarity and analysis of mutation sites, the degree of variation, variation region, and variation between 2019-nCoV virus strains, 2019-nCoV virus strains and SARS coronavirus, and SARS-like coronavirus bat strains are obtained for base details.
According to data analysis, the 2019-nCoV genome sequence sequence similarity to the SARS virus outbreak in 2003 was 80%, which was similar to the Bat SARS-like coronavirus isolate bat-SL-CoVZC45 genome sequence sequence collected from domestic bats in February 2017.
The highest, the similarity is 88%. Genomic variation analysis of the 2019-nCoV virus strain can provide an important data foundation and decision support for tracing the source of the virus, tracking the mutation path of the virus strain, preventing and controlling the epidemic caused by the new coronavirus, and treating viral pneumonia.
2019 New Coronavirus Genome Variation Analysis
The reference genome for genome variation analysis is GenBank 2019 new coronavirus strain genome sequence MN908947.3, SARS-Cov is SARS coronavirus, bat-SL-CoVZC45 is SARS-like coronavirus bat strain. The rest of the genome sequence is derived from the 2019 new coronavirus strain released by GISAID. The red vertical line indicates the mutation site.