Reference databases for virus metagenomics
Description
This is a collection of reference databases for virus metagenomics.
File: 2022_11_04.nt_abv.tar.gz
Archaeal, bacterial and virus sequences from NCBI nt database.
Composition by sequence accessions: archaea (14,266), bacteria (537,783) and viruses (234,136).
16,842,845 sequences; 356,180,464,604 total bases
Date: Nov 4, 2022
File: blast_gb_vi_2022_11_04.tar.gz
Database:
NCBI GenBank Viruses Complete genomes 2022_11_04
86,352 sequences; 2,371,168,247 total bases
Date: Nov 4, 2022 5:03 PM
BLASTDB Version: 5
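As a minimal sketch of how the unpacked BLAST database might be searched, the Python snippet below assembles a blastn command line. The query file, output file and database prefix are hypothetical placeholders: the actual prefix depends on the contents of the archive, which are not listed here. The options used (-query, -db, -outfmt, -evalue, -num_threads, -out) are standard BLAST+ options.

```python
import shutil
import subprocess


def blastn_command(query_fasta, db_prefix, out_tsv, evalue=1e-5, threads=4):
    """Return the blastn argument list for a tabular (outfmt 6) search."""
    return [
        "blastn",
        "-query", query_fasta,
        "-db", db_prefix,          # path prefix of the unpacked database (assumed)
        "-outfmt", "6",            # tab-separated hit table
        "-evalue", str(evalue),
        "-num_threads", str(threads),
        "-out", out_tsv,
    ]


def run_blastn(query_fasta, db_prefix, out_tsv):
    """Run blastn if BLAST+ is installed; raise a clear error otherwise."""
    if shutil.which("blastn") is None:
        raise RuntimeError("blastn (BLAST+) not found on PATH")
    subprocess.run(blastn_command(query_fasta, db_prefix, out_tsv), check=True)
```

Building the argument list separately from running it makes the command easy to inspect or log before a long search is started.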
File: nt_2021_12_habv_cent.tar.gz
Database: Centrifuge index for NCBI nt Archaea + nt Bacteria + nt Viruses + human GRCh38 reference assembly
Description: The NCBI nt FASTA database and the human reference assembly GRCh38 were downloaded on 30/12/2021. Sequences for Archaea, Bacteria and Viruses were selected from the nt database using NCBI taxonomy, and the latest human assembly was added to this collection. The Centrifuge index was compiled with centrifuge-build.
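The taxonomy-based selection described above can be illustrated with a short Python sketch. The record does not say how the selection was implemented, so this is only one plausible approach: it reads the parent links from the NCBI taxonomy dump file nodes.dmp and walks up the tree until it reaches the root taxid of Archaea (2157), Bacteria (2) or Viruses (10239). The function names are illustrative, not part of any published tool.

```python
# Root taxids of the three selected groups in the NCBI taxonomy.
ROOTS = {2157: "Archaea", 2: "Bacteria", 10239: "Viruses"}


def load_parents(nodes_dmp_path):
    """Parse NCBI nodes.dmp into a {taxid: parent_taxid} dictionary."""
    parents = {}
    with open(nodes_dmp_path) as handle:
        for line in handle:
            # Fields in nodes.dmp are separated by "<tab>|<tab>".
            fields = line.split("\t|\t")
            parents[int(fields[0])] = int(fields[1])
    return parents


def classify(taxid, parents):
    """Walk up the tree; return the group name, or None if outside all three."""
    seen = set()
    while taxid not in seen:
        if taxid in ROOTS:
            return ROOTS[taxid]
        seen.add(taxid)
        # Unknown taxids map to themselves, which ends the loop.
        taxid = parents.get(taxid, taxid)
    return None
```

A sequence would be kept when classify returns a group name for its taxid. Mapping accessions to taxids (for example via the NCBI accession2taxid files) is a separate step not shown here.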
File: nt_2020_12_vi_blastn.tar.gz
NCBI GenBank Viruses Complete genomes 13/12/2020
Description: The NCBI nt FASTA database was downloaded on 13/12/2020. Virus sequences were selected using NCBI taxonomy. The BLAST nucleotide database was compiled with makeblastdb.
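For orientation, a makeblastdb call of the kind mentioned above might be assembled as in the following Python sketch. The exact options used for this database are not stated in the record, so the file names, title and choice of -parse_seqids are assumptions. The options themselves (-in, -dbtype, -parse_seqids, -title, -out) are standard makeblastdb options.

```python
def makeblastdb_command(fasta_path, db_prefix, title):
    """Return the makeblastdb argument list for a nucleotide database."""
    return [
        "makeblastdb",
        "-in", fasta_path,
        "-dbtype", "nucl",     # nucleotide database, searchable with blastn
        "-parse_seqids",       # keep sequence identifiers retrievable (assumed choice)
        "-title", title,
        "-out", db_prefix,
    ]


# Hypothetical file names; pass the list to subprocess.run(cmd, check=True)
# on a system where BLAST+ is installed and the FASTA file exists.
cmd = makeblastdb_command("nt_viruses.fasta", "nt_vi", "nt viruses 13/12/2020")
print(" ".join(cmd))
```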
Project website:
https://www2.helsinki.fi/en/projects/lazypipe
Citing:
Ilya Plyusnin, Ravi Kant, Anne J. Jaaskelainen, Tarja Sironen, Liisa Holm, Olli Vapalahti, Teemu Smura. (2020) Novel NGS Pipeline for Virus Discovery from a Wide Spectrum of Hosts and Sample Types. Virus Evolution, veaa091, https://doi.org/10.1093/ve/veaa091.
Publication year
2022
Dataset type
Authors
Project
Other information
Fields of science
Genetics, developmental biology, physiology
Language
Availability
Open