Release date
Release ID
230
Data source
NCBI RefSeq release 230
Number of genomes
15003
Number of proteins
690046
VOGDB group
Number of VFAM: 39585 (Virus protein families)
Number of VFOLD: 33042 (Virus protein structural folds)
Number of VOG: 48337 (Virus orthologous groups)
Base URL
VOGDB File
vfam.annotations.tsv.gz (Functional annotations of groups): 285,006 bytes, MD5 checksum d1c6ce27c3477cc8c7b7de7d301d48bf
https://fileshare.csb.univie.ac.at/vog/vog230/vfam.annotations.tsv.gz
vfam.faa.tar.gz (Protein sequences of groups): 62,718,749 bytes, MD5 checksum 85ec2471121f687e4ce6b302b07afd3d
https://fileshare.csb.univie.ac.at/vog/vog230/vfam.faa.tar.gz
vfam.hmm.tar.gz (Hidden Markov Models of groups): 458,311,966 bytes, MD5 checksum 83c91263e73a09b2cdf193f20675ba5a
https://fileshare.csb.univie.ac.at/vog/vog230/vfam.hmm.tar.gz
vfam.lca.tsv.gz (Last common ancestors of groups): 436,108 bytes, MD5 checksum ca7659cb173c209ddd71e88c74d8ba03
https://fileshare.csb.univie.ac.at/vog/vog230/vfam.lca.tsv.gz
vfam.members.tsv.gz (Member protein IDs of groups): 4,252,753 bytes, MD5 checksum 8e2291b77fbc5e767eba8c3e16d2cf46
https://fileshare.csb.univie.ac.at/vog/vog230/vfam.members.tsv.gz
vfam.raw_algs.alistat.txt (Statistics of multiple alignments): 3,016,925 bytes, MD5 checksum 261e8de8316db7149066a13763277ff6
https://fileshare.csb.univie.ac.at/vog/vog230/vfam.raw_algs.alistat.txt
vfam.raw_algs.tar.gz (Multiple sequence alignments of groups): 60,464,815 bytes, MD5 checksum ec79878d70e69953f43546501644408b
https://fileshare.csb.univie.ac.at/vog/vog230/vfam.raw_algs.tar.gz
vfam.representatives.colabfold_mean_plddt.txt (Mean pLDDT values of protein structure predictions): 2,016 bytes, MD5 checksum f6259d5eeddb217c7cf090e48e08d98c
https://fileshare.csb.univie.ac.at/vog/vog230/vfam.representatives.colabfold_mean_plddt.txt
vfam.representatives.colabfold_predictions.tar.gz (Protein structure predictions): 2,055,002,852 bytes, MD5 checksum 4ccf16c20049f2829aa647573b50ee0d
https://fileshare.csb.univie.ac.at/vog/vog230/vfam.representatives.colabfold_predictions.tar.gz
vfam.virusonly.tsv.gz (Specificity of groups to viruses): 104,262 bytes, MD5 checksum 022f2220514f5254a040f64293f06179
https://fileshare.csb.univie.ac.at/vog/vog230/vfam.virusonly.tsv.gz
vfold.annotations.tsv.gz (Functional annotations of groups): 238,214 bytes, MD5 checksum 669a47ce1c48deb04a96a385d3c90701
https://fileshare.csb.univie.ac.at/vog/vog230/vfold.annotations.tsv.gz
vfold.faa.tar.gz (Protein sequences of groups): 61,549,560 bytes, MD5 checksum 7452d78330ffc5cea220b96b43f5864a
https://fileshare.csb.univie.ac.at/vog/vog230/vfold.faa.tar.gz
vfold.lca.tsv.gz (Last common ancestors of groups): 362,523 bytes, MD5 checksum c0f72e6a260853e2885802d25085f9e9
https://fileshare.csb.univie.ac.at/vog/vog230/vfold.lca.tsv.gz
vfold.members.tsv.gz (Member protein IDs of groups): 4,237,526 bytes, MD5 checksum f5af8a2cbf2d4924b947e31a542f0951
https://fileshare.csb.univie.ac.at/vog/vog230/vfold.members.tsv.gz
vfold.virusonly.tsv.gz (Specificity of groups to viruses): 80,972 bytes, MD5 checksum 13394e4030519894ad7482e7b40e987d
https://fileshare.csb.univie.ac.at/vog/vog230/vfold.virusonly.tsv.gz
vog.annotations.tsv.gz (Functional annotations of groups): 365,437 bytes, MD5 checksum d289c5b26a545e2a08debea61d17b1d8
https://fileshare.csb.univie.ac.at/vog/vog230/vog.annotations.tsv.gz
vogdb.functional_categories.txt (Letter codes of functional categories): 264 bytes, MD5 checksum 91da9fb2ea00ce7ffb3248a072847a62
https://fileshare.csb.univie.ac.at/vog/vog230/vogdb.functional_categories.txt
vogdb.genes.all.fa.gz (Gene sequences of all genomes): 165,282,731 bytes, MD5 checksum 4032e4c4f163e6732d1764de36826652
https://fileshare.csb.univie.ac.at/vog/vog230/vogdb.genes.all.fa.gz
vogdb.host.txt (Host information and classification for genomes): 476,614 bytes, MD5 checksum 2079e3a463837e41b83813a550111525
https://fileshare.csb.univie.ac.at/vog/vog230/vogdb.host.txt
vogdb.proteins.all.fa.gz (Protein sequences of all genomes): 102,652,919 bytes, MD5 checksum d3800cd702e1d04b6b74d12a47dceca5
https://fileshare.csb.univie.ac.at/vog/vog230/vogdb.proteins.all.fa.gz
vogdb.species.txt (Virus genomes used for VOG construction): 779,729 bytes, MD5 checksum d01ca734ab50b673204f243575e45761
https://fileshare.csb.univie.ac.at/vog/vog230/vogdb.species.txt
vogdb.taxonomy.krona.html (Distribution of virus genome taxonomies): 5,854,305 bytes, MD5 checksum bc560a9ceea1a4efc7f6932c0defbc40
https://fileshare.csb.univie.ac.at/vog/vog230/vogdb.taxonomy.krona.html
vog.faa.tar.gz (Protein sequences of groups): 63,579,878 bytes, MD5 checksum 2c93f01dc70bec1644205bed98ecb2ab
https://fileshare.csb.univie.ac.at/vog/vog230/vog.faa.tar.gz
vog.hmm.tar.gz (Hidden Markov Models of groups): 562,512,287 bytes, MD5 checksum d9df8dd6261bcc76f61752466d38af76
https://fileshare.csb.univie.ac.at/vog/vog230/vog.hmm.tar.gz
vog.lca.tsv.gz (Last common ancestors of groups): 556,026 bytes, MD5 checksum 48aca0fe730f62142a51471b579facde
https://fileshare.csb.univie.ac.at/vog/vog230/vog.lca.tsv.gz
vog.members.tsv.gz (Member protein IDs of groups): 4,401,744 bytes, MD5 checksum 7f995bed2106bb03b4aa0922457811b2
https://fileshare.csb.univie.ac.at/vog/vog230/vog.members.tsv.gz
vog.raw_algs.alistat.txt (Statistics of multiple alignments): 3,625,960 bytes, MD5 checksum 6caadbd96551066fda65cf8711c5b05e
https://fileshare.csb.univie.ac.at/vog/vog230/vog.raw_algs.alistat.txt
vog.raw_algs.tar.gz (Multiple sequence alignments of groups): 59,437,448 bytes, MD5 checksum 73b2b56dbe3e325b49ef744f3fd10ce6
https://fileshare.csb.univie.ac.at/vog/vog230/vog.raw_algs.tar.gz
vog.virusonly.tsv.gz (Specificity of groups to viruses): 126,048 bytes, MD5 checksum 21d3a6151a33b645ac77274cfe4862f2
https://fileshare.csb.univie.ac.at/vog/vog230/vog.virusonly.tsv.gz
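
Each file above is listed with its size in bytes and an MD5 checksum, so a download can be checked before use. The following is a minimal sketch of such a check in Python, using only the standard library. The helper names (`file_url`, `md5_of_file`, `verify_download`) are hypothetical and not part of VOGDB, and the `BASE_URL` constant is an assumption inferred from the common prefix of the links listed above.

```python
import hashlib
import os
import urllib.request

# Assumption: common prefix of the release 230 links listed above.
BASE_URL = "https://fileshare.csb.univie.ac.at/vog/vog230/"


def file_url(filename):
    """Build the download URL for a file name from the listing (hypothetical helper)."""
    return BASE_URL + filename


def download(filename, dest_dir="."):
    """Fetch one listed file into dest_dir and return the local path."""
    dest = os.path.join(dest_dir, filename)
    urllib.request.urlretrieve(file_url(filename), dest)
    return dest


def md5_of_file(path, chunk_size=1 << 20):
    """Return the hex MD5 digest of a file, read in chunks to bound memory use."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify_download(path, expected_md5, expected_size):
    """Compare a local file against the size and MD5 checksum from the listing."""
    # The size check is cheap and catches truncated downloads early.
    if os.path.getsize(path) != expected_size:
        return False
    return md5_of_file(path) == expected_md5.lower()
```

For example, after `path = download("vogdb.functional_categories.txt")`, a call to `verify_download(path, "91da9fb2ea00ce7ffb3248a072847a62", 264)` should return `True` if the file arrived intact. Reading in chunks matters mainly for the larger archives, such as the roughly 2 GB structure prediction tarball.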