Release date
Release ID
229
Data source
NCBI RefSeq release 229
Number of genomes
14975
Number of proteins
685789
VOGDB group
Number of VFAM: 39553 (Virus protein families)
Number of VFOLD: 33010 (Virus protein structural folds)
Number of VOG: 48299 (Virus orthologous groups)
Base URL
VOGDB File
vfam.annotations.tsv.gz (Functional annotations of groups): 284,780 bytes, MD5 checksum a8b36f018d90e5215c525dc453398683
https://fileshare.csb.univie.ac.at/vog/vog229/vfam.annotations.tsv.gz
vfam.faa.tar.gz (Protein sequences of groups): 62,555,530 bytes, MD5 checksum abee523e356b3bce66e8bc0a9a1a5b37
https://fileshare.csb.univie.ac.at/vog/vog229/vfam.faa.tar.gz
vfam.hmm.tar.gz (Hidden Markov Models of groups): 458,255,167 bytes, MD5 checksum 6fb26e447c49a7593feb83876ada718c
https://fileshare.csb.univie.ac.at/vog/vog229/vfam.hmm.tar.gz
vfam.lca.tsv.gz (Last common ancestors of groups): 433,787 bytes, MD5 checksum f356a28d3b8d2427b8e99e93b0b1fba4
https://fileshare.csb.univie.ac.at/vog/vog229/vfam.lca.tsv.gz
vfam.members.tsv.gz (Member protein ids of groups): 4,221,593 bytes, MD5 checksum 9a959dc51e1530f95de5ee6509a1ad3a
https://fileshare.csb.univie.ac.at/vog/vog229/vfam.members.tsv.gz
vfam.raw_algs.alistat.txt (Statistics of multiple alignments): 3,014,906 bytes, MD5 checksum 31fee2250a09e1d54377c1bf48035950
https://fileshare.csb.univie.ac.at/vog/vog229/vfam.raw_algs.alistat.txt
vfam.raw_algs.tar.gz (Multiple sequence alignments of groups): 60,430,396 bytes, MD5 checksum 3e47c106b6d35ed20a5aad80f216353c
https://fileshare.csb.univie.ac.at/vog/vog229/vfam.raw_algs.tar.gz
vfam.representatives.colabfold_mean_plddt.txt (Mean pLDDT values of protein structure predictions): 0 bytes, MD5 checksum d41d8cd98f00b204e9800998ecf8427e
https://fileshare.csb.univie.ac.at/vog/vog229/vfam.representatives.colabfold_mean_plddt.txt
vfam.representatives.colabfold_predictions.tar.gz (Protein structure predictions): 2,027,246,897 bytes, MD5 checksum 213108a012805e38838549eb10074fe0
https://fileshare.csb.univie.ac.at/vog/vog229/vfam.representatives.colabfold_predictions.tar.gz
vfam.virusonly.tsv.gz (Specificity of groups to Viruses): 104,140 bytes, MD5 checksum 3dcd558ee7e8910176995322c909150b
https://fileshare.csb.univie.ac.at/vog/vog229/vfam.virusonly.tsv.gz
vfold.annotations.tsv.gz (Functional annotations of groups): 237,838 bytes, MD5 checksum b9023f475defaf026a4171138b675309
https://fileshare.csb.univie.ac.at/vog/vog229/vfold.annotations.tsv.gz
vfold.faa.tar.gz (Protein sequences of groups): 61,399,093 bytes, MD5 checksum 705baa28f00bf008adfce104fea20105
https://fileshare.csb.univie.ac.at/vog/vog229/vfold.faa.tar.gz
vfold.lca.tsv.gz (Last common ancestors of groups): 360,718 bytes, MD5 checksum 1aa58ce1ba9149fac4eebc2ecd509453
https://fileshare.csb.univie.ac.at/vog/vog229/vfold.lca.tsv.gz
vfold.members.tsv.gz (Member protein ids of groups): 4,204,043 bytes, MD5 checksum 0c18e5b00b5833a4461a62f0652b75d6
https://fileshare.csb.univie.ac.at/vog/vog229/vfold.members.tsv.gz
vfold.virusonly.tsv.gz (Specificity of groups to Viruses): 80,853 bytes, MD5 checksum a22008641bc6cf5fff86961d7a16600c
https://fileshare.csb.univie.ac.at/vog/vog229/vfold.virusonly.tsv.gz
vog.annotations.tsv.gz (Functional annotations of groups): 364,963 bytes, MD5 checksum 469ecfa7cdf1a45ee92a1f875b96953e
https://fileshare.csb.univie.ac.at/vog/vog229/vog.annotations.tsv.gz
vogdb.functional_categories.txt (Lettercodes of functional categories): 264 bytes, MD5 checksum 91da9fb2ea00ce7ffb3248a072847a62
https://fileshare.csb.univie.ac.at/vog/vog229/vogdb.functional_categories.txt
vogdb.genes.all.fa.gz (Gene sequences of all genomes): 164,458,591 bytes, MD5 checksum d9fd27c6c79306bac12ec855ce59bf71
https://fileshare.csb.univie.ac.at/vog/vog229/vogdb.genes.all.fa.gz
vogdb.host.txt (Host information and classification for genomes): 475,117 bytes, MD5 checksum 49c933704754aabd753b1e8d975c7714
https://fileshare.csb.univie.ac.at/vog/vog229/vogdb.host.txt
vogdb.proteins.all.fa.gz (Protein sequences of all genomes): 102,105,096 bytes, MD5 checksum fec4ee56e06b3669dda0ecce072333f6
https://fileshare.csb.univie.ac.at/vog/vog229/vogdb.proteins.all.fa.gz
vogdb.species.txt (Virus genomes used for VOG construction): 778,402 bytes, MD5 checksum 2bd0d7d03a478246e59d219ce66a2fd7
https://fileshare.csb.univie.ac.at/vog/vog229/vogdb.species.txt
vogdb.taxonomy.krona.html (Distribution of virus genome taxonomies): 5,849,368 bytes, MD5 checksum 656143f639a2c99f691c88f762dc07d7
https://fileshare.csb.univie.ac.at/vog/vog229/vogdb.taxonomy.krona.html
vog.faa.tar.gz (Protein sequences of groups): 63,422,630 bytes, MD5 checksum ff108995ae2e529eaca52d473b00b8f9
https://fileshare.csb.univie.ac.at/vog/vog229/vog.faa.tar.gz
vog.hmm.tar.gz (Hidden Markov Models of groups): 562,253,307 bytes, MD5 checksum 88868693aef40e50f418e5163deaba01
https://fileshare.csb.univie.ac.at/vog/vog229/vog.hmm.tar.gz
vog.lca.tsv.gz (Last common ancestors of groups): 553,084 bytes, MD5 checksum 10fffd08bb9e38b805f98431799a2d89
https://fileshare.csb.univie.ac.at/vog/vog229/vog.lca.tsv.gz
vog.members.tsv.gz (Member protein ids of groups): 4,369,635 bytes, MD5 checksum cc434e0867f99d49c41503d3a7c1b839
https://fileshare.csb.univie.ac.at/vog/vog229/vog.members.tsv.gz
vog.raw_algs.alistat.txt (Statistics of multiple alignments): 3,623,495 bytes, MD5 checksum d22b5a35c0497a15c389a1b88940b6b9
https://fileshare.csb.univie.ac.at/vog/vog229/vog.raw_algs.alistat.txt
vog.raw_algs.tar.gz (Multiple sequence alignments of groups): 59,446,257 bytes, MD5 checksum 0517a1278d5eff6d03c2ca62a14f8bd3
https://fileshare.csb.univie.ac.at/vog/vog229/vog.raw_algs.tar.gz
vog.virusonly.tsv.gz (Specificity of groups to Viruses): 125,957 bytes, MD5 checksum 93f63d47989b2595a059afe3123a4fd3
https://fileshare.csb.univie.ac.at/vog/vog229/vog.virusonly.tsv.gz
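
Each entry in the listing gives a byte count and an MD5 checksum, so a download can be checked before use. The following is a minimal sketch in Python using only the standard library. The helper names (`file_url`, `md5_of_file`, `matches_listing`) are hypothetical and not part of VOGDB, and the base URL is an assumption inferred from the file URLs in the listing.

```python
import hashlib
from pathlib import Path

# Assumption: every release file sits directly under this directory,
# as the URLs in the listing suggest.
BASE_URL = "https://fileshare.csb.univie.ac.at/vog/vog229/"


def file_url(filename: str, base_url: str = BASE_URL) -> str:
    """Join the release directory and a file name from the listing."""
    return base_url.rstrip("/") + "/" + filename


def md5_of_file(path, chunk_size: int = 1 << 20) -> str:
    """Return the hex MD5 digest of a file, read in chunks to bound memory use."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_listing(path, expected_size: int, expected_md5: str) -> bool:
    """Check a downloaded file against the size and MD5 checksum in the listing."""
    path = Path(path)
    if path.stat().st_size != expected_size:
        return False
    return md5_of_file(path) == expected_md5.lower()
```

For example, once `vog.lca.tsv.gz` has been downloaded from `file_url("vog.lca.tsv.gz")` into the working directory, `matches_listing("vog.lca.tsv.gz", 553084, "10fffd08bb9e38b805f98431799a2d89")` should return `True` for an intact copy. The checksum listed for `vfam.representatives.colabfold_mean_plddt.txt` is the MD5 of empty input, which is consistent with its stated size of 0 bytes.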