Release date
Release ID
232
Data source
NCBI RefSeq release 232
Number of genomes
15054
Number of proteins
696483
VOGDB group
Number of VFAM: 39637 (Virus protein families)
Number of VOG: 48439 (Virus orthologous groups)
Number of VFOLD: 33064 (Virus protein structural folds)
Base URL
VOGDB File
vog.raw_algs.alistat.txt (Statistics of multiple alignments): 3,247,042 bytes, MD5 checksum 54f25efe01cdcecdf36e2bf2390f7bd6
https://fileshare.csb.univie.ac.at/vog/vog232/vog.raw_algs.alistat.txt
vog.annotations.tsv.gz (Functional annotations of groups): 370,498 bytes, MD5 checksum d8a6f6e14ed5e60712826436c0a529e3
https://fileshare.csb.univie.ac.at/vog/vog232/vog.annotations.tsv.gz
vfam.members.tsv.gz (Member protein ids of groups): 4,370,673 bytes, MD5 checksum 4610e1db81d08d2e7072aefd75ee98c6
https://fileshare.csb.univie.ac.at/vog/vog232/vfam.members.tsv.gz
vfam.virusonly.tsv.gz (Specificity of groups to viruses): 104,442 bytes, MD5 checksum d7047f5cd35161226beb3aecc774a516
https://fileshare.csb.univie.ac.at/vog/vog232/vfam.virusonly.tsv.gz
vfam.lca.tsv.gz (Last common ancestors of groups): 470,612 bytes, MD5 checksum 0670d7a3998728fa4809f89089ec891a
https://fileshare.csb.univie.ac.at/vog/vog232/vfam.lca.tsv.gz
vog.hmm.tar.gz (Hidden Markov Models of groups): 563,704,576 bytes, MD5 checksum 264e7126f8839a4b2786b5b620823d83
https://fileshare.csb.univie.ac.at/vog/vog232/vog.hmm.tar.gz
vfam.annotations.tsv.gz (Functional annotations of groups): 293,911 bytes, MD5 checksum b4553a03a8fcc9ed3a3f93c18d6c43aa
https://fileshare.csb.univie.ac.at/vog/vog232/vfam.annotations.tsv.gz
vog.faa.tar.gz (Protein sequences of groups): 63,847,852 bytes, MD5 checksum 0c1164c1ee325ed44e56de90a91ac4aa
https://fileshare.csb.univie.ac.at/vog/vog232/vog.faa.tar.gz
vogdb.host.txt (Host information and classification for genomes): 463,575 bytes, MD5 checksum ea5773f9fcaf1ae469797b9115400e56
https://fileshare.csb.univie.ac.at/vog/vog232/vogdb.host.txt
vfold.faa.tar.gz (Protein sequences of groups): 61,829,088 bytes, MD5 checksum 89c0f6bd161755b96247dc7656c9ac19
https://fileshare.csb.univie.ac.at/vog/vog232/vfold.faa.tar.gz
vfam.raw_algs.tar.gz (Multiple sequence alignments of groups): 60,665,831 bytes, MD5 checksum 898f41c7503b5e8018fe8ff7013b11e7
https://fileshare.csb.univie.ac.at/vog/vog232/vfam.raw_algs.tar.gz
vog.members.tsv.gz (Member protein ids of groups): 4,442,742 bytes, MD5 checksum 3288463edd6ea2254695c6cd6e80e8d9
https://fileshare.csb.univie.ac.at/vog/vog232/vog.members.tsv.gz
vfold.members.tsv.gz (Member protein ids of groups): 4,328,470 bytes, MD5 checksum bfe9625a1ba5b6a016429df2cc7c8071
https://fileshare.csb.univie.ac.at/vog/vog232/vfold.members.tsv.gz
vfam.hmm.tar.gz (Hidden Markov Models of groups): 459,123,399 bytes, MD5 checksum f15ae54455afe1466c804b09ebf81c66
https://fileshare.csb.univie.ac.at/vog/vog232/vfam.hmm.tar.gz
vfold.annotations.tsv.gz (Functional annotations of groups): 241,049 bytes, MD5 checksum c01e0bd2a032550a8ac08883ed1032a4
https://fileshare.csb.univie.ac.at/vog/vog232/vfold.annotations.tsv.gz
vfam.representatives.colabfold_predictions.tar.gz (Protein structure predictions): 7,756,665,248 bytes, MD5 checksum 7314337b8a935540f8fff02c0fe77acd
https://fileshare.csb.univie.ac.at/vog/vog232/vfam.representatives.colabfold_predictions.tar.gz
vogdb.proteins.all.fa.gz (Protein sequences of all genomes): 103,463,128 bytes, MD5 checksum e413e01eab459429de20f4a63a1af259
https://fileshare.csb.univie.ac.at/vog/vog232/vogdb.proteins.all.fa.gz
vogdb.genes.all.fa.gz (Gene sequences of all genomes): 166,512,496 bytes, MD5 checksum 54a3fbb900f2c977a9e5ee7c73054cca
https://fileshare.csb.univie.ac.at/vog/vog232/vogdb.genes.all.fa.gz
vfam.raw_algs.alistat.txt (Statistics of multiple alignments): 2,703,945 bytes, MD5 checksum b64ce2c7813dd98f46064bb6ecfd4b53
https://fileshare.csb.univie.ac.at/vog/vog232/vfam.raw_algs.alistat.txt
vfam.representatives.colabfold_mean_plddt.txt (Mean pLDDT values of protein structure predictions): 633,200 bytes, MD5 checksum 39df86f973d7dcf7ec241ab0fea6f57c
https://fileshare.csb.univie.ac.at/vog/vog232/vfam.representatives.colabfold_mean_plddt.txt
vog.virusonly.tsv.gz (Specificity of groups to viruses): 125,693 bytes, MD5 checksum 6d6f549dc9a82b39d876f41cc95ac04f
https://fileshare.csb.univie.ac.at/vog/vog232/vog.virusonly.tsv.gz
vog.raw_algs.tar.gz (Multiple sequence alignments of groups): 59,636,671 bytes, MD5 checksum 3f304c47a2751cd98e73567eab9c0df8
https://fileshare.csb.univie.ac.at/vog/vog232/vog.raw_algs.tar.gz
vfold.lca.tsv.gz (Last common ancestors of groups): 390,141 bytes, MD5 checksum ec63d2f824fdfdb1ba11f2ed2f98a276
https://fileshare.csb.univie.ac.at/vog/vog232/vfold.lca.tsv.gz
vogdb.functional_categories.txt (Letter codes of functional categories): 308 bytes, MD5 checksum 6b816cc49c17d0095da91bad4e7552fa
https://fileshare.csb.univie.ac.at/vog/vog232/vogdb.functional_categories.txt
vfold.virusonly.tsv.gz (Specificity of groups to viruses): 86,450 bytes, MD5 checksum f050c01f021c677e269d01fa9bc4f874
https://fileshare.csb.univie.ac.at/vog/vog232/vfold.virusonly.tsv.gz
vog.lca.tsv.gz (Last common ancestors of groups): 585,992 bytes, MD5 checksum 9780d0ef376cb3105e8c033373fa2cc8
https://fileshare.csb.univie.ac.at/vog/vog232/vog.lca.tsv.gz
vogdb.species.txt (Virus genomes used for VOG construction): 782,393 bytes, MD5 checksum 6dde9668531368f875c6256bbd08b988
https://fileshare.csb.univie.ac.at/vog/vog232/vogdb.species.txt
vogdb.taxonomy.krona.html (Distribution of virus genome taxonomies): 5,864,072 bytes, MD5 checksum 88496b9994d1a6f3f748599e65b0c4fa
https://fileshare.csb.univie.ac.at/vog/vog232/vogdb.taxonomy.krona.html
vfam.faa.tar.gz (Protein sequences of groups): 63,074,335 bytes, MD5 checksum 1f361a5a1371a4c897aa25a249fe9272
https://fileshare.csb.univie.ac.at/vog/vog232/vfam.faa.tar.gz
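
Each file above is listed with its size in bytes and an MD5 checksum, so a download can be checked before use. The following is a minimal Python sketch of such a check, not an official VOGDB tool; the function names `md5_of_file` and `verify_download` are hypothetical, and the file is assumed to have already been downloaded to a local path.

```python
import hashlib
from pathlib import Path


def md5_of_file(path, chunk_size=1 << 20):
    """Return the hex MD5 digest of a file, read in chunks to limit memory use."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        # iter() with a sentinel stops when read() returns an empty bytes object.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify_download(path, expected_size, expected_md5):
    """Check a local file against the byte size and MD5 checksum from the listing."""
    path = Path(path)
    # Compare the size first: it is cheap and catches truncated downloads.
    if path.stat().st_size != expected_size:
        return False
    return md5_of_file(path) == expected_md5.lower()
```

For example, after saving vogdb.functional_categories.txt locally, `verify_download("vogdb.functional_categories.txt", 308, "6b816cc49c17d0095da91bad4e7552fa")` should return `True` if the file is intact. Note that the listed sizes use thousands separators, which must be removed when passing them as integers.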