Release date
Release ID
226
Data source
NCBI RefSeq release 226
Number of genomes
14970
Number of proteins
685326
VOGDB group
Number of VFAM: 39000 (Virus protein families)
Number of VFOLD: 32315 (Virus protein structural folds)
Number of VOG: 60257 (Virus orthologous groups)
Base URL
https://fileshare.csb.univie.ac.at/vog/vog226/
VOGDB File
vfam.annotations.tsv.gz (Functional annotations of groups): 284,364 bytes, MD5 checksum 89c2ea8a3b2c3bac5ac6acf2f3fd66fe
https://fileshare.csb.univie.ac.at/vog/vog226/vfam.annotations.tsv.gz
vfam.faa.tar.gz (Protein sequences of groups): 65,742,699 bytes, MD5 checksum 8dc9789ed0d97cbec9502e7a55e66e75
https://fileshare.csb.univie.ac.at/vog/vog226/vfam.faa.tar.gz
vfam.hmm.tar.gz (Hidden Markov Models of groups): 472,958,111 bytes, MD5 checksum 9b22961f390c258c18912fca2ee6399c
https://fileshare.csb.univie.ac.at/vog/vog226/vfam.hmm.tar.gz
vfam.lca.tsv.gz (Last common ancestors of groups): 421,072 bytes, MD5 checksum 53f50bffd20c73042505ae9598a95438
https://fileshare.csb.univie.ac.at/vog/vog226/vfam.lca.tsv.gz
vfam.members.tsv.gz (Member protein IDs of groups): 4,297,123 bytes, MD5 checksum 4894dac22d98369b57eeaf7cec0674b2
https://fileshare.csb.univie.ac.at/vog/vog226/vfam.members.tsv.gz
vfam.raw_algs.alistat.txt (Statistics of multiple alignments): 2,987,561 bytes, MD5 checksum 12b15700a5ba32122eaad917c07bb299
https://fileshare.csb.univie.ac.at/vog/vog226/vfam.raw_algs.alistat.txt
vfam.raw_algs.tar.gz (Multiple sequence alignments of groups): 63,393,306 bytes, MD5 checksum f5adb92be0b6b52a40e6e74cec4cbf2a
https://fileshare.csb.univie.ac.at/vog/vog226/vfam.raw_algs.tar.gz
vfam.representatives.colabfold_mean_plddt.txt (Mean pLDDT values of protein structure predictions): 896 bytes, MD5 checksum a03beccff797498013014fc18d1b965d
https://fileshare.csb.univie.ac.at/vog/vog226/vfam.representatives.colabfold_mean_plddt.txt
vfam.representatives.colabfold_predictions.tar.gz (Protein structure predictions): 48,873,723 bytes, MD5 checksum 4207466805bcb66f82c90c985f87f85c
https://fileshare.csb.univie.ac.at/vog/vog226/vfam.representatives.colabfold_predictions.tar.gz
vfam.virusonly.tsv.gz (Specificity of groups to viruses): 102,195 bytes, MD5 checksum 80f6fc25edd5f0dd3ab085007ca2095d
https://fileshare.csb.univie.ac.at/vog/vog226/vfam.virusonly.tsv.gz
vfold.annotations.tsv.gz (Functional annotations of groups): 234,546 bytes, MD5 checksum c5cfd31c6440008e2ff2e49ddf650be7
https://fileshare.csb.univie.ac.at/vog/vog226/vfold.annotations.tsv.gz
vfold.faa.tar.gz (Protein sequences of groups): 64,695,429 bytes, MD5 checksum ab12b545e43d155d56494b711545c435
https://fileshare.csb.univie.ac.at/vog/vog226/vfold.faa.tar.gz
vfold.lca.tsv.gz (Last common ancestors of groups): 349,501 bytes, MD5 checksum bdc108176fa3e91aa636507119ae8491
https://fileshare.csb.univie.ac.at/vog/vog226/vfold.lca.tsv.gz
vfold.members.tsv.gz (Member protein IDs of groups): 4,276,015 bytes, MD5 checksum 74e9d5ca56c3c9b90efaec0b3875696e
https://fileshare.csb.univie.ac.at/vog/vog226/vfold.members.tsv.gz
vfold.virusonly.tsv.gz (Specificity of groups to viruses): 79,032 bytes, MD5 checksum bdc0142de5762a08c063767c79e8f769
https://fileshare.csb.univie.ac.at/vog/vog226/vfold.virusonly.tsv.gz
vog.annotations.tsv.gz (Functional annotations of groups): 480,010 bytes, MD5 checksum 4db898ef5807b8d3bf44aad8ab74d2bb
https://fileshare.csb.univie.ac.at/vog/vog226/vog.annotations.tsv.gz
vogdb.functional_categories.txt (Letter codes of functional categories): 264 bytes, MD5 checksum 91da9fb2ea00ce7ffb3248a072847a62
https://fileshare.csb.univie.ac.at/vog/vog226/vogdb.functional_categories.txt
vogdb.genes.all.fa.gz (Gene sequences of all genomes): 164,323,484 bytes, MD5 checksum ceccc694c2a026b27339df7e86746051
https://fileshare.csb.univie.ac.at/vog/vog226/vogdb.genes.all.fa.gz
vogdb.host.txt (Host information and classification for genomes): 524,794 bytes, MD5 checksum e6a9cb8d3cb62c34eaaabfa6571da8e8
https://fileshare.csb.univie.ac.at/vog/vog226/vogdb.host.txt
vogdb.proteins.all.fa.gz (Protein sequences of all genomes): 102,027,794 bytes, MD5 checksum c347177571233635225ffdf8829f9af5
https://fileshare.csb.univie.ac.at/vog/vog226/vogdb.proteins.all.fa.gz
vogdb.species.txt (Virus genomes used for VOG construction): 778,149 bytes, MD5 checksum 4fbfaebd3de3234496ea8918fbb94d1f
https://fileshare.csb.univie.ac.at/vog/vog226/vogdb.species.txt
vogdb.taxonomy.krona.html (Distribution of virus genome taxonomies): 5,809,437 bytes, MD5 checksum 79e2cfd1dea7fcc31a58f79bc3b8d2f5
https://fileshare.csb.univie.ac.at/vog/vog226/vogdb.taxonomy.krona.html
vog.faa.tar.gz (Protein sequences of groups): 69,689,972 bytes, MD5 checksum e45b26c085d6e6079d2be9d21e9135dd
https://fileshare.csb.univie.ac.at/vog/vog226/vog.faa.tar.gz
vog.hmm.tar.gz (Hidden Markov Models of groups): 783,554,982 bytes, MD5 checksum 276cb0865d77ce865da87b12025f3488
https://fileshare.csb.univie.ac.at/vog/vog226/vog.hmm.tar.gz
vog.lca.tsv.gz (Last common ancestors of groups): 649,343 bytes, MD5 checksum b929140bb27174db4b8436e7ead984c4
https://fileshare.csb.univie.ac.at/vog/vog226/vog.lca.tsv.gz
vog.members.tsv.gz (Member protein IDs of groups): 4,623,106 bytes, MD5 checksum dcfb702fa2e7eabb4ec4c8cf1ecddecb
https://fileshare.csb.univie.ac.at/vog/vog226/vog.members.tsv.gz
vog.raw_algs.alistat.txt (Statistics of multiple alignments): 4,648,408 bytes, MD5 checksum ceffa86af2a57e7ae492167690b064fa
https://fileshare.csb.univie.ac.at/vog/vog226/vog.raw_algs.alistat.txt
vog.raw_algs.tar.gz (Multiple sequence alignments of groups): 66,048,344 bytes, MD5 checksum 8318b964e1aab33f2c6fdd0bc352a400
https://fileshare.csb.univie.ac.at/vog/vog226/vog.raw_algs.tar.gz
vog.virusonly.tsv.gz (Specificity of groups to viruses): 156,701 bytes, MD5 checksum 53cbdafbbdc502d79aff1560b3a75c1e
https://fileshare.csb.univie.ac.at/vog/vog226/vog.virusonly.tsv.gz
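
Each entry above lists a byte size and an MD5 checksum, so a downloaded file can be checked against the listing before use. The following is a minimal sketch of such a check in Python using only the standard library. The function names `md5_of_file` and `verify_download` are hypothetical helpers introduced here for illustration and are not part of VOGDB. The example assumes the file has already been downloaded to the current directory under its listed name.

```python
import hashlib
from pathlib import Path


def md5_of_file(path, chunk_size=1 << 20):
    """Return the hex MD5 digest of a file, reading it in chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        # Read fixed-size chunks so large archives need not fit in memory.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify_download(path, expected_md5, expected_size):
    """Return True if the file size and MD5 both match the listed values."""
    path = Path(path)
    # The size check is cheap, so do it before hashing.
    if path.stat().st_size != expected_size:
        return False
    return md5_of_file(path) == expected_md5.lower()


if __name__ == "__main__":
    # Size and checksum copied from the listing entry for this file.
    # The local path is an assumption: adjust it to where the file was saved.
    local = Path("vogdb.functional_categories.txt")
    if local.exists():
        ok = verify_download(local, "91da9fb2ea00ce7ffb3248a072847a62", 264)
        print("OK" if ok else "MISMATCH")
    else:
        print(f"{local} not found; download it first")
```

MD5 is used here only because it is the checksum the listing publishes. It detects truncated or corrupted transfers but is not a safeguard against deliberate tampering.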