Release date
Release ID
235
Data source
NCBI Refseq release 235
Number of genomes
15646
Number of proteins
719983
VOGDB group
Number of VFAM: 39808 (Virus protein families)
Number of VOG: 48646 (Virus orthologous groups)
Number of VFOLD: 33168 (Virus protein structural folds)
Base URL
VOGDB File
vog.raw_algs.alistat.txt (Statistics of multiple alignments): 3,271,162 bytes, MD5 checksum 85aadc23a1fc9d4450b1d6ed4dd155dd
https://fileshare.csb.univie.ac.at/vog/vog235/vog.raw_algs.alistat.txt
vog.annotations.tsv.gz (Funcational annotations of groups): 371,457 bytes, MD5 checksum 1f6481d5b4fa82ec716caa142a638190
https://fileshare.csb.univie.ac.at/vog/vog235/vog.annotations.tsv.gz
vfam.members.tsv.gz (Member protein ids of groups): 4,550,166 bytes, MD5 checksum 9c65bd0b20f3782c5e56659d1df839e8
https://fileshare.csb.univie.ac.at/vog/vog235/vfam.members.tsv.gz
vfam.virusonly.tsv.gz (Specificity if groups to Viruses): 105,200 bytes, MD5 checksum 5fa26182051112b06a931bae3aae2df1
https://fileshare.csb.univie.ac.at/vog/vog235/vfam.virusonly.tsv.gz
vfam.lca.tsv.gz (Last common aencestors of groups): 497,395 bytes, MD5 checksum b78878d70feb20e79a3069fc5c830d92
https://fileshare.csb.univie.ac.at/vog/vog235/vfam.lca.tsv.gz
vog.hmm.tar.gz (Hidden Markov Models of groups): 572,994,391 bytes, MD5 checksum ef92a4c880126ea3b86e52fdf902ba06
https://fileshare.csb.univie.ac.at/vog/vog235/vog.hmm.tar.gz
vfam.annotations.tsv.gz (Funcational annotations of groups): 294,137 bytes, MD5 checksum 20c8ffdc7a754f5ae224516faf84bd12
https://fileshare.csb.univie.ac.at/vog/vog235/vfam.annotations.tsv.gz
vog.faa.tar.gz (Protein sequences of groups): 66,220,480 bytes, MD5 checksum f40d8a1ac3b02c8b14a8b70c7d910d47
https://fileshare.csb.univie.ac.at/vog/vog235/vog.faa.tar.gz
vogdb.host.txt (Host information and classification for genomes): 482,633 bytes, MD5 checksum 8db6ce183779d105ce78acdfc0f2652a
https://fileshare.csb.univie.ac.at/vog/vog235/vogdb.host.txt
vfold.faa.tar.gz (Protein sequences of groups): 64,110,540 bytes, MD5 checksum 770348ee822b1e166f9c5f1b79d40130
https://fileshare.csb.univie.ac.at/vog/vog235/vfold.faa.tar.gz
vfam.raw_algs.tar.gz (Multiple sequence alignments of groups): 62,847,087 bytes, MD5 checksum 713cdb47d5428f9b1c0984bb7a8c8ddf
https://fileshare.csb.univie.ac.at/vog/vog235/vfam.raw_algs.tar.gz
vog.members.tsv.gz (Member protein ids of groups): 4,631,415 bytes, MD5 checksum 6e4789ecd517f7f0f6a4e8433486e1e2
https://fileshare.csb.univie.ac.at/vog/vog235/vog.members.tsv.gz
vfold.members.tsv.gz (Member protein ids of groups): 4,515,230 bytes, MD5 checksum 432de22ded6b124bafa74fab883d7ccf
https://fileshare.csb.univie.ac.at/vog/vog235/vfold.members.tsv.gz
vfam.hmm.tar.gz (Hidden Markov Models of groups): 463,359,936 bytes, MD5 checksum ed7e43c877c1f6201f8c5fa4f51698e7
https://fileshare.csb.univie.ac.at/vog/vog235/vfam.hmm.tar.gz
vfold.annotations.tsv.gz (Funcational annotations of groups): 243,224 bytes, MD5 checksum 1a8f3c89470a2b59172432682ce6db58
https://fileshare.csb.univie.ac.at/vog/vog235/vfold.annotations.tsv.gz
vfam.representatives.colabfold_predictions.tar.gz (Protein structure predictions): 7,834,407,189 bytes, MD5 checksum 44d9e30f77f6062e43fa5f19c6ec16f3
https://fileshare.csb.univie.ac.at/vog/vog235/vfam.representatives.colabfold_predictions.tar.gz
vogdb.proteins.all.fa.gz (Protein sequences of all genomes): 106,448,839 bytes, MD5 checksum cbe52bc431a17f3151a4be425c825a0d
https://fileshare.csb.univie.ac.at/vog/vog235/vogdb.proteins.all.fa.gz
vogdb.genes.all.fa.gz (Gene sequences of all genomes): 171,387,463 bytes, MD5 checksum 80dbfedab373421dc51025e28f6f046e
https://fileshare.csb.univie.ac.at/vog/vog235/vogdb.genes.all.fa.gz
vfam.raw_algs.alistat.txt (Statistics of multiple alignments): 2,720,802 bytes, MD5 checksum 1d95525e583d7f5f0fe9a469c970ed49
https://fileshare.csb.univie.ac.at/vog/vog235/vfam.raw_algs.alistat.txt
vfam.representatives.colabfold_mean_plddt.txt (Mean pLDDT values of protein structure predictions): 635,888 bytes, MD5 checksum 2459f9050db3ac3496c5ea0913c3a494
https://fileshare.csb.univie.ac.at/vog/vog235/vfam.representatives.colabfold_mean_plddt.txt
vog.virusonly.tsv.gz (Specificity if groups to Viruses): 126,237 bytes, MD5 checksum 6976c520c9b0ebaccaad75e2fe5d95df
https://fileshare.csb.univie.ac.at/vog/vog235/vog.virusonly.tsv.gz
vog.raw_algs.tar.gz (Multiple sequence alignments of groups): 61,847,213 bytes, MD5 checksum 676e286a49d6b1716cd3a0f6e1d1c2d9
https://fileshare.csb.univie.ac.at/vog/vog235/vog.raw_algs.tar.gz
vfold.lca.tsv.gz (Last common aencestors of groups): 412,625 bytes, MD5 checksum bc051ddfd1f80ed8d99bafddcde797e9
https://fileshare.csb.univie.ac.at/vog/vog235/vfold.lca.tsv.gz
vogdb.functional_categories.txt (Lettercodes of functional categories): 308 bytes, MD5 checksum 6b816cc49c17d0095da91bad4e7552fa
https://fileshare.csb.univie.ac.at/vog/vog235/vogdb.functional_categories.txt
vfold.virusonly.tsv.gz (Specificity if groups to Viruses): 86,919 bytes, MD5 checksum 784cd9b82e399a2f55cf9c20d2ebb058
https://fileshare.csb.univie.ac.at/vog/vog235/vfold.virusonly.tsv.gz
vog.lca.tsv.gz (Last common aencestors of groups): 615,054 bytes, MD5 checksum 3a785ed8e7f61ea264b00326e5998f0b
https://fileshare.csb.univie.ac.at/vog/vog235/vog.lca.tsv.gz
vogdb.species.txt (Virus genomes used for VOG construction): 815,682 bytes, MD5 checksum 62f5793436c6f263b9827e76f00faa2e
https://fileshare.csb.univie.ac.at/vog/vog235/vogdb.species.txt
vogdb.taxonomy.krona.html (Distribution of virus genome taxonomies): 6,933,398 bytes, MD5 checksum 125eb083dc816556c945964907cecc93
https://fileshare.csb.univie.ac.at/vog/vog235/vogdb.taxonomy.krona.html
vfam.faa.tar.gz (Protein sequences of groups): 65,384,759 bytes, MD5 checksum 45195e9f8a037f50ea7770bedfcb5c54
https://fileshare.csb.univie.ac.at/vog/vog235/vfam.faa.tar.gz