database: name: RVDB_PROT version: '31.0' release_date: 2026-01 description: Protein-derived clustered and annotated viral database from U-RVDB derived_from: U-RVDBv31.0 author: Thomas Bigot / Bioinformatics Hub @ Institut Pasteur contact: thomas.bigot@pasteur.fr tools: clustering: CD-HIT unknown hmm_generation: HMMER unknown files: protein_sequences: filename: U-RVDBv31.0-prot.fasta.xz description: Unclustered protein sequences (translated from nucleotide RVDB) format: FASTA (xz compressed) content_format: FASTA (uncompressed) sequence_count: 41330762 residue_count: 42916677181 md5_checksum_uncompressed: 4cbe4d34ab83a9bfd648c74cbcc7f52b md5_checksum_compressed: d1a99a889b6efb68939c4484ca700b80 clustered_protein_sequences: filename: U-RVDBv31.0-prot_unique.fasta.xz description: Clustered protein sequences (100% identity threshold) format: FASTA (xz compressed) content_format: FASTA (uncompressed) sequence_count: 786371 residue_count: 292611075 md5_checksum_uncompressed: 388b6e052a55b127841c9ec04c5ff097 md5_checksum_compressed: 68c925691c40045d784ba60c5e5f2d5d hmm_profiles: filename: U-RVDBv31.0-prot.hmm.xz description: HMM profiles built from clustered protein families format: HMMER3 (xz compressed) content_format: HMM text format profile_count: 13758 md5_checksum_uncompressed: 1a879c549dd7e88725c989d7b0d9eb61 md5_checksum_compressed: 22338817bbe1f479bae94ac466634e75 annotations_sqlite: filename: U-RVDBv31.0-prot-hmm.sqlite.xz description: Annotations linked to HMM models (taxonomy, function, keywords) format: SQLite DB (xz compressed) content_format: SQLite binary record_count: 13758 md5_checksum_uncompressed: fe92e20420cd44bbdf8342de7730f466 md5_checksum_compressed: 8122cc02edcaf12d84ead4ec486c0530 annotations_text: filename: U-RVDBv31.0-prot-hmm-txt.tar.xz description: Text-based annotations extracted from SQLite DB format: Tarball (xz compressed) content_format: Tab-separated values file_count_in_tar: 13760 md5_checksum_compressed: 427a405a064c61d2f233812de2a35fba disclaimer: 'This database is distributed for academic and research use only. It should NOT be considered a regulatory standard or official reference. All users must validate data quality before any application.' build_info: sample_name: rvdb31.0 build_timestamp: '2026-02-27T15:05:43.155358' build_date: '2026-02-27 15:05:43'