Download AADB sequence and annotation resources for offline analysis, reproducible workflows, and local reuse.

FASTA Sequences

Download curated AADB protein sequences in FASTA format. Suitable for local sequence analysis and custom workflows.

Complete Database

Download all sequences from the current database version (v1.2).

File: pH_resistance_database_v1.2.faa
Format: FASTA (.faa)
Entries: 728 sequences
Updated: v1.2
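
After downloading, one quick sanity check is to confirm that the file holds the number of entries listed above. The following is a minimal sketch, assuming the file sits in the working directory and that every record starts with a standard ">" header line. The function name count_fasta_records is illustrative and not part of AADB.

```python
from pathlib import Path

EXPECTED_ENTRIES = 728  # entry count listed on this page for v1.2


def count_fasta_records(path):
    """Count FASTA records by counting header lines that start with '>'."""
    count = 0
    with open(path, "r", encoding="utf-8") as handle:
        for line in handle:
            if line.startswith(">"):
                count += 1
    return count


if __name__ == "__main__":
    # Assumption: the downloaded file is in the current working directory.
    fasta_path = Path("pH_resistance_database_v1.2.faa")
    if fasta_path.exists():
        n = count_fasta_records(fasta_path)
        status = "OK" if n == EXPECTED_ENTRIES else "MISMATCH"
        print(f"{fasta_path.name}: {n} records ({status}, expected {EXPECTED_ENTRIES})")
    else:
        print(f"{fasta_path} not found; download it first")
```

A mismatch would most likely point to an incomplete download or to a file from a different release.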

Filtered Download

Download sequences filtered by functional system and/or species.
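
A comparable filter can be approximated locally on the complete FASTA file. The sketch below keeps records whose header line contains a search term. It assumes that the species or functional system appears as plain text in the header, which this page does not specify, so inspect a few headers first. The helper names read_fasta and filter_by_header and the example headers are hypothetical.

```python
def read_fasta(lines):
    """Yield (header, sequence) tuples from an iterable of FASTA lines."""
    header, chunks = None, []
    for line in lines:
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            if header is not None:
                yield header, "".join(chunks)
            header, chunks = line[1:], []
        elif header is not None:
            chunks.append(line)
    if header is not None:
        yield header, "".join(chunks)


def filter_by_header(records, term):
    """Keep records whose header contains `term` (case-insensitive)."""
    needle = term.lower()
    return [(h, s) for h, s in records if needle in h.lower()]


if __name__ == "__main__":
    # Made-up example records; real AADB headers may be laid out differently.
    example = [
        ">id1 gadA Escherichia coli",
        "MKLL",
        ">id2 ureA Helicobacter pylori",
        "MTAA",
    ]
    for header, seq in filter_by_header(read_fasta(example), "escherichia"):
        print(f">{header}\n{seq}")
```

If the headers carry only sequence IDs, the same idea applies by first selecting IDs from the metadata table and then matching on those IDs.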


Version History

v1.2 (Current)

728 sequences, with aligned sequences and enhanced annotations

v1.1

907 sequences, 11 functional systems

Available in data/v1.1/ directory

Note: This version contains duplicate sequences

v1.0

509 sequences, 16 genes

Available in data/v1.0/ directory
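
Because v1.1 is noted as containing duplicate sequences, anyone working with that release may want to collapse duplicates before analysis. The sketch below treats two records as duplicates when their residue strings are identical, ignoring case. That definition is an assumption: this page does not state how duplicates were identified for v1.2, so the result need not match the current release. The function name deduplicate is illustrative.

```python
def deduplicate(records):
    """Drop records whose sequence was already seen, keeping the first occurrence.

    `records` is an iterable of (header, sequence) tuples.
    """
    seen = set()
    unique = []
    for header, seq in records:
        key = seq.upper()  # compare sequences case-insensitively
        if key not in seen:
            seen.add(key)
            unique.append((header, seq))
    return unique


if __name__ == "__main__":
    # Made-up records for illustration only.
    records = [("seq1", "MKLV"), ("seq2", "MTAA"), ("seq3", "mklv")]
    kept = deduplicate(records)
    print(f"{len(records)} records in, {len(kept)} unique sequences out")
```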

Metadata Tables

Download structured metadata tables in CSV, TSV, or Excel format. These files include identifiers, gene/protein labels, organism information, functional system assignments, and related annotations.

Gene Function Table (CSV)

File: gene_function_table.csv
Format: CSV
Entries: 728
Use: Full metadata for all entries

Gene Function Table (Excel)

File: gene_function_table.xlsx
Format: Excel (.xlsx)
Entries: 728
Use: Spreadsheet with formatting

Database Statistics (CSV)

File: database_statistics.csv
Format: CSV
Use: Summary statistics
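
The CSV table can be read with standard tooling. The following minimal sketch counts entries per value of one column using only the Python standard library. The column name "functional_system" is a guess, since this page does not list the exact column headers; the function reports the available columns if the guess is wrong.

```python
import csv
from collections import Counter
from pathlib import Path


def count_by_column(path, column):
    """Count rows in a CSV file grouped by the value found in `column`."""
    with open(path, newline="", encoding="utf-8") as handle:
        reader = csv.DictReader(handle)
        fields = reader.fieldnames or []
        if column not in fields:
            raise KeyError(f"column {column!r} not found; available columns: {fields}")
        return Counter(row[column] for row in reader)


if __name__ == "__main__":
    table = Path("gene_function_table.csv")  # assumed to be in the working directory
    if table.exists():
        # "functional_system" is a hypothetical column name; adjust to the real header.
        for value, n in count_by_column(table, "functional_system").most_common():
            print(f"{value}\t{n}")
    else:
        print(f"{table} not found; download it first")
```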

BLAST Database Files

Download the local BLAST-formatted AADB sequence database for sequence similarity search in external environments.

Complete BLAST Database


Usage Instructions

  1. Download and extract the tar.gz archive
  2. Use the database with BLAST+ tools:
    blastp -query your_sequences.faa -db path/to/pH_resistance_db_final -out results.txt
  3. For more information, see the Database Overview page
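
The steps above can also be scripted. The sketch below extracts the archive with the standard tarfile module and assembles the same blastp command shown in step 2. It assumes BLAST+ is installed and on the PATH. The archive file name is not given on this page, so the name used here is a placeholder, and the helper names are illustrative.

```python
import shutil
import subprocess
import tarfile


def extract_archive(archive_path, dest_dir):
    """Extract a .tar.gz archive into dest_dir and return the member names."""
    with tarfile.open(archive_path, "r:gz") as tar:
        names = tar.getnames()
        if hasattr(tarfile, "data_filter"):
            # Safer extraction on Python versions that support extraction filters.
            tar.extractall(dest_dir, filter="data")
        else:
            tar.extractall(dest_dir)
    return names


def build_blastp_command(query, db, out):
    """Build the blastp argument list from step 2 of the usage instructions."""
    return ["blastp", "-query", str(query), "-db", str(db), "-out", str(out)]


def run_blastp(query, db, out):
    """Run blastp, failing early if the executable is not installed."""
    if shutil.which("blastp") is None:
        raise RuntimeError("blastp not found on PATH; install BLAST+ first")
    subprocess.run(build_blastp_command(query, db, out), check=True)


if __name__ == "__main__":
    # Placeholder archive name; substitute the file you actually downloaded:
    # extract_archive("aadb_blast_db.tar.gz", "blast_db")
    print(" ".join(build_blastp_command(
        "your_sequences.faa", "path/to/pH_resistance_db_final", "results.txt")))
```

Note that the -db argument is the database prefix (pH_resistance_db_final in the example above), not the name of an individual file inside the extracted directory.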

Release Notes

AADB is versioned to support reproducibility and transparent database updates. Please use the release information below when citing or reusing the resource.

Current release: v1.2
  • 728 unique protein sequences
  • 11 functional systems
  • Enhanced annotation and aligned outputs

Version History

Version  Entries  Notes
v1.2     728      Current release. Aligned sequences, enhanced annotations
v1.1     907      Contains duplicates. Available in data/v1.1/
v1.0     509      Initial release. Available in data/v1.0/

Format Information

Supported Formats

  • FASTA (.faa, .fa, .fasta): Standard protein sequence format with headers
  • CSV (.csv): Comma-separated values for spreadsheet applications
  • Excel (.xlsx): Microsoft Excel format with formatting
  • BLAST Database: Pre-built BLAST+ database (tar.gz archive)

Data Fields

Downloaded sequences include the following metadata:

  • Sequence ID
  • Gene name
  • Species
  • Functional system classification
  • pH range
  • Environment
  • Description