All reference database (2,326 plastidial 23S rRNA sequences with 1,388 Eukaryota and 938 Bacteria) at fasta format.
Full length sequences and AlgaeBase taxonomy reference (2,326 plastidial 23S rRNA sequences with 1,388 Eukaryota and 938 Bacteria):
BIOCOM-PIPE format (FASTA with Taxonomy)
mothur format
Taxonomy file only
Full length sequences and NCBI taxonomy reference (2,326 plastidial 23S rRNA sequences with 1,388 Eukaryota and 938 Bacteria):
BIOCOM-PIPE format
mothur format
Taxonomy file only
Full length sequences and PR2/SILVA taxonomy reference (2,217 plastidial 23S rRNA sequences with 1,341 Eukaryota and 876 Bacteria):
BIOCOM-PIPE format
mothur format
Taxonomy file only
Taxonomy file description
The taxonomy file: It is a 2 column file with the following format: sequenceID tab taxonomy lineage specific to each nomenclature (NCBI, AlgaeBase, PR2/SILVA).
Fasta file description
Full length sequences: the associated identifier is in the following form: [C or I or E]AccessionNumber.Letter(if duplicate).start.end;AllLineage where ‘C’ signifies complete, ‘I’ incomplete and ‘E’ environmental.