Kaiju is a program for the taxonomic classification of high-throughput sequencing reads, e.g., Illumina or Roche/454, from whole-genome sequencing of metagenomic DNA. Reads are directly assigned to taxa using the NCBI taxonomy and a reference database of protein sequences from microbial and viral genomes.
Taxpasta expects a five column output. This is generated by the
kaiju2table companion tool included with the Kaiju. The following format is expected:
file percent reads taxon_id taxon_name barcode41_se-barcode41-kaiju.tsv 2.988734 841 28901 taxonid:28901 barcode41_se-barcode41-kaiju.tsv 2.739969 771 1902245 taxonid:1902245 barcode41_se-barcode41-kaiju.tsv 2.057642 579 2760310 taxonid:2760310 barcode41_se-barcode41-kaiju.tsv 1.929706 543 1108 taxonid:1108
Value used in standardised profile output ↩