Sodamhan.com

TL;DR

nextclade

Bioinformatics tool for virus genome alignment, clade assignment and qc checks. More information: https://docs.nextstrain.org/projects/nextclade/en/stable/user/nextclade-cli/index.html.

  • Align sequences to user provided reference, outputting the alignment to a file:

nextclade run path/to/sequences.fa -r path/to/reference.fa -o path/to/alignment.fa

  • Create a tSV report, auto-downloading the latest dataset:

nextclade run path/to/fasta -d dataset_name -t path/to/report.tsv

  • List all available datasets:

nextclade dataset list

  • Download the latest SARS-CoV-2 dataset:

nextclade dataset get --name sars-cov-2 --output-dir path/to/directory

  • Use a downloaded Dataset, producing all Outputs:

nextclade run -D path/to/dataset_dir -O path/to/output_dir path/to/sequences.fasta

  • Run on multiple files:

nextclade run -d dataset_name -t path/to/output_tsv -- path/to/input_fasta_1 path/to/input_fasta_2 ...

  • Try reverse complement if sequence does not align:

nextclade run --retry-reverse-complement -d dataset_name -t path/to/output_tsv path/to/input_fasta

This document was created using the contents of the tldr project.