nf-core/viralrecon
Assembly and intrahost/low-frequency variant calling for viral samples
2.5). The latest
stable release is
2.6.0
.
Define where the pipeline should find input data and save output data.
Path to comma-separated file containing information about the samples you would like to analyse.
string^\S+\.csv$NGS platform used to sequence the samples.
stringSpecifies the type of protocol used for sequencing.
stringThe output directory where the results will be saved. You have to use absolute paths to storage on Cloud infrastructure.
stringEmail address for completion summary.
string^([a-zA-Z0-9_\-\.]+)@([a-zA-Z0-9_\-\.]+)\.([a-zA-Z]{2,5})$Options for the reference genome indices used to align reads.
Name of viral reference genome.
stringPath to FASTA genome file.
string^\S+\.fn?a(sta)?(\.gz)?$Full path to GFF annotation file.
string^\S+\.gff(\.gz)?$Path to directory or tar.gz archive for pre-built Bowtie2 index.
stringIf the ‘—protocol amplicon’ parameter is provided then iVar is used to trim primer sequences after read alignment and before variant calling.
string^\S+\.bed(\.gz)?$If the ‘—protocol amplicon’ parameter is provided then Cutadapt is used to trim primer sequences from FastQ files before de novo assembly.
string^\S+\.fn?a(sta)?(\.gz)?$The primer set to be used for the data analysis.
stringVersion of the primer set e.g. ‘—primer_set artic —primer_set_version 3’.
numberSuffix used in name field of ‘—primer_bed’ to indicate left primer position.
string_LEFTSuffix used in name field of ‘—primer_bed’ to indicate right primer position.
string_RIGHTIf generated by the pipeline save reference genome related files to the results folder.
booleanOptions exclusive to running the pipeline on Nanopore data using the ARTIC fieldbioinformatics pipeline.
Path to a folder containing fastq files from the Nanopore run.
stringPath to a folder containing fast5 files from the Nanopore run.
stringSequencing summary file generated after Nanopore run completion.
string^\S+\.txt$Minimum number of raw reads required per sample/barcode in order to be considered for the downstream processing steps.
integer100Minimum number of reads required after the artic guppyplex process per sample/barcode in order to be considered for the downstream processing steps.
integer10Variant caller used when running artic minion (default: ‘nanopolish’).
stringAligner used when running artic minion (default: ‘minimap2’).
stringPrimer scheme recognised by the artic minion command.
stringParameter passed to artic minion and required when using the ‘—artic_minion_caller medaka’ workflow.
stringSkip pycoQC.
booleanSkip NanoPlot.
booleanOptions common to both the Nanopore and Illumina workflows in the pipeline.
Full path to Nextclade dataset required for ‘nextclade run’ command.
stringName of Nextclade dataset to retrieve. A list of available datasets can be obtained using the ‘nextclade dataset list’ command.
stringAccession id to download dataset based on a particular reference sequence. A list of available datasets can be obtained using the ‘nextclade dataset list’ command.
stringVersion tag of the dataset to download. A list of available datasets can be obtained using the ‘nextclade dataset list’ command.
stringMaximum read depth used to generate ASCIIGenome screenshots for variant locii.
integer50Maximum window size before and after variant locii used to generate ASCIIGenome screenshots.
integer50Custom title for the MultiQC report.
stringCustom config file to supply to MultiQC.
stringFile size limit when attaching MultiQC reports to summary emails.
string25.MBSkip genome-wide and amplicon coverage plot generation from mosdepth output.
booleanSkip Pangolin lineage analysis for genome consensus sequence.
booleanSkip Nextclade clade assignment, mutation calling, and sequence quality checks for genome consensus sequence.
booleanSkip variant screenshot generation with ASCIIGenome.
booleanSkip generation of QUAST aggregated report for consensus sequences.
booleanSkip long table generation for reporting variants.
booleanSkip MultiQC.
booleanOptions to adjust QC, read trimming and host read filtering with Kraken2 for the Illumina workflow.
Full path to Kraken2 database built from host genome.
strings3://nf-core-awsmegatests/viralrecon/input_data/kraken2_human.tar.gzName for host genome as recognised by Kraken2 when using the ‘kraken2 build’ command.
stringhumanRemove host reads identified by Kraken2 before running variant calling steps in the pipeline.
booleanRemove host reads identified by Kraken2 before running aseembly steps in the pipeline.
booleantrueSave the trimmed FastQ files in the results directory.
booleanSkip FastQC.
booleanSkip Kraken2 process for removing host classified reads.
booleanSkip the initial read trimming step peformed by fastp.
booleanSkip the amplicon trimming step with Cutadapt when using —protocol amplicon.
booleanVarious options for the variant calling branch of the Illumina workflow.
Specify which variant calling algorithm you would like to use. Available options are ‘ivar’ (default for ‘—protocol amplicon’) and ‘bcftools’ (default for ‘—protocol metagenomic’).
stringSpecify which consensus calling algorithm you would like to use. Available options are ‘bcftools’ and ‘ivar’ (default: ‘bcftools’).
stringMinimum number of mapped reads below which samples are removed from further processing. Some downstream steps in the pipeline will fail if this threshold is too low.
integer1000This option unsets the ‘-e’ parameter in ‘ivar trim’ to discard reads without primers.
booleanThis option sets the ‘-x’ parameter in ‘ivar trim’ so that reads that occur at the specified offset positions relative to primer positions will also be trimmed.
integerFiltered duplicates reads detected by Picard MarkDuplicates from alignments.
booleanSave unaligned reads in FastQ format from Bowtie 2 to the results directory.
booleanSave mpileup files generated when calling variants with iVar variants or iVar consensus.
booleanSkip iVar primer trimming step. Not recommended for —protocol amplicon.
booleanSkip picard MarkDuplicates step.
booleantrueSkip Picard CollectMultipleMetrics steps.
booleanSkip SnpEff and SnpSift annotation of variants.
booleanSkip creation of consensus base density plots.
booleanSkip genome consensus creation step and any downstream QC.
booleanSpecify this parameter to skip all of the variant calling and mapping steps in the pipeline.
booleanVarious options for the de novo assembly branch of the Illumina workflow.
Specify which assembly algorithms you would like to use. Available options are ‘spades’, ‘unicycler’ and ‘minia’.
stringspadesSpecify the SPAdes mode you would like to run (default: ‘rnaviral’).
stringPath to profile HMMs specific for gene/organism to enhance SPAdes assembly.
stringPath to directory or tar.gz archive for pre-built BLAST database.
stringSkip Bandage image creation for assembly visualisation.
booleanSkip blastn of assemblies relative to reference genome.
booleanSkip ABACAS process for assembly contiguation.
booleanSkip assembly report generation by PlasmidID.
booleanSkip generation of QUAST aggregated report for assemblies.
booleanSpecify this parameter to skip all of the de novo assembly steps in the pipeline.
booleanLess common options for the pipeline, typically set in a config file.
Display help text.
booleanMethod used to save pipeline results to output directory.
stringEmail address for completion summary, only when pipeline fails.
string^([a-zA-Z0-9_\-\.]+)@([a-zA-Z0-9_\-\.]+)\.([a-zA-Z]{2,5})$Send plain-text email instead of HTML.
booleanDo not use coloured log outputs.
booleanDirectory to keep pipeline Nextflow logs and reports.
string${params.outdir}/pipeline_infoRun this workflow with Conda. You can also use ‘-profile conda’ instead of providing this parameter.
booleanBoolean whether to validate parameters against the schema at runtime
booleantrueShow all params when using --help
booleanSet the top limit for requested resources for any single job.
Maximum number of CPUs that can be requested for any single job.
integer16Maximum amount of memory that can be requested for any single job.
string128.GB^\d+(\.\d+)?\.?\s*(K|M|G|T)?B$Maximum amount of time that can be requested for any single job.
string240.h^(\d+\.?\s*(s|m|h|day)\s*)+$Parameters used to describe centralised config profiles. These should not be edited.
Git commit id for Institutional configs.
stringmasterBase directory for Institutional configs.
stringhttps://raw.githubusercontent.com/nf-core/configs/masterInstitutional config name.
stringInstitutional config description.
stringInstitutional config contact information.
stringInstitutional config URL link.
string