nf-core/cutandrun

Analysis pipeline for CUT&RUN and CUT&TAG experiments that includes QC, support for spike-ins, IgG controls, peak calling and downstream analysis.

cutandruncutandrun-seqcutandtagcutandtag-seq

These pages are for an old version of the pipeline (2.0). The latest stable release is 3.2.2 .

Launch version 2.0 https://github.com/nf-core/cutandrun

Define where the pipeline should find input data and save output data.

Path to comma-separated file containing information about the samples in the experiment.

required

type: string

pattern: ^\S+\.csv$

The output directory where the results will be saved. You have to use absolute paths to store on Cloud infrastructure.

required

type: string

MultiQC report title. Printed as page header, used for filename if not otherwise specified.

type: string

Save genome reference data to the output directory

type: boolean

Save any technical replicate FASTQ files that were merged to the output directory

type: boolean

Save trimmed FASTQ files to the output directory

type: boolean

Save BAM files aligned to the spike-in genome to the output directory

type: boolean

Save unaligned sequences to the output directory

type: boolean

Save alignment intermediates to the output directory (WARNING: can be very large)

type: boolean

Email address for completion summary.

type: string

pattern: ^([a-zA-Z0-9_\-\.]+)@([a-zA-Z0-9_\-\.]+)\.([a-zA-Z]{2,5})$

Reference genome related files and options.

Name of iGenomes reference.

type: string

Path to bowtie2 index

type: string

Path to GTF annotation file

type: string

Path to gene BED file

type: string

Path to genome blacklist

type: string

Name of the igenome reference for the spike-in genome

type: string

default: K12-MG1655

Path to spike-in bowtie2 index

type: string

Path to spike-in fasta

type: string

Path to FASTA genome file.

type: string

pattern: ^\S+\.fn?a(sta)?(\.gz)?$

Directory / URL base for iGenomes references.

hidden

type: string

default: s3://ngi-igenomes/igenomes

Do not load the iGenomes reference config.

hidden

type: boolean

Run pipeline up to input checking

type: boolean

Run pipeline up to reference preparation

type: boolean

Run pipeline up to pre-alignment

type: boolean

Run pipeline up to alignment

type: boolean

Run pipeline up to q-filtering

type: boolean

Run pipeline up to peak calling

type: boolean

Skips fastqc reporting

type: boolean

Skips trimming

type: boolean

Skips de-duplication

type: boolean

Skips reporting

type: boolean

Skips igv session generation

type: boolean

Skips deeptools heatmap generation

type: boolean

Skips multiqc

type: boolean

Skip upset plot calculation

type: boolean

Skip fragments in peaks calculation

type: boolean

Instructs Trim Galore to remove bp from the 5’ end of read 1 (or single-end reads).

type: integer

Instructs Trim Galore to remove bp from the 5’ end of read 2 (paired-end reads only).

type: integer

Instructs Trim Galore to remove bp from the 3’ end of read 1 AFTER adapter/quality trimming has been performed.

type: integer

Instructs Trim Galore to remove bp from the 3’ end of read 2 AFTER adapter/quality trimming has been performed.

type: integer

Instructs Trim Galore to apply the —nextseq=X option, to trim based on quality after removing poly-G tails.

type: integer

Select aligner

hidden

type: string

default: bowtie2

Normalisation constant for spike-in read normalisation

hidden

type: integer

default: 10000

Filter reads below a q-score threshold

type: integer

De-duplicate target reads AND control reads (default is control only)

type: boolean

Sets the target read normalisation mode. Options are: [“Spikein”, “RPKM”, “CPM”, “BPM”, “None” ]

type: string

If normsalisation option is one of “RPKM”, “CPM”, “BPM” - then the binsize that the reads count is calculated on is used.

type: integer

default: 1

Threshold for peak calling when no IgG is present

type: number

default: 0.05

Selects the peak caller for the pipeline. Options are: [seacr, macs2]. More than one peak caller can be chosen and the order specifies which is a primary peak called (the first) that will be used downstream. Any secondary peak callers will be run and outputed to the results folder.

type: string

default: seacr

Specifies whether to use a control to normalise peak calls against (e.g. IgG)

type: boolean

default: true

Specifies whether the background control is scaled prior to being used to normalise peaks.

type: number

default: 1

P-value threshold for macs2 peak caller

type: number

default: 0.05

parameter required by MACS2. If using an iGenomes reference these have been provided when --genome is set as GRCh37, GRCh38, GRCm38, WBcel235, BDGP6, R64-1-1, EF2, hg38, hg19 and mm10. Otherwise the gsize will default to GRCh38.

type: number

default: 2700000000

Specifies whether to run macs2 in narrow peak mode

type: boolean

Specifies what samples to group together for consensus peaks. Options are [group, all]

type: string

Minimum number of overlapping replicates needed for a consensus peak

type: number

default: 1

Parameters used to describe centralised config profiles. These should not be edited.

Git commit id for Institutional configs.

hidden

type: string

default: master

Base directory for Institutional configs.

hidden

type: string

default: https://raw.githubusercontent.com/nf-core/configs/master

Institutional config name.

hidden

type: string

Institutional config description.

hidden

type: string

Institutional config contact information.

hidden

type: string

Institutional config URL link.

hidden

type: string

Set the top limit for requested resources for any single job.

Maximum number of CPUs that can be requested for any single job.

hidden

type: integer

default: 16

Maximum amount of memory that can be requested for any single job.

hidden

type: string

default: 128.GB

pattern: ^\d+(\.\d+)?\.?\s*(K|M|G|T)?B$

Maximum amount of time that can be requested for any single job.

hidden

type: string

default: 240.h

pattern: ^(\d+\.?\s*(s|m|h|day)\s*)+$

Less common options for the pipeline, typically set in a config file.

Display help text.

hidden

type: boolean

Method used to save pipeline results to output directory.

hidden

type: string

Email address for completion summary, only when pipeline fails.

hidden

type: string

pattern: ^([a-zA-Z0-9_\-\.]+)@([a-zA-Z0-9_\-\.]+)\.([a-zA-Z]{2,5})$

Send plain-text email instead of HTML.

hidden

type: boolean

File size limit when attaching MultiQC reports to summary emails.

hidden

type: string

default: 25.MB

pattern: ^\d+(\.\d+)?\.?\s*(K|M|G|T)?B$

Do not use coloured log outputs.

hidden

type: boolean

Custom config file to supply to MultiQC.

hidden

type: string

Directory to keep pipeline Nextflow logs and reports.

hidden

type: string

default: ${params.outdir}/pipeline_info

Boolean whether to validate parameters against the schema at runtime

hidden

type: boolean

default: true

Show all params when using --help

hidden

type: boolean

Run this workflow with Conda. You can also use ‘-profile conda’ instead of providing this parameter.

hidden

type: boolean

On this page