Define where the pipeline should find input data and save output data.

Path to comma-separated file containing information about the samples in the experiment.

required
type: string
pattern: ^\S+\.csv$

The output directory where the results will be saved. You have to use absolute paths to storage on Cloud infrastructure.

required
type: string

Email address for completion summary.

type: string
pattern: ^([a-zA-Z0-9_\-\.]+)@([a-zA-Z0-9_\-\.]+)\.([a-zA-Z]{2,5})$

Parameters used to describe centralised config profiles. These should not be edited.

Git commit id for Institutional configs.

hidden
type: string
default: master

Base directory for Institutional configs.

hidden
type: string
default: https://raw.githubusercontent.com/nf-core/configs/master

Institutional config name.

hidden
type: string

Institutional config description.

hidden
type: string

Institutional config contact information.

hidden
type: string

Institutional config URL link.

hidden
type: string

Less common options for the pipeline, typically set in a config file.

Display version and exit.

hidden
type: boolean

Method used to save pipeline results to output directory.

hidden
type: string

Email address for completion summary, only when pipeline fails.

hidden
type: string
pattern: ^([a-zA-Z0-9_\-\.]+)@([a-zA-Z0-9_\-\.]+)\.([a-zA-Z]{2,5})$

Send plain-text email instead of HTML.

hidden
type: boolean

Do not use coloured log outputs.

hidden
type: boolean

Boolean whether to validate parameters against the schema at runtime

hidden
type: boolean
default: true

Base URL or local path to location of pipeline test dataset files

hidden
type: string
default: https://raw.githubusercontent.com/nf-core/test-datasets/refs/heads/genomeassembler/

Display the full detailed help message.

type: boolean

Display hidden parameters in the help message (only works when –help or –help_full are provided).

type: boolean

Options controlling pipeline behavior

Path to reference genome seqeunce (fasta)

type: string

Path to reference genome annotations (gff)

type: string

use reference genome

hidden
type: boolean

Options controlling assembly

Assembly strategy to use. Valid choices are 'single', 'hybrid' and 'scaffold'

type: string
default: single

Assembler to use. Valid choices depend on strategy; for single either flye or hifiasm, hybrid can be done with hifiasm and for scaffolded assembly provide the names of the assemblers separated with an underscore. The first assembler will be used for ONT reads, the second for HiFi reads. (see below, asembler1 and assembler_hifi)

type: string

When strategy is “scaffold”, which assembly should be scaffolded onto which?

type: string

expected genome size, optional

type: string

flye mode

type: string

additional args for flye

type: string

Extra arguments passed to hifiasm

type: string

assembler_ont assembles ONT reads. This option is mainly useful when building more complex samplesheets

type: string

Arguments to be passed to assembler_ont (ONT)

type: string

assembler_hifi assembles HiFi reads. This option is mainly useful when building more complex samplesheets

type: string

Arguments to be passed to assembler_hifi (HiFi)

type: string

Path to ONT reads

type: string

Collect ONT reads from several files?

type: boolean

Adaptors for ONT read-trimming

type: string

Additional args to be passed to fastplong for ONT reads

type: string

Path to HiFi reads

type: string

Adaptors for HiFi read-trimming

type: string

Additional args to be passed to fastplong for HiFi reads

type: string

Run jellyfish and genomescope (recommended)

type: boolean

Value of k used during k-mer analysis with jellyfish

type: integer
default: 21

Initial hash size used by jellyfish count

type: string
default: 200M

Polishing options

Polish assembly with pilon? Requires short reads

type: boolean

Polish assembly with dorado (ONT only)

type: boolean

Polish assembly with medaka (ONT only)

type: boolean

model to use with medaka

type: string

Scaffolding options

Scaffold with longstitch?

type: boolean

Scaffolding with links?

type: boolean

Scaffold with ragtag (requires reference)?

type: boolean

Scaffold using HiC reads using yahs (requires reads)?

type: boolean
default: true

Aligner to use for HiC reads; default: bwa-mem2

type: string

Options for QC tools

Run merqury (if short reads are provided)

type: boolean
default: true

Long reads that should be used for QC when both ONT and HiFi reads are provided. Options are 'ont' or 'hifi'

type: string

Run BUSCO?

type: boolean

Path to busco db (optional)

type: string

Busco lineage to use

type: string
default: auto_euk

Run quast

type: boolean

A mapping (bam) of reads mapped to the reference can be provided for QC. If provided, alignment to reference fasta will not run.

type: string

Can be used to proved existing assembly will skip assembly and perform downstream steps including qc

type: string

A mapping (bam) of reads mapped to the provided assembly can be specified for QC. If provided, alignment to the provided assembly fasta will not run

type: string

Index size to use for csi index (default: 14), creating and index of size 2^csi_index_size. See samtools index documentation for details.

type: integer
default: 14

Options controlling annotation liftover

Lift-over annotations (requires ref_gff)?

type: boolean
default: true

Options for short reads

Use short reads?

type: boolean

Trim short reads?

type: boolean

kmer length for meryl / merqury

type: integer
default: 21

Path to forward short reads

type: string

Path to reverse short reads

type: string

Are shortreads paired?

type: string

Options for HiC short reads

Trim HiC short reads?

type: boolean
default: true

Path to forward HiC short reads

type: string

Path to reverse HiC short reads

type: string