Parameters
A set of default parameters is provided, but these can be easily altered by either editing the
params.config file, or by passing the new value when executing the workflow with nextflow.
There are three main parameters in LMAS: ``–reference``, ``–fastq`` and ``–md``.
The reference sequences, in a single multifasta file, can be passed with the --reference parameter, and --fastq receives the
raw data for assembly. The raw data is a collection of sequence fragments from the references and can be either
obtained in silico or from real sequencing platforms. Users can pass text information, in a markdown file,
on input samples to be presented in the report with the --md parameter.
Several options are available to alter the behaviour of the assemblers incorporated in LMAS, namely to alter the values of the k-mer for each assembly iteration. By default, these values reflect the corresponding default settings of the assemblers.
The target values for some quality assessment metrics can also be adjusted, such as N50 and NG50.
Input Files
fastq
Path expression to paired-end fastq files. Required.
Param:
--fastqDefault:
data/fastq/*_{1,2}.*
reference
Path to reference fasta file. Required.
Param:
--referenceDefault:
data/reference/*.fasta
md
Path to markdown file with text to be displayed in the report. Optional.
Param:
--mdDefault:
data/*.md
Assembly Quality Assessment
Minimum contig length
Value for minimum contig length, in basepairs.
Param:
--minLengthDefault: 1000
Mapped reads threshold
Value for the minimum percentage of a read length aligning to the contig to be considered as mapped.
Param:
--mapped_reads_thresholdDefault: 0.75
N Target
Target value for the N*x*, NA*x* and NG*x* metrics.
Param:
--n_targetDefault: 0.9
L Target
Target value for the L*x* metric.
Param:
--l_targetDefault: 0.5
L Target
Scale of x-axis for the L, NA and NG metrics plots.
Param:
--plot_scaleDefault: log
Assembler options
ABySS
Param:
--abyssDefinition: Boolean controling the execution of the ABySS assembler.
Default: true
Param:
--abyssKmerSizeDefinition: K-mer size for the ABySS assembler, as an intiger.
Default: 96
Param:
--abyssBloomSizeDefinition: Bloom filter size for the ABySS assembler. It must be a sting with a value and an unit.
Default: 2G
GATB Minia Pipeline
Param:
--gatb_miniaDefinition: Boolean controling the execution of the GATB Minia Pipeline assembler.
Default: true
Param:
--gatbKmerSizeDefinition: K-mer sizes for the GATB Minia Pipeline assembler. It must be a sting with the values separated with a comma.
Default: ‘21,61,101,141,181’
Param:
--gatb_besst_iterDefinition: Number of iteration during BESST scaffolding for the GATB Minia Pipeline assembler.
Default: 10000
Param:
--gatb_error_correctionDefinition: Boolean to control weather to skip error correction for the GATB Minia Pipeline assembler.
Default: false
IDBA-UD
Param:
--idbaDefinition: Boolean controling the execution of the IDBA-UD assembler.
Default: true
MetaHipMer2
Param:
--metahipmer2Definition: Boolean controling the execution of the MetaHipMer2 assembler.
Default: true
Param:
--metahipmer2KmerSizeDefinition: K-mer sizes for the MetaHipMer2 assembler. It must be a sting with the values separated with a comma.
Default: ‘21,33,55,77,99’
Minia
Param:
--miniaDefinition: Boolean controling the execution of the minia assembler.
Default: true
Param:
--miniaKmerSizeDefinition: K-mer size for the minia assembler, as an intiger.
Default: 31
MEGAHIT
Param:
--megahitDefinition: Boolean controling the execution of the MEGAHIT assembler.
Default: true
Param:
--megahitKmerSizeDefinition: K-mer sizes for the MEGAHIT assembler. It must be a sting with the values separated with a comma.
Default: ‘21,29,39,59,79,99,119,141’
metaSPAdes
Param:
--metaspadesDefinition: Boolean controling the execution of the metaSPAdes assembler.
Default: true
Param:
--metaspadesKmerSizeDefinition: K-mer sizes for the metaSPAdes assembler. It must be a sting with ‘auto’ or with the values separated with a space.
Default: ‘auto’
SPAdes
Param:
--spadesDefinition: Boolean controling the execution of the SPAdes assembler.
Default: true
Param:
--spadesKmerSizeDefinition: K-mer sizes for the metaSPAdes assembler. It must be a sting with ‘auto’ or with the values separated with a space.
Default: ‘auto’
SKESA
Param:
--skesaDefinition: Boolean controling the execution of the SKESA assembler.
Default: true
Unicycler
Param:
--unicyclerDefinition: Boolean controling the execution of the Unicycler assembler.
Default: true
VelvetOptimiser
Param:
--velvetoptimiserDefinition: Boolean controling the execution of the VelvetOptimiser assembler.
Default: true
Param:
--velvetoptimiser_hashsDefinition: Starting K-mer size for the VelvetOptimiser assembler, as an intiger.
Default: 19
Param:
--velvetoptimiser_hasheDefinition: End K-mer size for the VelvetOptimiser assembler, as an intiger.
Default: 31
Execution Resources Parameters
CPUs
Number of CPUs for the assembly and mapping processes, as an intiger. This resource is double for each retry until max_cpus is reached.
Param:
--cpusDefault: 8
Memory
Memory for the assembly and mapping processes, in the format of ‘value’.’unit’. This resource is double for each retry until max_memory is reached.
Param:
--memoryDefault: 32.Gb
Time
Time limit for the assembly and mapping processes, in the format of ‘value’.’unit’. This resource is double for each retry until max_time is reached.
Param:
--memoryDefault: 24.h
Max_cpus
Maximum number of CPUs for the assembly and mapping processes, as an intiger.
Param:
--max_cpusDefault: 32
Max_memory
Maximum memory for the assembly and mapping processes, in the format of ‘value’.’unit’.
Param:
--max_memoryDefault: 100.Gb
Max_time
Maximum time for the assembly and mapping processes, in the format of ‘value’.’unit’.
Param:
--max_memoryDefault: 72.h