Chapter 5 Processes
This section includes some extra features related to processes.
A full link of process info can be seen in the nextflow docs
5.1 Directives
You can include directives in each process. These can be used to specify the execution requirements of a process.
Link to full list of process directives
5.1.1 cpus
One example is that you can use cpus
to specify the number of cpus to be used by a process.
5.1.2 publishDir
The basics of pblishDir
My preferred method is to assign an overall output/results directory as a params.
You can also set a subdiretcory of the output directory in the process.
You can even do this with input variables.
5.1.3 conda
A conda environment can be specified.
Further info on using conda environments
5.1.3.1 Local environment
You can specify a conda environment you have locally created.
If using a locally installed env it is best to specify it as a params to make it quicker to add/edit for multiple processes.
5.1.3.2 URI based environment
You can have nextflow install a conda packages for specific process.
You can find what packages can be downloaded this way through sequera containers.
For an easy example search for bioconda::samtools
on the above link.
5.2 Script
5.3 Modules
The primary main.nf
can become quite large by having a lot of processes.
To counteract this each process can be stored in a separate main.nf
file.
The recommendation is to store them in a directory called modules/local
within the main workflow directory.
Then each process would be within a main.nf
file within various directories.
A module main.nf
would be as so:
Modules are imported as a process as so:
It is common to have subdirectories within modules/local
grouped by tools. For example if you were performing 16S analysis with qiime2 you may have some of the following modules: