Setup

We first recommend to clone the repository.

gitGitHub CLI

git clone https://github.com/dienerlab/pipelines

gh repo clone dienerlab/pipelines

It's also a good idea to install nextflow into your conda base environment.

conda install -c conda-forge -c bioconda nextflow

Install environments

Most of the pipelines contain a conda environment file that can be used. For instance, to set up the metagenomics environment:

cd /path/to/cloned/repository
conda env create -f metagenomics/conda.yml

No additional setup is required for pipelines using docker or apptainer/singularity.

Configure Nextflow

If you are part of our lab pleae use the configuration from our wiki.

Otherwise you can use the following basic configuration which sets up profiles for local execution or SLURM:

nextflow.config

profiles {
    standard {
        executor {
            name = "local"
            cpus = 12
            memory = "128 GB"
        }
    }

    slurm {
        executor {
            name = "slurm"
            queueSize = 40
            pollInterval = '2 min'
            dumpInterval = '3 min'
            queueStatInterval = '5 min'
            exitReadTimeout = '10 min'
            killBatchSize = 50
            submitRateLimit = '40 min'
        }
        process.queue = "cpu"

    }

    tower {
        workspaceId = "<my-id>"
        accessToken = "<my-token>"
        enabled = false
    }
}

singularity.runOptions = "-B $TMPDIR"

Make sure to make any adjustments beforehand. For instance, for SLURM you probably want to use the correct name for the queue in your HPC system.If you use Seqera Cloud (formerly Tower) also add in your token and woskspace ID and set it to enabled.

Either copy this file into your project directory or create the file ~/.nextflow/config in your home directory and copy the contents there.

mkdir ~/.nextflow

After that edit and copy the config:

cp /path/to/pipelines/nextlow.config ~/.nextflow/config

You can switch between the profiles by using the Nextflow -profile flag.

nextflow run [PIPELINE] -profile slurm ...

Note

For the lab config the default is already using SLURM. You only need to specify the profile if you want to force local execution:

nextflow run [PIPELINE] -profile local ...

Run the pipelines

There are generally two options to run the pipelines:

Directly call the pipeline from the cloned directory
Create a symlink to the specific [pipeline].nf file

directlink

export DLP=/path/to/cloned/repository
nextflow run $DLP/metagenomics ...

ln -s /path/to/cloned/repository/metagenomics/main.nf
nextflow run main.nf ...

If the pipeline is using a conda environment it should be run using the -with-conda flag indicating the directory where the environment was created, usually one in ~/miniforge3/envs. For the docker based ones usually it is run using apptainer/singularity.

Please see the individual pipelines for instructions.

For the metagenomics pipeline this looks like:

nexflow run $DLP/metagenomics/main.nf -resume -with-conda=$HOME/miniforge3/envs/metagenomics ...