Part 1: Run Nextflow¶

Translation in Progress

This page has not yet been translated into your language. You are viewing the original English content.

Want to help? See our translation guide.

In this first part of the Nextflow Triathlon, we introduce the core concepts of running Nextflow pipelines. We start with a simple Hello World workflow, then progress to a complete multi-step pipeline that processes multiple inputs in parallel using containers.

Tip

Make sure your working directory is set to triathlon/basics/ before running the commands in this part.

cd basics

1. Hello World¶

The workflow 1-hello.nf takes a greeting via a command-line argument and writes it to a file.

1.1. Launch the workflow¶

Run the following command in your terminal.

nextflow run 1-hello.nf --input 'Hello World!'

Command output

N E X T F L O W   ~  version 25.10.4

Launching `1-hello.nf` [infallible_volhard] DSL2 - revision: 82d40dbf94

executor >  local (1)
[6d/740edd] sayHello | 1 of 1 ✔

The most important line in the output is the last one:

[6d/740edd] sayHello | 1 of 1 ✔

This tells us that the sayHello process ran successfully once. The [6d/740edd] prefix is a truncated path to the task's working directory — more on that below.

1.2. Find the output¶

This workflow is configured to publish its output to a results directory. After running, you should find the output there:

results/

results
└── 1-hello
    └── Hello World!-output.txt

Open the file to confirm it contains Hello World!.

1.3. Explore the `work/` directory¶

Behind the scenes, Nextflow creates a unique task directory for every process call inside a directory named work/. The hash shown in the console output ([6d/740edd]) is the path to that directory.

ls work/6d/740edd*

Inside you will find the output file along with several hidden log files:

.command.sh: the exact command Nextflow ran
.command.out / .command.err: stdout and stderr from the process
.command.log: combined log output
.exitcode: the process exit code

The .command.sh file is especially useful when debugging — it shows precisely what was executed.

1.4. Understand the workflow code¶

Let's open 1-hello.nf and look at its main components.

Full code file

1-hello.nf
#!/usr/bin/env nextflow

include { sayHello } from './modules/sayHello.nf'

/*
 * Pipeline parameters
 */
params {
    input: String
}

workflow {

    main:
    // emit a greeting
    sayHello(params.input)

    publish:
    first_output = sayHello.out
}

output {
    first_output {
        path '1-hello'
        mode 'copy'
    }
}

We see the following:

an include statement pointing to a process module
a params block defining pipeline parameters
a workflow block describing the work to be done
an output block describing what to do with the outputs

Let's take a look at each in turn.

1.4.1. The `process` module¶

The include statement tells Nextflow to load something called sayHello from a separate code file.

1-hello.nf
include { sayHello } from './modules/sayHello.nf'

In that file, we find the definition for a process called sayHello:

modules/sayHello.nf
process sayHello {

    input:
    val greeting

    output:
    path "${greeting}-output.txt"

    script:
    """
    echo '${greeting}' > '${greeting}-output.txt'
    """
}

A process defines a single step in the pipeline. It declares its inputs, outputs, and the script to execute. The val qualifier means the input is a plain value (string, number, etc.). The path qualifier means the output is a file path.

It is possible to write the process definition in the main workflow file, but keeping them in separate module files makes them reusable:the same module can be imported by multiple workflow scripts.

1.4.2. The `params` block¶

The params block declares the command-line parameters the workflow accepts:

1-hello.nf
params {
    input: String
}

Any parameter declared here becomes available on the command line with a double-dash (--input). Supported types include String, Integer, Float, Boolean, and Path.

Tip

Workflow parameters always use two dashes (--input) to distinguish them from Nextflow's own CLI flags, which use one dash (e.g. -resume).

1.4.3. The `workflow` block¶

The workflow block defines the dataflow logic: which processes to run and in what order.

1-hello.nf
workflow {

    main:
    // emit a greeting
    sayHello(params.input)

    publish:
    first_output = sayHello.out
}

Here there is only one process being called so it's very simple; we will cover more realistic examples later.

The main: section calls the sayHello process with the --input value. The publish: section lists which outputs should be copied to the results directory.

1.4.4. The `output` block¶

The output block at the bottom of the file specifies the destination path and copy mode.

1-hello.nf
output {
    first_output {
        path '1-hello'
        mode 'copy'
    }
}

Each named entry corresponds to a publish: label in the workflow and maps it to a subdirectory under results/.

Takeaway¶

You know how to run a Nextflow pipeline, find its outputs, and understand its main components: the process modules imported using include statements, and the params, workflow and output blocks.

What's next?¶

Find out how Nextflow handles multiple inputs efficiently.

2. Process multiple inputs¶

Real-world pipelines typically process many pieces of data, not just one. The workflow 2-inputs.nf reads from a CSV file and runs sayHello once per row, in parallel.

Let's run the workflow first, then we'll take a look at what mechanism Nextflow uses to handle these multiple inputs.

2.1. Run the workflow¶

Run the following command in your terminal.

nextflow run 2-inputs.nf --input data/greetings.csv

Command output

N E X T F L O W   ~  version 25.10.4

Launching `2-inputs.nf` [nauseous_babbage] DSL2 - revision: b90778224d

executor >  local (3)
[66/de7844] sayHello (3) | 3 of 3 ✔

The 3 of 3 tells us the sayHello process was called three times, once per row in the CSV.

In the results directory, you should now see three output files, one per greeting:

results/2-inputs/

2-inputs
├── Bonjour-output.txt
├── Hello-output.txt
└── Hola-output.txt

Open any of the output files to confirm each one contains a greeting.

2.2. Run the workflow again with `-ansi-log false`¶

By default, Nextflow condenses the output to a single summary line per process. To see each process call listed individually, add -ansi-log false:

nextflow run 2-inputs.nf --input data/greetings.csv -ansi-log false

Command output

N E X T F L O W   ~  version 25.10.4

Launching `2-inputs.nf` [extravagant_bardeen] DSL2 - revision: b90778224d
[43/0bac1c] Submitted process > sayHello (1)
[2d/99f604] Submitted process > sayHello (2)
[6d/7578d7] Submitted process > sayHello (3)

This shows all three process calls and the unique work subdirectory created for each one.

2.3. How the multiple inputs are handled¶

The key change in 2-inputs.nf is in the main: section of the workflow:

2-inputs.nf
    main:
    // create a channel for inputs from a CSV file
    greeting_ch = channel.fromPath(params.input)
                        .splitCsv()
                        .map { line -> line[0] }
    // emit a greeting
    sayHello(greeting_ch)

What you see here is called a channel: a queue construct that handles input data in a way that makes it easy to parallelize operations.

channel.fromPath(params.input) creates a channel from the file path given with --input
.splitCsv() parses the CSV into rows
.map { line -> line[0] } extracts the first column from each row

The result is a channel containing Hello, Bonjour, and Hola. When passed to sayHello(greeting_ch), Nextflow automatically calls the process once per item, running them in parallel when resources allow.

2.4. Use `-resume` to skip completed work¶

Now switch to the extended input file, which adds two more greetings, and add -resume to the command line:

nextflow run 2-inputs.nf --input data/greetings-extended.csv -resume

Command output

N E X T F L O W   ~  version 25.10.4

Launching `2-inputs.nf` [adoring_mayer] DSL2 - revision: b90778224d

executor >  local (2)
[84/2f3067] sayHello (5) | 5 of 5, cached: 3 ✔

Nextflow ran only the two new inputs. The three greetings processed in the previous run were cached and reused automatically.

This also works to skip executing processes for steps that have already been run successfully in a multi-step pipeline. For example, if a pipeline run was interrupted by a system error, or if you added new steps to a pipeline in development.

The -resume capability is especially valuable in long pipelines where recovering from failure can save critical time and resources.

Takeaway¶

You understand how channels load data from files and enable automatic parallelism, and how to use -resume to avoids repeating completed work.

What's next?¶

Learn how a complete multi-step pipeline chains processes together using channels, and how to use containers to manage analysis tools and their dependencies.

3. Run a multi-step pipeline¶

The workflow main.nf chains four processes into a complete pipeline.

Each input greeting flows through all four steps: sayHello writes it to a file, convertToUpper converts the text to uppercase, collectGreetings merges all results into one file, and cowpy generates ASCII art from the merged output using a containerized tool.

3.1. Run the workflow¶

Run the following command in your terminal.

nextflow run main.nf --input data/greetings.csv --character turkey

Command output

N E X T F L O W   ~  version 25.10.4

Launching `main.nf` [reverent_jepsen] DSL2 - revision: c3c85dec78

executor >  local (8)
[87/35824b] sayHello (3)       | 3 of 3 ✔
[0f/31c37e] convertToUpper (1) | 3 of 3 ✔
[49/c25ca1] collectGreetings   | 1 of 1 ✔
[a5/29e13f] cowpy              | 1 of 1 ✔

Four processes ran: sayHello and convertToUpper each ran once per input (3 of 3), collectGreetings gathered all outputs into one file (1 of 1), and cowpy generated the ASCII art (1 of 1).

Check results/full_pipeline/ for the ASCII art file.

3.2. How the processes connect¶

Each process passes its output channel to the next:

main.nf
    main:
    // create a channel for inputs from a CSV file
    greeting_ch = channel.fromPath(params.input)
                        .splitCsv()
                        .map { line -> line[0] }
    sayHello(greeting_ch)
    convertToUpper(sayHello.out)
    collectGreetings(convertToUpper.out.collect(), params.batch)
    cowpy(collectGreetings.out.outfile, params.character)

The pattern processName.out refers to a process's output channel.

The .collect() operator gathers all individual outputs from convertToUpper into a single channel item before passing them to collectGreetings.

3.3. The container directive¶

The cowpy process runs inside a Docker container specified in its module file:

modules/cowpy.nf
process cowpy {

    container 'community.wave.seqera.io/library/cowpy:1.1.5--3db457ae1977a273'

    input:
    path input_file
    val character

    output:
    path "cowpy-${input_file}"

    script:
    """
    cat ${input_file} | cowpy -c "${character}" > cowpy-${input_file}
    """
}

Nextflow automatically pulls the image, runs the script inside the container, and cleans up afterward. Docker is enabled for this project in nextflow.config:

nextflow.config

docker.enabled = true

This single line enables Docker for any process in the pipeline that has a container specified.

Takeaway¶

You understand how multi-step pipelines chain processes together using output channels, and how the container directive lets Nextflow manage software dependencies automatically.

What's next?¶

Learn how to configure pipeline behavior using nextflow.config.

4. Configure the pipeline¶

Nextflow automatically picks up nextflow.config from the working directory and applies its settings to every run.

We provide you with a configuration file that covers four areas: software packaging, process settings, pipeline parameters and execution profiles. We'll review each briefly then zoom in on two points of special interest: how to generate an execution report, and how to use a test profile.

4.1. The configuration file¶

The full file contains four configuration sections.

nextflow.config

nextflow.config
/*
 * Software packaging
 */
docker.enabled = true

/*
 * Process settings
 */
process {
    memory = 1.GB
    // withName: 'cowpy' {
    //     memory = 2.GB
    //     cpus = 2
    // }
}

/*
 * Pipeline parameters
 */
params {
    input = 'data/greetings.csv'
    batch = 'batch'
    character = 'turkey'
}

/*
 * Profiles
 */
profiles {
    test {
        params.input = 'data/greetings.csv'
        params.batch = 'test'
        params.character = 'tux'
    }
    conda {
        docker.enabled = false
        conda.enabled = true
    }
}

Software packaging (docker.enabled = true): enables Docker for all processes. Any process that declares a container directive runs inside the specified image. To use Conda instead, activate the conda profile.

Process settings: sets a 1 GB memory limit for all processes. The commented withName: 'cowpy' block is included as a syntax example showing how to apply separate limits to a specific process. It is commented out because cowpy only appears in main.nf, and an unmatched selector produces a warning when running the other scripts. Feel free to uncomment it, run main.nf again, and experiment with different resource values.

Parameter defaults: provide fallback values for parameters not supplied on the command line. Running nextflow run main.nf with no flags uses these values.

Profiles: group settings that activate together when you pass -profile <name>. The test profile overrides three parameters to run the pipeline with a small, well-defined input set. The conda profile switches software packaging from Docker to Conda.

Note

This config covers local execution on a single machine. Nextflow also supports HPC schedulers (SLURM, PBS, LSF) and cloud executors (AWS Batch, Google Cloud Batch, Azure Batch), all configured through the same nextflow.config mechanism. See the Nextflow Run: Configuration tutorial for a full walkthrough.

4.2. Generate an execution report¶

Add -with-report to any nextflow run command to generate an HTML report after the pipeline completes:

nextflow run main.nf --input data/greetings.csv --character turkey -with-report

Command output

N E X T F L O W   ~  version 25.10.4

Launching `main.nf` [lonely_aryabhata] DSL2 - revision: c3c85dec78

executor >  local (8)
[9e/f3fa5e] sayHello (3)       | 3 of 3 ✔
[08/4a93d7] convertToUpper (2) | 3 of 3 ✔
[c5/c8595a] collectGreetings   | 1 of 1 ✔
[2a/1e5abe] cowpy              | 1 of 1 ✔

Nextflow writes the report to a file named report-<timestamp>.html in the working directory. Open it in a browser to see an execution summary, a table of every task with its status and runtime, and resource usage charts broken down by process.

The report is especially useful when a pipeline takes longer than expected or a task fails — the task table shows exactly where time was spent and which tasks succeeded or failed.

4.3. Run with a profile¶

The test profile is a standard nf-core convention. Every nf-core pipeline ships with one for quick validation.

Run the pipeline using the test profile:

nextflow run main.nf -profile test

Command output

N E X T F L O W   ~  version 25.10.4

Launching `main.nf` [silly_goodall] DSL2 - revision: c3c85dec78

executor >  local (8)
[06/9614da] sayHello (2)       | 3 of 3 ✔
[8d/5f1b8a] convertToUpper (3) | 3 of 3 ✔
[1e/cd52db] collectGreetings   | 1 of 1 ✔
[2c/5f41d8] cowpy              | 1 of 1 ✔

The pipeline runs with batch = 'test' and character = 'tux'. Check results/full_pipeline/. The batch name appears in the output file names, and the ASCII art features the tux penguin instead of a turkey.

To inspect the fully resolved settings for any profile combination, run nextflow config with the relevant profiles:

nextflow config -profile test

Command output

params {
   input = 'data/greetings.csv'
   batch = 'test'
   character = 'tux'
}

docker {
   enabled = true
}

process {
   memory = '1 GB'
}

This shows the merged result: the test profile's overrides applied on top of the base config. Use it to confirm what settings will be active before running a pipeline.

Takeaway¶

You know how to configure pipeline behavior using nextflow.config, how to generate an HTML execution report with -with-report, and how profiles bundle related settings that activate together with a single flag.

What's next?¶

You've covered the fundamentals of Nextflow. Move on to Part 2 to discover the nf-core community pipeline ecosystem.

Summary¶

In this part you learned to:

Run a Nextflow workflow and find its outputs
Understand the process module, params, and workflow blocks
Process multiple inputs from a CSV file using channels
Use -resume to skip completed work when adding new inputs
Connect processes in a multi-step pipeline with containerized software
Generate an HTML execution report with -with-report
Configure pipeline behavior using nextflow.config and profiles

Part 1: Run Nextflow¶

1. Hello World¶

1.1. Launch the workflow¶

1.2. Find the output¶

1.3. Explore the work/ directory¶

1.4. Understand the workflow code¶

1.4.1. The process module¶

1.4.2. The params block¶

1.4.3. The workflow block¶

1.4.4. The output block¶

Takeaway¶

What's next?¶

2. Process multiple inputs¶

2.1. Run the workflow¶

2.2. Run the workflow again with -ansi-log false¶

2.3. How the multiple inputs are handled¶

2.4. Use -resume to skip completed work¶

Takeaway¶

What's next?¶

3. Run a multi-step pipeline¶

3.1. Run the workflow¶

3.2. How the processes connect¶

3.3. The container directive¶

Takeaway¶

What's next?¶

4. Configure the pipeline¶

4.1. The configuration file¶

4.2. Generate an execution report¶

4.3. Run with a profile¶

Takeaway¶

What's next?¶

Summary¶

1.3. Explore the `work/` directory¶

1.4.1. The `process` module¶

1.4.2. The `params` block¶

1.4.3. The `workflow` block¶

1.4.4. The `output` block¶

2.2. Run the workflow again with `-ansi-log false`¶

2.4. Use `-resume` to skip completed work¶