Nipype gives you a lot of freedom in how you create workflows, but their execution relies on many default parameters. You are, of course, free to change any of them.
Nipype looks for the configuration options in the local folder, in a file named nipype.cfg, and in ~/.nipype/nipype.cfg (in this order). The options can be divided into Logging, Execution, and Monitoring sections. A few of the possible options are the following:
Logging options:
workflow_level
: How detailed the logs regarding workflow should be (possible values: INFO and DEBUG; default value: INFO)
utils_level
: How detailed the logs regarding nipype utils, like file operations (for example overwriting warnings) or the resource profiler, should be (possible values: INFO and DEBUG; default value: INFO)
interface_level
: How detailed the logs regarding interface execution should be (possible values: INFO and DEBUG; default value: INFO)
filemanip_level (deprecated as of 1.0)
: How detailed the logs regarding file operations (for example overwriting warnings) should be (possible values: INFO and DEBUG)
log_to_file
: Indicates whether logging should also send the output to a file (possible values: true and false; default value: false)
log_directory
: Where to store logs.
log_size
: Size of a single log file.
log_rotate
: How many rotations should the log file make.
Execution options:
plugin
: This defines which execution plugin to use (possible values: Linear, MultiProc, SGE, IPython; default value: Linear)
stop_on_first_crash
: Should the workflow stop upon the first node crashing or try to execute as many nodes as possible? (possible values: true and false; default value: false)
stop_on_first_rerun
: Should the workflow stop upon the first node trying to recompute (that is, rerunning a node that has been run before; this can happen due to changed inputs and/or hash_method since the last run)? (possible values: true and false; default value: false)
hash_method
: Should the input files be checked for changes using their content (slow, but 100% accurate) or just their size and modification date (fast, but potentially prone to errors)? (possible values: content and timestamp; default value: timestamp)
keep_inputs
: Ensures that all inputs that are created in the node's working directory are kept after node execution (possible values: true and false; default value: false)
single_thread_matlab
: Should all of the Matlab interfaces (including SPM) use only one thread? This is useful if you are parallelizing your workflow using MultiProc or IPython on a single multicore machine (possible values: true and false; default value: true)
display_variable
: Override the $DISPLAY environment variable for interfaces that require an X server. This option is useful if there is a running X server but $DISPLAY was not defined in nipype's environment. For example, if an X server is listening on the default port of 6000, set display_variable = :0 to enable nipype interfaces to use it. It may also point to displays provided by VNC, xnest or Xvfb. If neither display_variable nor the $DISPLAY environment variable is set, nipype will try to configure a new virtual server using Xvfb.
remove_unnecessary_outputs
: This will remove any interface outputs not needed by the workflow. If the required outputs from a node change, rerunning the workflow will rerun the node. Outputs of leaf nodes (nodes whose outputs are not connected to any other nodes) will never be deleted, independent of this parameter (possible values: true and false; default value: true)
try_hard_link_datasink
: When the DataSink is used to produce an organized output file outside of nipype's internal cache structure, a file system hard link will be attempted first. A hard link allows multiple file paths to point to the same physical storage location on disk if the conditions allow. By referring to the same physical file on disk (instead of copying files byte-by-byte) we can avoid unnecessary data duplication. If hard links are not supported for the source or destination paths specified, then a standard byte-by-byte copy is used (possible values: true and false; default value: true)
use_relative_paths
: Should the paths stored in results (and used to look for inputs) be relative or absolute? Relative paths allow moving the whole working directory around but may cause problems with symlinks (possible values: true and false; default value: false)
local_hash_check
: Perform the hash check on the job submission machine. This option minimizes the number of jobs submitted to a cluster engine or a multiprocessing pool to only those that need to be rerun (possible values: true and false; default value: true)
job_finished_timeout
: When batch jobs are submitted through SGE/PBS/Condor, they could be killed externally. Nipype checks to see if a results file exists to determine if the node has completed. This timeout determines for how long this check is done after a job finish is detected (float in seconds; default value: 5)
remove_node_directories (EXPERIMENTAL)
: Removes directories whose outputs have already been used up. Doesn't work with IdentityInterface or any node that passes data through without copying (possible values: true and false; default value: false)
stop_on_unknown_version
: If this is set to True, an underlying interface will raise an error when no version information is available. Please notify developers or submit a patch.
parameterize_dirs
: If this is set to True, the node's output directory will contain the full parameterization of any iterable; otherwise parameterizations over 32 characters will be replaced by their hash (possible values: true and false; default value: true)
poll_sleep_duration
: This controls how long the job submission loop will sleep between submitting all pending jobs and checking for job completion. To be nice to cluster schedulers, the default is set to 2 seconds.
xvfb_max_wait
: Maximum time (in seconds) to wait for Xvfb to start, if the _redirect_x parameter of an Interface is True.
crashfile_format
: This option controls the file type of any crashfile generated. Pklz crashfiles allow interactive debugging and rerunning of nodes, while text crashfiles allow portability across machines and shorter load time (possible values: pklz and txt; default value: pklz)
Monitoring options:
enabled
: Enables monitoring of resource usage (possible values: true and false; default value: false). All the following options are ignored if the resource monitor is not enabled.
sample_frequency
: Sampling period (in seconds) between measurements of the resources (memory, cpus) being used by an interface (default value: 1)
summary_file
: Indicates where the summary file collecting all profiling information from the resource monitor should be stored after execution of a workflow. The summary_file does not apply to interfaces run independently. (unset by default, in which case the summary file will be written out to <base_dir>/resource_monitor.json of the top-level workflow)
summary_append
: Append to an existing summary file (only applies to workflows) (possible values: true or false; default value: true)
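In recent nipype versions the config object also exposes a helper to switch the resource monitor on programmatically; a minimal sketch, assuming config.enable_resource_monitor() is available in your release:
from nipype import config
# Equivalent to setting enabled = true in the [monitoring] section
config.enable_resource_monitor()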
A nipype.cfg file using these options might look like this:
[logging]
workflow_level = DEBUG
[execution]
stop_on_first_crash = true
hash_method = timestamp
display_variable = :1
[monitoring]
enabled = false
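The same options can be read back and changed from Python through nipype's config object, which wraps the parsed .cfg file; a minimal sketch:
from nipype import config
# Read a single option (returned as a string)
print(config.get('execution', 'hash_method'))
# Change an option at runtime
config.set('execution', 'stop_on_first_crash', 'true')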
The Workflow.config property has the form of a nested dictionary reflecting the structure of the .cfg file:
from nipype import Workflow
myworkflow = Workflow(name='myworkflow')
myworkflow.config['execution'] = {'stop_on_first_rerun': 'True',
                                  'hash_method': 'timestamp'}
You can also directly set global config options in your workflow script, as shown below. Note that this needs to be done before you import the pipeline or the logger; otherwise the logging level will not be reset.
from nipype import config
cfg = dict(logging=dict(workflow_level='DEBUG'),
           execution={'stop_on_first_crash': False,
                      'hash_method': 'content'})
config.update_config(cfg)
By default, logging to file is disabled. You can enable it and write the file to a location of your choice, as in the example below.
import os
from nipype import config, logging
config.update_config({'logging': {'log_directory': os.getcwd(),
                                  'log_to_file': True}})
logging.update_logging(config)
The logging.update_logging(config) line is necessary to apply changes to logging behavior, such as the output directory or the logging level.
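To verify the effect, look for the log file in the chosen log_directory; nipype conventionally names it pypeline.log (the file name is an assumption here):
import os
# Print the tail of the log written by the example above (file name assumed)
log_path = os.path.join(os.getcwd(), 'pypeline.log')
if os.path.exists(log_path):
    with open(log_path) as f:
        print(f.read()[-2000:])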
To enable debug mode, one can insert the following lines:
from nipype import config
config.enable_debug_mode()
In this mode the following variables are set:
config.set('execution', 'stop_on_first_crash', 'true')
config.set('execution', 'remove_unnecessary_outputs', 'false')
config.set('execution', 'keep_inputs', 'true')
config.set('logging', 'workflow_level', 'DEBUG')
config.set('logging', 'interface_level', 'DEBUG')
config.set('logging', 'utils_level', 'DEBUG')
The primary loggers (workflow, interface, and utils) are also reset to level DEBUG.
You may wish to adjust these manually using:
from nipype import logging
logging.getLogger(<logger>).setLevel(<level>)
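For example, to make only the workflow logger more verbose. In current nipype versions the loggers are named 'nipype.workflow', 'nipype.interface', and 'nipype.utils'; if you run an older release, treat these names as an assumption:
from nipype import logging
# Raise only the workflow logger to DEBUG; the other loggers keep their levels
logging.getLogger('nipype.workflow').setLevel('DEBUG')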
The configuration options can be changed globally (i.e. for all workflows), for just a workflow, or for just a node. The implementations look as follows (note that you should first create the directories if you want to change crashdump_dir and log_directory):
At the global level:
from nipype import config, logging
import os
os.makedirs('/output/log_folder', exist_ok=True)
os.makedirs('/output/crash_folder', exist_ok=True)
config_dict = {'execution': {'remove_unnecessary_outputs': 'true',
                             'keep_inputs': 'false',
                             'poll_sleep_duration': '60',
                             'stop_on_first_rerun': 'false',
                             'hash_method': 'timestamp',
                             'local_hash_check': 'true',
                             'create_report': 'true',
                             'crashdump_dir': '/output/crash_folder',
                             'use_relative_paths': 'false',
                             'job_finished_timeout': '5'},
               'logging': {'workflow_level': 'INFO',
                           'filemanip_level': 'INFO',
                           'interface_level': 'INFO',
                           'log_directory': '/output/log_folder',
                           'log_to_file': 'true'}}
config.update_config(config_dict)
logging.update_logging(config)
At the workflow level:
from nipype import Workflow
wf = Workflow(name="config_test")
# Change execution parameters
wf.config['execution']['stop_on_first_crash'] = 'true'
# Change logging parameters
wf.config['logging'] = {'workflow_level': 'DEBUG',
                        'filemanip_level': 'DEBUG',
                        'interface_level': 'DEBUG',
                        'log_to_file': 'True',
                        'log_directory': '/output/log_folder'}
At the node level:
from nipype import Node
from nipype.interfaces.fsl import BET
bet = Node(BET(), name="config_test")
bet.config = {'execution': {'keep_unnecessary_outputs': 'false'}}
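Finally, the plugin option listed above is most often passed at run time rather than set in the config file. A minimal sketch for the workflow wf defined above (the n_procs value is an arbitrary choice for illustration):
# Execute with the MultiProc plugin; plugin_args is forwarded to the plugin
wf.run(plugin='MultiProc', plugin_args={'n_procs': 4})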