Processing Steps Submodule
This module contains the classes that represent the individual processing steps, along with the respective base classes that can be used to implement custom processing steps (see PipelineStepBase) and access modifier wrapper steps (see GroupToApplyToSelectedStepBase).
The individual processing steps are the building blocks of the pipeline, which is defined by a sequence of
processing steps (in addition to the input callable/iterable, see the inputs sub-module).
- class accvlab.dali_pipeline_framework.processing_steps.PipelineStepBase[source]
Bases: ABC
Base class for pipeline processing steps.
Pipeline processing steps are the building blocks of the pipeline and represent individual operations applied to input data in sequence to produce outputs.
Provides the common interface and common functionality shared by all processing steps:
- Checking the input data format for compatibility and setting the output data format (blueprint) (see check_input_data_format_and_set_output_data_format()).
- Applying the step via __call__(). This:
  - Invokes _process() to perform the actual processing.
  - Validates the resulting data format against a reference blueprint to ensure that the resulting format is "as advertised", i.e. as obtained by independent calls to check_input_data_format_and_set_output_data_format(). Note that this check is performed at DALI graph construction time and therefore does not affect runtime during training.
- Support for operating on sub-trees of input data (through specialized wrapper steps, see GroupToApplyToSelectedStepBase).
Consistent & Independent Data Processing
Many of the included processing steps can be configured to operate on more than one field in the input SampleDataGroup object. For some steps (e.g. those which apply random transformations), the question arises whether these steps should apply consistent processing across all fields they process (e.g. the same augmentation transformation for all images), or whether the processing should happen independently for different fields (e.g. different transformations for different images). The answer to this question depends on the use-case.
By default, the processing steps are designed to apply consistent processing. For example, AffineTransformer applies the same spatial transform to all processed images, as well as to corresponding fields such as point sets defined on the image or projection matrices. This ensures that:
- Consistent randomization is possible if needed (e.g., between an image, a corresponding segmentation mask, projection matrix, and points defined on the image).
- No correspondences between multiple fields need to be explicitly maintained. For example, if multiple images and projection matrices are present, there is no need to know which projection matrix corresponds to which image, as the same transformation is applied to all of them. This is useful when processing multiple fields which are related to one another.
To ensure that independent processing (e.g. different randomizations) can be applied to different parts of the data (e.g., different randomizations for data from different cameras), sub-classes of GroupToApplyToSelectedStepBase can be used to select one or more parts (sub-trees) of the input data to process independently of each other. The selection of sub-trees also makes it possible to establish field correspondences (e.g., process the image and projection matrix from one camera consistently) in a natural way, i.e. by grouping all related fields in one sub-tree (e.g. one sub-tree per camera).
The available wrappers include DataGroupInPathAppliedStep, DataGroupsWithNameAppliedStep, DataGroupArrayInPathElementsAppliedStep, and DataGroupArrayWithNameElementsAppliedStep. Please see the documentation of these classes for more details. If necessary, new wrappers can be added by subclassing GroupToApplyToSelectedStepBase.
Having both options (consistent or different randomizations for different parts of the data) available, as well as the ability to group related data (e.g. all images and projection matrices for one camera), allows for a flexible pipeline design which can be tailored to the specific use-case by configuration, as sketched below.
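A minimal sketch of both configurations (field and path names such as "image", "intrinsics", and "cameras" are hypothetical):

from accvlab.dali_pipeline_framework.processing_steps import (
    AffineTransformer,
    DataGroupArrayInPathElementsAppliedStep,
)

# Consistent processing: the step sees the full tree and applies the same
# random transform to every matched image / projection matrix.
step = AffineTransformer(
    output_hw=[512, 960],
    resizing_mode=AffineTransformer.ResizingMode.PAD,
    resizing_anchor=AffineTransformer.ResizingAnchor.CENTER,
    image_field_names="image",
    projection_matrix_field_names="intrinsics",
)

# Independent processing: the wrapper forwards one camera sub-tree at a time,
# so each camera gets its own randomization, while the image and intrinsics
# within one camera remain consistent.
per_camera_step = DataGroupArrayInPathElementsAppliedStep(
    step, path_to_array_to_apply_to="cameras"
)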
- abstract _check_and_adjust_data_format_input_to_output(data_empty)[source]
Check the input data format for compatibility and return the output data format (blueprint).
If the input data format is incompatible, raise an exception describing the problem.
Please see check_input_data_format_and_set_output_data_format() for a description of typical checks and format changes that need to be performed here.
This method may or may not modify data_empty directly, but in any case has to return an object representing the modified format (i.e., either the modified data_empty or a new object).
Note
Override this method in each (non-abstract) derived class to define the actual functionality.
This method is called by check_input_data_format_and_set_output_data_format() and should not be called directly.
- Parameters:
data_empty (SampleDataGroup) – Input data format (blueprint).
- Returns:
SampleDataGroup – Resulting data format (blueprint).
- abstract _process(data)[source]
Apply the processing step to the input, or to a selected sub-tree when wrapped accordingly.
Individual processing steps need to override this method and implement the actual functionality.
The method may mutate the input data; callers must not rely on the input remaining unchanged or corresponding to the output after the call.
Note
Override this method in each (non-abstract) derived class to define the actual functionality.
This method is called by __call__() and should not be called directly.
- Parameters:
data (SampleDataGroup) – Data to be processed by the step.
- Returns:
SampleDataGroup – Resulting processed data.
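To make the two override points concrete, here is a minimal sketch of a custom step that casts an image field to FLOAT. The SampleDataGroup accessors used below (has_field, set_field_type, get_field, set_field) are hypothetical placeholders for the real API, as is the no-argument base constructor; the two overridden methods reflect the documented interface, and fn.cast is a standard DALI operator.

from nvidia.dali import fn, types

from accvlab.dali_pipeline_framework.processing_steps import PipelineStepBase


class ImageToFloatCaster(PipelineStepBase):
    """Sketch of a custom step: cast a named image field to FLOAT."""

    def __init__(self, image_name):
        super().__init__()  # assuming a no-argument base constructor
        self._image_name = image_name

    def _check_and_adjust_data_format_input_to_output(self, data_empty):
        # Verify the expected field is present, then advertise the new dtype
        # in the returned blueprint (accessors are hypothetical).
        if not data_empty.has_field(self._image_name):
            raise ValueError(f"Missing image field: {self._image_name}")
        data_empty.set_field_type(self._image_name, types.DALIDataType.FLOAT)
        return data_empty

    def _process(self, data):
        # Build the actual DALI graph node for the cast.
        image = data.get_field(self._image_name)
        data.set_field(self._image_name, fn.cast(image, dtype=types.DALIDataType.FLOAT))
        return data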
- __call__(data)[source]
Apply the processing step and validate its output format.
Important
To define the actual functionality of a processing step, override _process(), not this method.
- Parameters:
data (SampleDataGroup) – Input data to process.
- Returns:
SampleDataGroup – Processed output data.
- check_input_data_format_and_set_output_data_format(data_empty)[source]
Check the input data format for compatibility and return the output data format (blueprint).
Compatibility typically means that expected data fields are present and types are compatible, and that the output data fields can be added (i.e., are not already present). Typical changes to the data format include additions/removals of fields or changes to data types (e.g., an image may change from types.DALIDataType.UINT8 to types.DALIDataType.FLOAT in a normalization step).
This method does not modify data_empty in place; it returns a new SampleDataGroup describing the modified format.
If the input data format is incompatible, an exception is raised.
Important
To define the actual functionality of the check, override _check_and_adjust_data_format_input_to_output(), not this method.
- Parameters:
data_empty (SampleDataGroup) – Input data format (blueprint).
- Returns:
SampleDataGroup – Resulting data format (blueprint).
- class accvlab.dali_pipeline_framework.processing_steps.GroupToApplyToSelectedStepBase(processing_step_to_apply)[source]
Bases: PipelineStepBase
Base class for wrappers that apply a contained processing step to selected parts (sub-trees) of the input.
The wrapper forwards only the selected parts (sub-tree(s)) to the contained step, which then operates as if the sub-tree were the full input. If multiple sub-trees are selected (e.g. each sub-tree corresponding to data of one step in time out of a sequence), the contained step is called multiple times, executing independently for each sub-tree. If joint processing is required, design the contained step to consume the full tree (or a larger sub-tree) instead of using a wrapper.
- Parameters:
processing_step_to_apply – Processing step to apply to the selected sub-trees.
Important
Ensure that the constructor of this class is called by any derived class.
- abstract _check_and_get_paths_to_apply_to(data)[source]
Check input and return paths to all sub-trees to process.
Requirements on the input include that at least one sub-path is found and that the paths match the expected type (e.g., array data group fields when iterating over elements). See SampleDataGroup for what constitutes an array and how to check whether a field is an array.
If the requirements are not satisfied, an error shall be raised.
Note
Override this method in each (non-abstract) derived class to define the actual selection of sub-trees to process. Note that this is the only method that needs to be overridden; it is used by the other methods of this class, which perform the actual processing.
- class accvlab.dali_pipeline_framework.processing_steps.DataGroupInPathAppliedStep(processing_step_to_apply, path_to_apply_to)[source]
Bases: GroupToApplyToSelectedStepBase
Apply a contained processing step to the sub-tree rooted at a given path.
- Parameters:
processing_step_to_apply – The contained processing step
path_to_apply_to – Path to the root of the sub-tree to apply processing_step_to_apply to
- class accvlab.dali_pipeline_framework.processing_steps.DataGroupsWithNameAppliedStep(processing_step_to_apply, names_of_groups_to_apply_to, check_minimum_one_name_match=True)[source]
Bases: GroupToApplyToSelectedStepBase
Apply a contained processing step to all sub-trees whose root is a data group field with a given name.
The name is defined at construction; all matching data group fields are located and the contained step is applied to each corresponding sub-tree.
- Parameters:
processing_step_to_apply – Contained processing step to apply.
names_of_groups_to_apply_to – Name or list of names of data group fields to select as sub-tree roots.
check_minimum_one_name_match (default: True) – If True, at least one field must be found for each provided name; otherwise an error is raised when checking the input.
- class accvlab.dali_pipeline_framework.processing_steps.DataGroupArrayInPathElementsAppliedStep(processing_step_to_apply, path_to_array_to_apply_to)[source]
Bases: DataGroupInPathAppliedStep
Apply a contained processing step independently to each element of an array data group field at a given path.
The path of the array data group field is defined at construction. Each element of that array is processed independently by the contained step.
- Parameters:
processing_step_to_apply – Contained processing step to apply.
path_to_array_to_apply_to – Path to the array data group field whose children should be processed.
- class accvlab.dali_pipeline_framework.processing_steps.DataGroupArrayWithNameElementsAppliedStep(processing_step_to_apply, name_of_arrays_to_apply_to, check_minimum_one_name_match=True)[source]
Bases: DataGroupsWithNameAppliedStep
Apply a contained processing step independently to each element of all array data group fields with a given name.
The name is defined at construction. All fields with that name must be array data group fields (see SampleDataGroup). Each element of each found array is processed independently by the contained step.
- Parameters:
processing_step_to_apply (PipelineStepBase) – Contained processing step to apply.
name_of_arrays_to_apply_to (Union[str, int]) – Name of the array data group fields whose elements should be processed.
check_minimum_one_name_match (default: True) – If True, at least one array must be found; otherwise an error is raised when checking the input.
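A sketch of the four wrapper variants around a simple contained step (in practice you would pick one variant; field and path names are hypothetical):

from accvlab.dali_pipeline_framework.processing_steps import (
    DataGroupArrayInPathElementsAppliedStep,
    DataGroupArrayWithNameElementsAppliedStep,
    DataGroupInPathAppliedStep,
    DataGroupsWithNameAppliedStep,
    ImageRange01Normalizer,
)

contained = ImageRange01Normalizer(image_name="image")

# Sub-tree rooted at one fixed path:
by_path = DataGroupInPathAppliedStep(contained, path_to_apply_to="camera_front")

# Every data group field named "camera", wherever it occurs:
by_name = DataGroupsWithNameAppliedStep(contained, names_of_groups_to_apply_to="camera")

# Each element of the array data group field at a fixed path, independently:
array_by_path = DataGroupArrayInPathElementsAppliedStep(contained, path_to_array_to_apply_to="cameras")

# Each element of every array data group field named "cameras", independently:
array_by_name = DataGroupArrayWithNameElementsAppliedStep(contained, name_of_arrays_to_apply_to="cameras")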
- class accvlab.dali_pipeline_framework.processing_steps.ImageDecoder(image_name, use_device_mixed, hw_decoder_load=0.65, as_bgr=False)[source]
Bases: PipelineStepBase
Decode images.
- Behavior:
Finds all images by name, decodes them (to RGB or BGR), and replaces the encoded image data with the decoded version in place.
Image search happens at DALI graph construction time; only the actual decoding operator is part of the DALI graph. This means that the runtime performance is not affected by the search for images.
- Parameters:
image_name (str) – Name of the image data field(s) to decode.
use_device_mixed (bool) – If True, decoding is partially performed on the GPU and the resulting images are located in GPU memory. If False, only the CPU is used.
hw_decoder_load (float, default: 0.65) – If use_device_mixed is True, this parameter sets the fraction of the workload to be performed by the decoding hardware (as opposed to software CUDA kernels).
as_bgr (bool, default: False) – Whether to output BGR images (instead of RGB images).
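A construction sketch (the field name is hypothetical):

from accvlab.dali_pipeline_framework.processing_steps import ImageDecoder

decoder = ImageDecoder(
    image_name="image",     # decode every field named "image"
    use_device_mixed=True,  # hybrid CPU/GPU decoding; outputs land in GPU memory
    hw_decoder_load=0.65,   # fraction of work for the hardware decoder
    as_bgr=False,           # output RGB
)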
- class accvlab.dali_pipeline_framework.processing_steps.ImageToTileSizePadder(image_name, tile_size_to_pad_to)[source]
Bases: PipelineStepBase
Pad images so that height and width are multiples of a given tile size.
The image is padded with zeros and the image size field is updated to the padded size.
- class accvlab.dali_pipeline_framework.processing_steps.ImageRange01Normalizer(image_name)[source]
Bases: PipelineStepBase
Convert RGB or BGR images from UINT8 to FLOAT and scale to [0.0, 1.0].
Each matching image is cast to types.DALIDataType.FLOAT and divided by 255.0 per channel.
- class accvlab.dali_pipeline_framework.processing_steps.ImageMeanStdDevNormalizer(image_name, mean, std_dev, output_type=<DALIDataType.FLOAT: 9>)[source]
Bases: PipelineStepBase
Normalize RGB or BGR images by mean and standard deviation, using pre-defined mean and standard deviation values.
Normalization subtracts the mean and divides by the standard deviation per channel over the spatial axes. Scalars broadcast to all channels; for 3-vectors, each element corresponds to a channel. No distinction between RGB and BGR is made, so the mean and standard deviation values must be provided in the channel order corresponding to the image format.
Note
The mean and standard deviation values need to be provided on construction. They are not computed from the images at runtime.
- Parameters:
image_name (Union[str, int]) – Name of the image data fields to normalize.
mean (Union[Sequence[float], float]) – Mean value used as the basis for the normalization. Can be a single value (applied to all color channels) or a vector containing the values for all channels.
std_dev (Union[Sequence[float], float]) – Standard deviation used as the basis for the normalization. Can be a single value (applied to all color channels) or a vector containing the values for all channels.
output_type (DALIDataType, default: DALIDataType.FLOAT) – Data type of the output image. Default is types.DALIDataType.FLOAT (i.e. 32-bit float).
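A construction sketch; the mean and standard-deviation values below are the common ImageNet statistics in RGB order and 0-255 range, used purely as an illustration:

from accvlab.dali_pipeline_framework.processing_steps import ImageMeanStdDevNormalizer

normalizer = ImageMeanStdDevNormalizer(
    image_name="image",
    mean=[123.675, 116.28, 103.53],   # per-channel means, RGB order
    std_dev=[58.395, 57.12, 57.375],  # per-channel standard deviations
)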
- class accvlab.dali_pipeline_framework.processing_steps.PhotoMetricDistorter(image_name, min_max_brightness, min_max_hue, min_max_contrast, min_max_saturation, prob_brightness_aug=0.5, prob_hue_aug=0.5, prob_contrast_aug=0.5, prob_saturation_aug=0.5, prob_swap_channels=0.5, is_bgr=False, enforce_process_on_gpu=True)[source]
Bases: PipelineStepBase
Apply photometric augmentations to images (brightness, contrast, saturation, hue, channel swap).
The same random decisions and parametrizations for each augmentation are shared across all matched images to keep them consistent (e.g., across multi-view inputs).
- Parameters:
image_name (Union[str, int]) – Name of the image data fields to augment.
min_max_brightness (Sequence[float]) – Minimum and maximum biases to apply to the brightness. Note that as the image may be in different ranges ([0; 1] for float images, [0; 255] for uint8 images), the values provided here are expected to be in the corresponding range.
min_max_hue (Sequence[float]) – Minimum and maximum change in hue (degrees).
min_max_contrast (Sequence[float]) – Minimum and maximum contrast factor (multiplicative).
min_max_saturation (Sequence[float]) – Minimum and maximum saturation factor (multiplicative in HSV space).
prob_brightness_aug (float, default: 0.5) – Probability to apply brightness augmentation.
prob_hue_aug (float, default: 0.5) – Probability to apply hue change augmentation.
prob_contrast_aug (float, default: 0.5) – Probability to apply contrast augmentation.
prob_saturation_aug (float, default: 0.5) – Probability to apply saturation augmentation.
prob_swap_channels (float, default: 0.5) – Probability to randomly permute the color channels.
is_bgr (bool, default: False) – Whether the image is in BGR format (RGB otherwise).
enforce_process_on_gpu (bool, default: True) – Whether to enforce the augmentation to happen on the GPU, even if the input image is stored on the CPU.
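A construction sketch with illustrative ranges, assuming uint8 input images (so the brightness bias is given in the [0; 255] range):

from accvlab.dali_pipeline_framework.processing_steps import PhotoMetricDistorter

distorter = PhotoMetricDistorter(
    image_name="image",
    min_max_brightness=[-32.0, 32.0],  # bias in the uint8 value range
    min_max_hue=[-18.0, 18.0],         # degrees
    min_max_contrast=[0.5, 1.5],       # multiplicative factor
    min_max_saturation=[0.5, 1.5],     # multiplicative factor in HSV space
)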
- class accvlab.dali_pipeline_framework.processing_steps.AffineTransformer(output_hw, resizing_mode, resizing_anchor=None, image_field_names=None, image_hw_field_names=None, projection_matrix_field_names=None, point_field_names=None, transformation_steps=None, transform_image_on_gpu=True)[source]
Bases: PipelineStepBase
Apply affine augmentations (translation, scaling, rotation, shearing) to images, and update the associated geometry (points, projection matrices) consistently.
This step can process one or multiple images, as well as point sets and projection matrices. It expects image data fields and sibling image-size fields in the input (see SampleDataGroup). Optionally, names of point-set and projection-matrix fields can be provided. Multiple instances may be present; all matching occurrences are processed. If multiple images are found, each must have a sibling size field, and the sizes must match.
The same transformation is applied to all matched images. If different images require different transformations, create multiple instances of this step and apply them to different sub-trees (see GroupToApplyToSelectedStepBase).
Projection geometry represented as intrinsics and extrinsics should be handled by passing only the intrinsics matrix to this step; extrinsics are unaffected by an image-plane affine transform. Note that apart from true projection matrices, any matrix that transforms points from a different coordinate system into the image coordinate system can be handled.
The affine transform conceptually moves image content within a fixed viewport. For example, a translation to the right shifts the content rightward and exposes a border on the left. Scaling does not change the viewport size (pixel resolution), so upscaling reveals only the center region, while downscaling fills only part of the viewport.
After augmentation, a resize to the requested output resolution is applied if needed. When aspect ratios differ, the adjustment is controlled by AffineTransformer.ResizingMode and AffineTransformer.ResizingAnchor. Note that this resizing is independent of the affine transformation (where scaling leaves the viewport unchanged), and can be used to change the resolution and aspect ratio of the image.
The overall transform is built as a chain of steps (see AffineTransformer.TransformationStep and its subclasses). AffineTransformer.Selection allows probabilistic branching. Some steps depend on alignments and cannot follow incompatible steps (e.g., rotation or shearing); these constraints are validated at construction and consider incompatible steps anywhere in the chain before the step (including potentially applied probabilistic branches). All steps that require a reference point (e.g., rotation, scaling) use the viewport center.
The composed augmentation and resize are combined into a single image resampling step, which is advantageous both for the quality of the final image and for runtime.
- Parameters:
output_hw (Sequence[int]) – Output resolution [height, width]. The input image is resized to this size.
resizing_mode (ResizingMode) – How to resolve aspect-ratio differences. See AffineTransformer.ResizingMode.
resizing_anchor (Optional[ResizingAnchor], default: None) – Anchor to use when resizing_mode is not STRETCH. See AffineTransformer.ResizingAnchor. Must be None when resizing_mode is STRETCH and set otherwise.
image_field_names (Union[str, int, List[Union[str, int]], Tuple[Union[str, int], ...], None], default: None) – Names of image fields to transform (see SampleDataGroup). Set to None to not process images (e.g., to process only projection matrices or point sets). Cannot be set if image_hw_field_names is set.
image_hw_field_names (Union[str, int, List[Union[str, int]], Tuple[Union[str, int], ...], None], default: None) – Names of the fields containing the image size [height, width]. All listed fields must have identical values; if not, call this step separately per image (e.g., by name or by selecting a sub-tree, see GroupToApplyToSelectedStepBase). Cannot be set if image_field_names is set. One of image_field_names or image_hw_field_names must be provided (single source of truth for the image size).
projection_matrix_field_names (Union[str, int, List[Union[str, int]], Tuple[Union[str, int], ...], None], default: None) – Names of fields with projection matrices that map to pixel coordinates. These matrices are updated to project correctly into the output image. Set to None to skip. If the projection geometry is represented by extrinsics and intrinsics, only pass the intrinsics here; extrinsics are unaffected by an image-plane affine transform. Apart from true projection matrices, any matrix that transforms points from a different coordinate system into the image coordinate system can be handled.
point_field_names (Union[str, int, List[Union[str, int]], Tuple[Union[str, int], ...], None], default: None) – Names of fields containing 2D point sets (e.g., landmarks). Points are transformed to remain consistent with the output images. Points are expected as rows; a row may contain multiple points, in which case consecutive pairs are treated as individual points and stored in the same format (e.g. [x1, y1, x2, y2]).
transformation_steps (Optional[Sequence[TransformationStep]], default: None) – Sequence of steps to perform. If None, only resizing to the output resolution and handling of a changed aspect ratio is performed (no augmentation).
transform_image_on_gpu (bool, default: True) – Whether to transform images on the GPU. Must be True if images are already on the GPU.
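A resize-only sketch (no augmentation steps); the field names are hypothetical:

from accvlab.dali_pipeline_framework.processing_steps import AffineTransformer

resizer = AffineTransformer(
    output_hw=[512, 960],
    resizing_mode=AffineTransformer.ResizingMode.PAD,
    resizing_anchor=AffineTransformer.ResizingAnchor.CENTER,
    image_field_names="image",
    projection_matrix_field_names="intrinsics",
    point_field_names="landmarks",
    # transformation_steps=None: resizing & aspect-ratio handling only
)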
- class TransformationStep(prob)[source]
Bases: ABC
Step used to build up the overall affine transformation to apply. Each step is processed in sequence and applied with a given probability.
Probabilistic branching is possible by using AffineTransformer.Selection (also see the documentation for that step).
- Parameters:
prob (float) – Probability with which this step is applied.
- class Translation(prob, min_xy, max_xy=None)[source]
Bases: TransformationStep
Perform a randomized translation (in a given range).
- class ShiftInsideOriginalImage(prob, shift_x, shift_y)[source]
Bases: TransformationStep
Perform a random translation. The shift is selected so that the viewport is filled with the image.
This is only possible if the image is larger than (i.e. previously scaled up) or equal in size to the viewport. If this is not the case, this step does nothing.
The shift is computed and performed independently for the x- and y-directions. This means that if the image is larger than the viewport in one dimension and smaller in the other (e.g. due to non-uniform scaling), this step is performed only in the dimension where the image is larger than the viewport.
Also, if the image is larger than the viewport, this step will bring back the image to cover the whole viewport if it was previously moved out of it.
This step cannot be performed if a rotation and/or shearing was potentially performed before.
- class ShiftToAlignWithOriginalImageBorder(prob, border)[source]
Bases: TransformationStep
Translate the image so that it is aligned with a border of the viewport.
The border to align to can be selected on construction.
This step cannot be performed if a rotation and/or shearing was potentially performed before.
- Parameters:
prob (float) – Probability to perform the step.
border (ShiftToAlignWithOriginalImageBorder) – Border of the viewport to align the image to.
- class Rotation(prob, min_rot, max_rot=None)[source]
Bases: TransformationStep
Perform a rotation.
- class UniformScaling(prob, min_scaling, max_scaling=None)[source]
Bases: TransformationStep
Perform uniform scaling (i.e. an identical scaling factor in the x- and y-dimensions).
- class NonUniformScaling(prob, min_scaling_xy, max_scaling_xy=None)[source]
Bases: TransformationStep
Perform non-uniform scaling (i.e. the scaling factors in the x- and y-dimensions are independent).
- Parameters:
prob (float) – Probability to perform the step.
min_scaling_xy (Sequence[float]) – Minimum scaling factors for the x- and y-dimensions. If max_scaling_xy is not set, these factors are always applied instead of selecting random factors from the range.
max_scaling_xy (Optional[Sequence[float]], default: None) – Maximum scaling factors for the x- and y-dimensions.
- class Shearing(prob, min_shearing_xy, max_shearing_xy=None)[source]
Bases: TransformationStep
Perform shearing.
- Parameters:
prob (float) – Probability to perform the step.
min_shearing_xy (Sequence[float]) – Minimum shearing parameters for the x- and y-dimensions. If max_shearing_xy is not set, these parameters are always applied instead of selecting random parameters from the range.
max_shearing_xy (Optional[Sequence[float]], default: None) – Maximum shearing parameters.
- class Selection(prob, option_probs, options)[source]
Bases: TransformationStep
Probabilistically choose one sequence of steps out of multiple alternatives and perform the steps in that sequence.
- Parameters:
prob (float) – Probability to perform this step.
option_probs (Sequence[float]) – Probabilities of the individual options. Has to sum up to 1, as one option is always taken.
options (Sequence[Union[List[TransformationStep], Tuple[TransformationStep, ...], TransformationStep]]) – The individual options. Each option is a sequence of transformation steps or a single step.
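A sketch of an augmenting configuration built from the steps above; the numeric ranges are purely illustrative, and the rotation range is given in whatever unit Rotation expects:

from accvlab.dali_pipeline_framework.processing_steps import AffineTransformer

T = AffineTransformer  # shorthand for the nested step classes

augmenter = T(
    output_hw=[512, 960],
    resizing_mode=T.ResizingMode.CROP,
    resizing_anchor=T.ResizingAnchor.CENTER,
    image_field_names="image",
    transformation_steps=[
        # With probability 0.8, pick mild (70%) or strong (30%) scaling;
        # the option probabilities must sum to 1.
        T.Selection(
            prob=0.8,
            option_probs=[0.7, 0.3],
            options=[
                T.UniformScaling(prob=1.0, min_scaling=0.9, max_scaling=1.1),
                T.UniformScaling(prob=1.0, min_scaling=1.1, max_scaling=1.4),
            ],
        ),
        T.Translation(prob=0.5, min_xy=[-20.0, -10.0], max_xy=[20.0, 10.0]),
        T.Rotation(prob=0.3, min_rot=-10.0, max_rot=10.0),
    ],
)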
- class ResizingMode(value)[source]
Bases: Enum
Resizing mode types.
The mode defines how the input viewport is adjusted to the output viewport when the output image shape does not have the same aspect ratio as the input image shape.
Note that as the image may be outside the input viewport due to affine transformations, it may e.g. happen that there is still image data in the padded region of the output viewport. In this case, the image will appear in the padded region and will not be replaced by the fill value.
- STRETCH = 0
Viewport is stretched (i.e. the image is non-uniformly scaled).
- PAD = 1
Viewport is extended to preserve the aspect ratio (i.e. if there are no other transformations, the output image will be padded).
- CROP = 2
Viewport is cropped (i.e. if there are no other transformations, parts of the input image will be cropped away).
- class ResizingAnchor(value)[source]
Bases: Enum
Resizing mode anchor.
The anchor defines which reference point in the output image is aligned to the corresponding point in the input image when adjusting the aspect ratio to match the output image using the PAD or CROP resizing mode.
Important
Note that the anchor is only relevant when changing the aspect ratio of the image. The actual transformations such as scaling, rotation, etc. are not affected by the anchor, and always use the center of the image as reference point.
- CENTER = 0
The center of the output image corresponds to the center of the input image
- TOP_OR_LEFT = 1
The top left corner of the output image corresponds to the top left corner of the input image. Depending on which direction is padded / cropped, this corresponds to either keeping the top or the left border aligned.
- BOTTOM_OR_RIGHT = 2
The bottom right corner of the output image corresponds to the bottom right corner of the input image. Depending on which direction is padded / cropped, this corresponds to either keeping the bottom or the right border aligned.
- class accvlab.dali_pipeline_framework.processing_steps.CoordinateCropper(points_fields_name, minimum_point, maximum_point)[source]
Bases: PipelineStepBase
Crop points to a given axis-aligned box.
- Parameters:
points_fields_name (str) – Name of the data field containing the points to crop. If multiple fields with that name are present, each is processed independently.
minimum_point (Sequence[float]) – Lower corner (minimum per dimension) of the crop box.
maximum_point (Sequence[float]) – Upper corner (maximum per dimension) of the crop box.
- class accvlab.dali_pipeline_framework.processing_steps.PaddingToUniform(field_names=None, fill_value=0.0)[source]
Bases: PipelineStepBase
Processing step for padding all data fields in the processed data to have the same shape across the batch.
Padding can be performed either for all data fields, or only for fields with given names.
Note
To pad all fields in a given part (sub-tree) of the input data structure, use the access modifier wrapper steps (see GroupToApplyToSelectedStepBase and its subclasses).
- Parameters:
field_names (Union[str, int, List[Union[str, int]], Tuple[Union[str, int], ...], None], default: None) – Names of the fields to apply padding to. Can be either a single name or a list of names. All fields with those names are processed. If set to None, padding is performed for all data fields. Fields can be either data fields or data field arrays.
fill_value (Union[int, float], default: 0.0) – Value to insert into the padded region.
- class accvlab.dali_pipeline_framework.processing_steps.AxesLayoutSetter(names_fields_to_set, layout_to_set)[source]
Bases: PipelineStepBase
Set the DALI axes layout string (e.g., “HWC”, “CHW”) for the selected fields.
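A short sketch of both steps; the field names are hypothetical, and it is assumed here that AxesLayoutSetter accepts a list of field names:

from accvlab.dali_pipeline_framework.processing_steps import (
    AxesLayoutSetter,
    PaddingToUniform,
)

# Pad per-object fields to the same shape across the batch, marking the
# padded region with -1.0.
padder = PaddingToUniform(field_names=["bboxes", "categories"], fill_value=-1.0)

# Declare the axes layout of the image fields for downstream DALI operators.
layout_setter = AxesLayoutSetter(names_fields_to_set=["image"], layout_to_set="HWC")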
- class accvlab.dali_pipeline_framework.processing_steps.BoundingBoxToHeatmapConverter(annotation_field_name, bboxes_in_name, heatmap_out_name, heatmap_hw, image_field_name=None, image_hw_field_name=None, categories_in_name=None, num_categories=None, min_object_size=None, per_category_min_object_sizes=None, use_per_category_heatmap=True, is_valid_opt_in_name=None, center_opt_in_name=None, is_active_opt_out_name=None, center_opt_out_name=None, center_offset_opt_out_name=None, height_width_bboxes_heatmap_opt_out_name=None, bboxes_heatmap_opt_out_name=None, min_fraction_area_clipping=0.25, min_radius=0.5, max_radius=10.0, radius_scaling_factor=0.8, radius_to_sigma_factor=0.3333333333333333)[source]
Bases: PipelineStepBase
Convert 2D object bounding box annotations into Gaussian heatmaps.
This step can process data from one or multiple cameras. It expects sibling fields in the input SampleDataGroup: an image-size field and an annotation field containing bounding boxes (and optionally categories & bounding box centers). Multiple occurrences are supported; each is processed independently (see the constructor for details).
Note
The input bounding boxes (and centers, if provided) are clipped to the image size, and the corresponding output fields refer to the clipped bounding boxes (scaled to the heatmap resolution).
The following fields can be added inside each processed annotation. Note that apart from the heatmap, all fields are optional and can be omitted if not needed:
heatmap: Heatmap at the specified resolution. If per-category mode is enabled, the shape is [num_categories, H, W]; otherwise [H, W]. The data type is FLOAT.
is_active: Boolean mask containing per-object flags indicating whether the object contributes to the heatmap (after clipping and threshold checks). Inactive objects were not drawn. Note that inactive objects are still contained in the other output fields.
center: Integer pixel center per object in heatmap coordinates (full-pixel location of the peak).
center_offset: Sub-pixel offset from the integer center to the true center in heatmap coordinates.
height_width_bboxes_heatmap: Per-object [height, width] in heatmap coordinates (after clipping and scaling from image to heatmap).
bboxes_heatmap: Per-object bounding box in heatmap coordinates (after clipping and scaling).
To define the size of the individual Gaussians in the heatmap, the radius of the bounding boxes is used (with additional factors for the radius and the sigma-to-radius conversion of the Gaussians). The radius of the bounding boxes is defined as the distance between the center and the nearest edge of the bounding box. If the center is outside the box, the radius is 0 (and the minimum radius as defined on construction is enforced).
- Parameters:
annotation_field_name (Union[str, int]) – Name of the field containing annotations. Bounding-box related fields are read from here and outputs are added here.
bboxes_in_name (Union[str, int]) – Name of the field containing the bounding boxes.
heatmap_out_name (Union[str, int]) – Name of the output field to write the heatmap to.
heatmap_hw (Tuple[int, int]) – Heatmap size (height, width).
image_field_name (Union[str, int, None], default: None) – Name of the field containing the image from which to extract the size. This field is expected to be a sibling of the annotation field. Only one of image_field_name or image_hw_field_name should be set (single source of truth).
image_hw_field_name (Union[str, int, None], default: None) – Name of the field containing the image height and width. This field is expected to be a sibling of the annotation field. Only one of image_field_name or image_hw_field_name should be set (single source of truth).
categories_in_name (Union[str, int, None], default: None) – Name of the field containing per-object categories. Required if any of the following holds: use_per_category_heatmap is True, per_category_min_object_sizes is not None, or num_categories is not None. Otherwise set to None.
num_categories (Optional[int], default: None) – Number of distinct categories. Objects with category >= num_categories are marked inactive. Set to None when categories are not used.
min_object_size (Optional[Sequence[float]], default: None) – Category-independent minimum object size [height, width] to be included. Must be None when per_category_min_object_sizes is not None.
per_category_min_object_sizes (Optional[Sequence[Sequence[float]]], default: None) – Per-category minimum size [height, width]. Must be None when min_object_size is not None.
use_per_category_heatmap (bool, default: True) – If True, draw a separate heatmap slice per category; otherwise draw a single heatmap.
is_valid_opt_in_name (Union[str, int, None], default: None) – Optional field with per-object validity. Applied in addition to the internal checks to determine whether an object is active. If absent, all objects are treated as valid (internal checks can still mark objects as inactive).
center_opt_in_name (Union[str, int, None], default: None) – Name of the field containing the centers of the bounding boxes. The center defined this way is not necessarily the center of the 2D bounding box and could e.g. be the projection of the center of the 3D bounding box onto the image plane. Optional field. If not present, the centers are assumed to be the centers of the 2D bounding boxes.
is_active_opt_out_name (Union[str, int, None], default: None) – Output field name for the per-object active flag. Optional field; not added if not provided.
center_opt_out_name (Union[str, int, None], default: None) – Output field name for the integer center locations in the heatmap. The sub-pixel offset is written to center_offset_opt_out_name. Optional field; not added if not provided.
center_offset_opt_out_name (Union[str, int, None], default: None) – Output field name for the sub-pixel center offsets in heatmap coordinates. Optional field; not added if not provided.
height_width_bboxes_heatmap_opt_out_name (Union[str, int, None], default: None) – Output field for the per-object [height, width] in the heatmap. Optional field; not added if not provided.
bboxes_heatmap_opt_out_name (Union[str, int, None], default: None) – Output field for the per-object bounding boxes in the heatmap. Optional field; not added if not provided.
min_fraction_area_clipping (float, default: 0.25) – Minimum remaining area fraction after clipping for an object to be considered active. For example, with 0.25, boxes that lose more than 75% of their area due to clipping are set inactive.
min_radius (float, default: 0.5) – Minimum radius used when drawing the Gaussians. The enforced lower bound is 0.5.
max_radius (float, default: 10.0) – Maximum radius used when drawing the Gaussians. Larger radii are clipped to this value.
radius_scaling_factor (float, default: 0.8) – Scaling factor applied to the bbox-derived radius.
radius_to_sigma_factor (float, default: 0.3333333333333333) – Factor to convert the radius to the Gaussian sigma.
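A construction sketch with hypothetical field names, producing a per-category heatmap plus the optional center/offset outputs:

from accvlab.dali_pipeline_framework.processing_steps import BoundingBoxToHeatmapConverter

heatmap_converter = BoundingBoxToHeatmapConverter(
    annotation_field_name="annotations",
    bboxes_in_name="bboxes",
    heatmap_out_name="heatmap",
    heatmap_hw=(128, 240),
    image_hw_field_name="image_hw",  # sibling of the annotation field
    categories_in_name="categories",
    num_categories=10,
    use_per_category_heatmap=True,   # heatmap shape: [10, 128, 240]
    is_active_opt_out_name="is_active",
    center_opt_out_name="center",
    center_offset_opt_out_name="center_offset",
)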
- class accvlab.dali_pipeline_framework.processing_steps.AnnotationElementConditionEval(annotation_field_name, condition, remove_data_fields_used_in_condition)[source]
Bases: PipelineStepBase
Evaluate a declarative condition per annotation element and store the boolean result.
This step looks for data group fields (see the documentation of SampleDataGroup) corresponding to annotations and applies the defined condition to data fields inside the annotation. The results are stored as a new data field inside the annotation. Both the data fields used in the condition and the resulting data field are referenced by name in the condition string.
The used fields are expected to be 1D sequences (one value per object). The condition is evaluated per element, producing a boolean sequence with one result per object.
Note
While 1D sequences are expected, the data may be formatted as 2D tensors. In this case, one dimension needs to have a size of 1.
The condition must start with a variable name, followed by an assignment operator, followed by an expression.
The expression can contain variables (will be mapped to the data fields of the annotation), literals, and operators.
- The supported operators are:
Logical operators: or, and, not
Comparison operators: ==, !=, >, >=, <, <=
Parentheses: ( and )
Unary minus: -; e.g. -_b1 < -10.5 is valid.
Assignment operator: =
- The syntax is similar to Python. However, note that:
Only the operators defined above are supported.
Direct comparisons of more than two values are not supported (e.g. a < b < c is not supported).
Only numeric literals are supported.
True and False are not supported (not needed; use negation instead of comparison to False).
The result of the condition is stored in a new data field, which is added to the annotation. The name of the result data field is also defined inside the condition string.
Example
- The condition can be described in a syntax similar to Python, e.g.:
is_valid = (num_lidar_points >= 1 or num_radar_points >= 1) and visibility_levels > 0 and category > 0
In this case:
The data fields num_lidar_points, num_radar_points, visibility_levels, and category are expected to be children of the annotation data group field.
The result of the condition is stored in a new data field inside the annotation data group field, named is_valid.
Important
In order to use data fields inside the condition, their names must follow the rules of Python variable names (e.g. no spaces, no special characters, do not start with a digit).
See also
Specific complex conditions can be checked with VisibleBboxSelector and PointsInRangeCheck, and the results of these checks can be combined with this step. ConditionalElementRemover can be used to remove elements from the data based on this condition. BoundingBoxToHeatmapConverter has both input and output fields containing boolean masks denoting the active objects.
- Parameters:
annotation_field_name (Union[str, int]) – Name of the annotation data group field. Note that there can be more than one annotation field (e.g. one for the objects visible in each camera). In this case, all of these annotations are processed (independently of each other).
condition (str) – Condition to be applied. Please see the description above for more details.
remove_data_fields_used_in_condition (bool) – Whether to remove the data fields used in the condition after evaluating it. This is a convenience feature and can be set to True if the data fields are not used after evaluating the condition. Note, however, that if some of the data fields are still needed, it has to be set to False, as the fields are not available after this step otherwise.
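A construction sketch reusing the example condition from above (field names are hypothetical):

from accvlab.dali_pipeline_framework.processing_steps import AnnotationElementConditionEval

condition_eval = AnnotationElementConditionEval(
    annotation_field_name="annotations",
    condition=(
        "is_valid = (num_lidar_points >= 1 or num_radar_points >= 1)"
        " and visibility_levels > 0 and category > 0"
    ),
    # Keep the input fields, e.g. if they are still needed downstream.
    remove_data_fields_used_in_condition=False,
)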
- class accvlab.dali_pipeline_framework.processing_steps.BEVBBoxesTransformer3D(data_field_names_points, data_field_names_velocities, data_field_names_sizes, data_field_names_orientation, data_field_names_proj_matrices_and_extrinsics, data_field_names_ego_to_world, data_field_names_world_to_ego, rotation_range, rotation_axis, scaling_range, translation_max_abs)[source]
Bases: PipelineStepBase
Augment BEV bounding boxes (and related geometry) with rotation, scaling, and translation.
The augmentation is applied in world coordinates. Related sensor geometry (e.g., extrinsics) is defined in ego coordinates and is updated accordingly using provided ego<->world transforms.
- The individual augmentation steps are applied in the following order:
Rotation
Scaling
Translation
- Parameters:
data_field_names_points (Union[str, int, Sequence[Union[str, int]], None]) – Name or names of data fields in the input SampleDataGroup instance containing the points representing the bounding box centers (in [x, y, z] format). Optional; updated if provided.
data_field_names_velocities (Union[str, int, Sequence[Union[str, int]], None]) – Name or names of data fields in the input SampleDataGroup instance containing the velocities of the objects (bounding boxes) (in [vx, vy, vz] format). Optional; updated if provided.
data_field_names_sizes (Union[str, int, Sequence[Union[str, int]], None]) – Name or names of data fields in the input SampleDataGroup instance containing the sizes of the bounding boxes (in [x, y, z] format). Optional; updated if provided.
data_field_names_orientation (Union[str, int, Sequence[Union[str, int]], None]) – Name or names of data fields in the input SampleDataGroup instance containing the orientations of the bounding boxes (in radians). Optional; updated if provided.
data_field_names_proj_matrices_and_extrinsics (Union[str, int, Sequence[Union[str, int]], None]) – Name or names of data fields in the input SampleDataGroup instance containing projection matrices and/or extrinsics. Note that camera intrinsics do not need to be adjusted and must not be included in this list. Optional; updated if provided.
data_field_names_ego_to_world (Union[str, int, Sequence[Union[str, int]], None]) – Name or names of data fields in the input SampleDataGroup instance containing matrices representing a transformation (e.g. for points) from ego to world coordinates. Optional; updated if provided.
data_field_names_world_to_ego (Union[str, int, Sequence[Union[str, int]], None]) – Name or names of data fields in the input SampleDataGroup instance containing matrices representing a transformation (e.g. for points) from world to ego coordinates. Optional; updated if provided.
rotation_range (Optional[Tuple[float, float]]) – Rotation range for the randomized rotation in the augmentation transformation. Optional; if not provided, no rotation is applied.
rotation_axis (Optional[int]) – Axis of rotation (0 indicating x, 1 indicating y, and 2 indicating z). Must be provided if rotation_range is provided.
scaling_range (Optional[Tuple[float, float]]) – Scaling range for the augmentation transformation. Optional; if not provided, no scaling is applied.
translation_max_abs (Optional[Tuple[float, float]]) – Maximum absolute translation in all dimensions. Optional; if not provided, no translation is applied.
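A construction sketch with hypothetical field names and illustrative ranges:

from accvlab.dali_pipeline_framework.processing_steps import BEVBBoxesTransformer3D

bev_augmenter = BEVBBoxesTransformer3D(
    data_field_names_points="bbox_centers",
    data_field_names_velocities="velocities",
    data_field_names_sizes="bbox_sizes",
    data_field_names_orientation="yaw",
    data_field_names_proj_matrices_and_extrinsics=["lidar2img", "extrinsics"],
    data_field_names_ego_to_world="ego2world",
    data_field_names_world_to_ego="world2ego",
    rotation_range=(-0.3925, 0.3925),  # radians, around the z-axis
    rotation_axis=2,
    scaling_range=(0.95, 1.05),
    translation_max_abs=(0.5, 0.5),
)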
- class accvlab.dali_pipeline_framework.processing_steps.VisibleBboxSelector(bboxes_field_name, resulting_mask_field_path, image_field_name=None, image_hw_field_name=None, image_hw=None, check_for_bbox_occlusion=True, check_for_minimum_size=True, depths_field_name=None, minimum_bbox_size=None)[source]
Bases: PipelineStepBase
Select visible 2D bounding boxes.
A box is considered visible if it is not completely overlapped by nearer boxes (occlusion test) and/or if it meets a minimum size threshold. The result is written as a boolean mask to the configured output path. Both checks are optional and can be enabled or disabled independently.
A mask is added which indicates which boxes are visible. The original bounding boxes are not modified.
See also
AnnotationElementConditionEval can be used to combine the results of this step with other conditions. ConditionalElementRemover can be used to remove elements from the data based on this condition or a combination of this condition with other conditions.
Note that the step expects exactly one data field in the input SampleDataGroup instance to contain the bounding boxes (as well as only one field containing the depths). If multiple sets of bounding boxes are present in the data, this processing step has to be applied to parts (sub-trees) of the input data individually, so that each part contains only one set of bounding boxes; access modifier wrapper steps need to be used for this (see GroupToApplyToSelectedStepBase and its subclasses).
- Parameters:
bboxes_field_name (Union[str, int]) – Name of the data field in the input SampleDataGroup instance containing the bounding boxes. Each row is expected to contain a bounding box in the format [min_x, min_y, max_x, max_y]. The input data must contain exactly one field with this name.
resulting_mask_field_path (Union[str, int, Tuple[Tuple[str, int], ...]]) – Path of the data field to store the result as. The path is relative to the root element. Note that if this step is wrapped by a sub-tree selection step, the root of the selected sub-tree acts as the root.
image_field_name (Union[str, int, None], default: None) – Name of the field containing the image from which to extract the size. Only one of image_field_name, image_hw_field_name, or image_hw should be set (single source of truth).
image_hw_field_name (Union[str, int, None], default: None) – Name of the field containing the image size for which the bounding boxes are defined. Only one of image_field_name, image_hw_field_name, or image_hw should be set (single source of truth).
image_hw (Optional[Sequence[int]], default: None) – Image size [height, width] of the image for which the bounding boxes are defined. Only one of image_field_name, image_hw_field_name, or image_hw should be set (single source of truth).
check_for_bbox_occlusion (bool, default: True) – Whether to consider boxes invisible if they are completely overlapped by nearer boxes.
check_for_minimum_size (bool, default: True) – Whether to consider boxes invisible if they are below a minimum size.
depths_field_name (Union[str, int, None], default: None) – Name of the data field containing the bounding box depths. Needs to be set if check_for_bbox_occlusion is set to True. The input data must contain exactly one field with this name.
minimum_bbox_size (Optional[float], default: None) – Minimum size of a bounding box to be considered visible. Needs to be set if check_for_minimum_size is set to True.
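A construction sketch enabling both checks (field names and the size threshold are hypothetical):

from accvlab.dali_pipeline_framework.processing_steps import VisibleBboxSelector

selector = VisibleBboxSelector(
    bboxes_field_name="bboxes",
    resulting_mask_field_path="is_visible",
    image_hw_field_name="image_hw",
    check_for_bbox_occlusion=True,
    depths_field_name="depths",  # required for the occlusion check
    check_for_minimum_size=True,
    minimum_bbox_size=8.0,       # illustrative threshold
)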
- class accvlab.dali_pipeline_framework.processing_steps.PointsInRangeCheck(points_fields_name, is_inside_field_name, minimum_point, maximum_point)[source]
Bases: PipelineStepBase
Check whether points lie within a given axis-aligned box and add a boolean mask.
See also
AnnotationElementConditionEval can be used to combine the results of this step with other conditions. ConditionalElementRemover can be used to remove elements from the data based on this condition or a combination of this condition with other conditions.
- Parameters:
points_fields_name (str) – Name of the data field containing the points to check. If multiple fields with that name are present, each is processed independently.
is_inside_field_name (str) – Name of the sibling data field to store the boolean mask in. Must not already exist.
minimum_point (Sequence[float]) – Lower corner (minimum per dimension) of the region.
maximum_point (Sequence[float]) – Upper corner (maximum per dimension) of the region.
- class accvlab.dali_pipeline_framework.processing_steps.ConditionalElementRemover(annotation_field_name, mask_field_name, field_names_to_process, field_dims_to_process, fields_to_process_num_dims, remove_mask_field)[source]
Bases: PipelineStepBase
Remove elements from arrays (e.g., per-object data) based on a boolean mask.
Arrays are stored as (multi-dimensional) tensors; for each array, a dimension index indicates the element axis (the axis along which the elements to be removed/retained are enumerated). Elements with mask value False are removed along the configured dimension for each target field.
See also
Multiple classes are available which evaluate conditions of some kind and store the results as boolean masks that can be used by this class, e.g. AnnotationElementConditionEval, VisibleBboxSelector, and PointsInRangeCheck.
- Parameters:
annotation_field_name (Union[str, int]) – Name of the annotation data group field to process. Each annotation field is processed independently.
mask_field_name (Union[str, int]) – Name of the boolean mask indicating which elements to keep (True) or remove (False). Must be a child of each annotation field.
field_names_to_process (Sequence[Union[str, int]]) – Names of the fields to process. The fields must be present in each annotation field.
field_dims_to_process (Sequence[int]) – For each field name, the dimension index along which elements are to be removed.
fields_to_process_num_dims (Sequence[int]) – For each field name, the number of dimensions of the tensor.
remove_mask_field (bool) – Whether to remove the mask field after applying this step.
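A sketch combining PointsInRangeCheck with this step: the mask produced by the check is stored as a sibling of the points field (i.e. inside the annotation) and then drives the removal. Field names, ranges, and the per-field dimension indices are hypothetical:

from accvlab.dali_pipeline_framework.processing_steps import (
    ConditionalElementRemover,
    PointsInRangeCheck,
)

# 1) Flag objects whose centers lie inside the detection range; the mask is
#    stored as a sibling of "bbox_centers" inside each annotation.
in_range_check = PointsInRangeCheck(
    points_fields_name="bbox_centers",
    is_inside_field_name="in_range",
    minimum_point=[-51.2, -51.2, -5.0],
    maximum_point=[51.2, 51.2, 3.0],
)

# 2) Remove out-of-range objects from the per-object fields.
remover = ConditionalElementRemover(
    annotation_field_name="annotations",
    mask_field_name="in_range",
    field_names_to_process=["bbox_centers", "bbox_sizes", "yaw"],
    field_dims_to_process=[0, 0, 0],       # element axis of each field
    fields_to_process_num_dims=[2, 2, 1],  # tensor rank of each field
    remove_mask_field=True,                # drop the mask once applied
)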
- class accvlab.dali_pipeline_framework.processing_steps.UnneededFieldRemover(unneeded_field_names)[source]
Bases: PipelineStepBase
Processing step for removing unneeded fields from the data.
This step does not add any operations to the DALI graph, i.e. it is fully performed at DALI graph construction time and has no overhead at runtime. This means that this step can be used inside the pipeline multiple times to ensure a clean data structure without any performance penalty (apart from the overhead at graph construction time).
Note
For pipelines that use data that is not needed in the final output (e.g. intermediate results, image size on the CPU, etc.), it is advisable to perform this step at least once, directly before outputting the data, in order to avoid unneeded copies & clutter in the final output.
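A short sketch (field names hypothetical):

from accvlab.dali_pipeline_framework.processing_steps import UnneededFieldRemover

# Typically placed directly before the pipeline outputs, so intermediate
# fields do not end up in the final output.
cleanup = UnneededFieldRemover(unneeded_field_names=["image_hw", "in_range"])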