mmselfsup/docs/en/advanced_guides/add_transforms.md

# Add Transforms

In this tutorial, we introduce the basic steps to create your customized transforms. Before learning to create your customized transforms, it is recommended to learn the basic concept of transforms in file [transforms.md](transforms.md).

- [Add Transforms](#add-transforms)
  - [Overview of Pipeline](#overview-of-pipeline)
  - [Creating a new transform in Pipeline](#creating-a-new-transform-in-pipeline)
    - [Step 1: Creating the transform](#step-1-creating-the-transform)
    - [Step 2: Add NewTransform to \_\_init\_\_py](#step-2-add-newtransform-to-__init__py)
    - [Step 3: Modify the config file](#step-3-modify-the-config-file)

## Overview of Pipeline

`Pipeline` is an important component in `Dataset`, which is responsible for applying a series of data augmentations to images, such as `RandomResizedCrop`, `RandomFlip`, etc.

Here is a config example of `Pipeline` for `SimCLR` training:

```python
view_pipeline = [
    dict(type='RandomResizedCrop', size=224, backend='pillow'),
    dict(type='RandomFlip', prob=0.5),
    dict(
        type='RandomApply',
        transforms=[
            dict(
                type='ColorJitter',
                brightness=0.8,
                contrast=0.8,
                saturation=0.8,
                hue=0.2)
        ],
        prob=0.8),
    dict(
        type='RandomGrayscale',
        prob=0.2,
        keep_channels=True,
        channel_weights=(0.114, 0.587, 0.2989)),
    dict(type='RandomGaussianBlur', sigma_min=0.1, sigma_max=2.0, prob=0.5),
]

train_pipeline = [
    dict(type='LoadImageFromFile', file_client_args=file_client_args),
    dict(type='MultiView', num_views=2, transforms=[view_pipeline]),
    dict(type='PackSelfSupInputs', meta_keys=['img_path'])
]
```

Every augmentation in the `Pipeline` receives a `dict` as input and outputs a `dict` containing the augmented image and other related information.

## Creating a new transform in Pipeline

Here are the steps to create a new transform.

### Step 1: Creating the transform

Write a new transform in [processing.py](https://github.com/open-mmlab/mmselfsup/tree/dev-1.x/mmselfsup/datasets/transforms/processing.py) and overwrite the `transform` function, which takes a `dict` as input:

```python
@TRANSFORMS.register_module()
class NewTransform(BaseTransform):
    """Docstring for transform.
    """

    def transform(self, results: dict) -> dict:
        # apply transform
        return results
```

**Note:** For the implementation of transforms, you could apply functions in [mmcv](https://github.com/open-mmlab/mmcv/tree/dev-2.x/mmcv/image).

### Step 2: Add NewTransform to \_\_init\_\_py

Then, add the transform to [\_\_init\_\_.py](https://github.com/open-mmlab/mmselfsup/blob/1.x/mmselfsup/datasets/transforms/__init__.py).

```python
...
from .processing import NewTransform, ...

__all__ = [
    ..., 'NewTransform'
]
```

### Step 3: Modify the config file

To use `NewTransform`, you can modify the config as the following:

```python
view_pipeline = [
    dict(type='RandomResizedCrop', size=224, backend='pillow'),
    dict(type='RandomFlip', prob=0.5),
    # add `NewTransform`
    dict(type='NewTransform'),
    dict(
        type='RandomApply',
        transforms=[
            dict(
                type='ColorJitter',
                brightness=0.8,
                contrast=0.8,
                saturation=0.8,
                hue=0.2)
        ],
        prob=0.8),
    dict(
        type='RandomGrayscale',
        prob=0.2,
        keep_channels=True,
        channel_weights=(0.114, 0.587, 0.2989)),
    dict(type='RandomGaussianBlur', sigma_min=0.1, sigma_max=2.0, prob=0.5),
]

train_pipeline = [
    dict(type='LoadImageFromFile', file_client_args=file_client_args),
    dict(type='MultiView', num_views=2, transforms=[view_pipeline]),
    dict(type='PackSelfSupInputs', meta_keys=['img_path'])
]
```
[Refactor] Refactor docs directory (#419) * refactor directory * modify titles * fix lint * update index.rst * update * fix typo * update * fix typo * update model zoo * update index.rst * fix typo * fix typo 2022-08-17 12:06:41 +08:00			`# Add Transforms`
[Feature]: Add docs and docker 2021-12-15 19:06:36 +08:00
[Docs] Refine add-datasets and add-transform docs (#436) * update add datasets doc * refactor add transform * refine * add `step` to subtitle * fix typo * update link 2022-08-31 19:20:49 +08:00			`In this tutorial, we introduce the basic steps to create your customized transforms. Before learning to create your customized transforms, it is recommended to learn the basic concept of transforms in file [transforms.md](transforms.md).`

[Refactor] Refactor docs directory (#419) * refactor directory * modify titles * fix lint * update index.rst * update * fix typo * update * fix typo * update model zoo * update index.rst * fix typo * fix typo 2022-08-17 12:06:41 +08:00			`- [Add Transforms](#add-transforms)`
[Docs] Refine add-datasets and add-transform docs (#436) * update add datasets doc * refactor add transform * refine * add `step` to subtitle * fix typo * update link 2022-08-31 19:20:49 +08:00			`- [Overview of Pipeline](#overview-of-pipeline)`
			`- [Creating a new transform in Pipeline](#creating-a-new-transform-in-pipeline)`
			`- [Step 1: Creating the transform](#step-1-creating-the-transform)`
			`- [Step 2: Add NewTransform to \_\_init\_\_py](#step-2-add-newtransform-to-__init__py)`
			`- [Step 3: Modify the config file](#step-3-modify-the-config-file)`
[Feature]: Add docs and docker 2021-12-15 19:06:36 +08:00
[Docs] Refine add-datasets and add-transform docs (#436) * update add datasets doc * refactor add transform * refine * add `step` to subtitle * fix typo * update link 2022-08-31 19:20:49 +08:00			`## Overview of Pipeline`
[Feature]: Add docs and docker 2021-12-15 19:06:36 +08:00
[Docs] Refine add-datasets and add-transform docs (#436) * update add datasets doc * refactor add transform * refine * add `step` to subtitle * fix typo * update link 2022-08-31 19:20:49 +08:00			`Pipeline` is an important component in `Dataset`, which is responsible for applying a series of data augmentations to images, such as `RandomResizedCrop`, `RandomFlip`, etc.
[Feature]: Add docs and docker 2021-12-15 19:06:36 +08:00
			Here is a config example of `Pipeline` for `SimCLR` training:

[Docs] translate 2_data_pipeline.md and 3_new_module.md into Chinese and fix some typos. (#168) * [Docs] translate 2_data_pipeline.md into Chinese * [Docs] translate 3_new_module.md into Chinese * [Docs] Fix typos from py to python 2022-01-10 12:39:14 +08:00			```python
[Docs] Refine add-datasets and add-transform docs (#436) * update add datasets doc * refactor add transform * refine * add `step` to subtitle * fix typo * update link 2022-08-31 19:20:49 +08:00			`view_pipeline = [`
			`dict(type='RandomResizedCrop', size=224, backend='pillow'),`
			`dict(type='RandomFlip', prob=0.5),`
[Feature]: Add docs and docker 2021-12-15 19:06:36 +08:00			`dict(`
[Docs] Refine add-datasets and add-transform docs (#436) * update add datasets doc * refactor add transform * refine * add `step` to subtitle * fix typo * update link 2022-08-31 19:20:49 +08:00			`type='RandomApply',`
[Feature]: Add docs and docker 2021-12-15 19:06:36 +08:00			`transforms=[`
			`dict(`
			`type='ColorJitter',`
			`brightness=0.8,`
			`contrast=0.8,`
			`saturation=0.8,`
			`hue=0.2)`
			`],`
[Docs] Refine add-datasets and add-transform docs (#436) * update add datasets doc * refactor add transform * refine * add `step` to subtitle * fix typo * update link 2022-08-31 19:20:49 +08:00			`prob=0.8),`
			`dict(`
			`type='RandomGrayscale',`
			`prob=0.2,`
			`keep_channels=True,`
			`channel_weights=(0.114, 0.587, 0.2989)),`
			`dict(type='RandomGaussianBlur', sigma_min=0.1, sigma_max=2.0, prob=0.5),`
			`]`

			`train_pipeline = [`
			`dict(type='LoadImageFromFile', file_client_args=file_client_args),`
			`dict(type='MultiView', num_views=2, transforms=[view_pipeline]),`
			`dict(type='PackSelfSupInputs', meta_keys=['img_path'])`
[Feature]: Add docs and docker 2021-12-15 19:06:36 +08:00			`]`
			```

[Docs] Refine add-datasets and add-transform docs (#436) * update add datasets doc * refactor add transform * refine * add `step` to subtitle * fix typo * update link 2022-08-31 19:20:49 +08:00			Every augmentation in the `Pipeline` receives a `dict` as input and outputs a `dict` containing the augmented image and other related information.
[Feature]: Add docs and docker 2021-12-15 19:06:36 +08:00
[Docs] Refine add-datasets and add-transform docs (#436) * update add datasets doc * refactor add transform * refine * add `step` to subtitle * fix typo * update link 2022-08-31 19:20:49 +08:00			`## Creating a new transform in Pipeline`
[Feature]: Add docs and docker 2021-12-15 19:06:36 +08:00
[Docs] Refine add-datasets and add-transform docs (#436) * update add datasets doc * refactor add transform * refine * add `step` to subtitle * fix typo * update link 2022-08-31 19:20:49 +08:00			`Here are the steps to create a new transform.`

			`### Step 1: Creating the transform`

			Write a new transform in [processing.py](https://github.com/open-mmlab/mmselfsup/tree/dev-1.x/mmselfsup/datasets/transforms/processing.py) and overwrite the `transform` function, which takes a `dict` as input:
[Feature]: Add docs and docker 2021-12-15 19:06:36 +08:00
[Docs] translate 2_data_pipeline.md and 3_new_module.md into Chinese and fix some typos. (#168) * [Docs] translate 2_data_pipeline.md into Chinese * [Docs] translate 3_new_module.md into Chinese * [Docs] Fix typos from py to python 2022-01-10 12:39:14 +08:00			```python
[Docs] Refine add-datasets and add-transform docs (#436) * update add datasets doc * refactor add transform * refine * add `step` to subtitle * fix typo * update link 2022-08-31 19:20:49 +08:00			`@TRANSFORMS.register_module()`
			`class NewTransform(BaseTransform):`
			`"""Docstring for transform.`
			`"""`
[Feature]: Add docs and docker 2021-12-15 19:06:36 +08:00
[Docs] Refine add-datasets and add-transform docs (#436) * update add datasets doc * refactor add transform * refine * add `step` to subtitle * fix typo * update link 2022-08-31 19:20:49 +08:00			`def transform(self, results: dict) -> dict:`
			`# apply transform`
			`return results`
[Feature]: Add docs and docker 2021-12-15 19:06:36 +08:00			```

[Docs] Refine add-datasets and add-transform docs (#436) * update add datasets doc * refactor add transform * refine * add `step` to subtitle * fix typo * update link 2022-08-31 19:20:49 +08:00			`Note: For the implementation of transforms, you could apply functions in [mmcv](https://github.com/open-mmlab/mmcv/tree/dev-2.x/mmcv/image).`

			`### Step 2: Add NewTransform to \_\_init\_\_py`

			`Then, add the transform to [\_\_init\_\_.py](https://github.com/open-mmlab/mmselfsup/blob/1.x/mmselfsup/datasets/transforms/__init__.py).`
[Feature]: Add docs and docker 2021-12-15 19:06:36 +08:00
[Docs] translate 2_data_pipeline.md and 3_new_module.md into Chinese and fix some typos. (#168) * [Docs] translate 2_data_pipeline.md into Chinese * [Docs] translate 3_new_module.md into Chinese * [Docs] Fix typos from py to python 2022-01-10 12:39:14 +08:00			```python
[Docs] Refine add-datasets and add-transform docs (#436) * update add datasets doc * refactor add transform * refine * add `step` to subtitle * fix typo * update link 2022-08-31 19:20:49 +08:00			`...`
			`from .processing import NewTransform, ...`

			`__all__ = [`
			`..., 'NewTransform'`
			`]`
			```

			`### Step 3: Modify the config file`

			To use `NewTransform`, you can modify the config as the following:

			```python
			`view_pipeline = [`
			`dict(type='RandomResizedCrop', size=224, backend='pillow'),`
			`dict(type='RandomFlip', prob=0.5),`
			# add `NewTransform`
			`dict(type='NewTransform'),`
[Feature]: Add docs and docker 2021-12-15 19:06:36 +08:00			`dict(`
[Docs] Refine add-datasets and add-transform docs (#436) * update add datasets doc * refactor add transform * refine * add `step` to subtitle * fix typo * update link 2022-08-31 19:20:49 +08:00			`type='RandomApply',`
[Feature]: Add docs and docker 2021-12-15 19:06:36 +08:00			`transforms=[`
			`dict(`
			`type='ColorJitter',`
			`brightness=0.8,`
			`contrast=0.8,`
			`saturation=0.8,`
			`hue=0.2)`
			`],`
[Docs] Refine add-datasets and add-transform docs (#436) * update add datasets doc * refactor add transform * refine * add `step` to subtitle * fix typo * update link 2022-08-31 19:20:49 +08:00			`prob=0.8),`
			`dict(`
			`type='RandomGrayscale',`
			`prob=0.2,`
			`keep_channels=True,`
			`channel_weights=(0.114, 0.587, 0.2989)),`
			`dict(type='RandomGaussianBlur', sigma_min=0.1, sigma_max=2.0, prob=0.5),`
			`]`

			`train_pipeline = [`
			`dict(type='LoadImageFromFile', file_client_args=file_client_args),`
			`dict(type='MultiView', num_views=2, transforms=[view_pipeline]),`
			`dict(type='PackSelfSupInputs', meta_keys=['img_path'])`
[Feature]: Add docs and docker 2021-12-15 19:06:36 +08:00			`]`
			```