# Customize Evaluation Metrics
## Use metrics in MMPretrain

In MMPretrain, we provide multiple metrics for both single-label classification and multi-label classification:

**Single-label Classification**:

- [`Accuracy`](mmpretrain.evaluation.Accuracy)
- [`SingleLabelMetric`](mmpretrain.evaluation.SingleLabelMetric), including precision, recall, f1-score and support.

**Multi-label Classification**:

- [`AveragePrecision`](mmpretrain.evaluation.AveragePrecision), or AP (mAP).
- [`MultiLabelMetric`](mmpretrain.evaluation.MultiLabelMetric), including precision, recall, f1-score and support.

To use these metrics during validation and testing, we need to modify the `val_evaluator` and `test_evaluator`
fields in the config file.
Here are several examples:

1. Calculate top-1 and top-5 accuracy during both validation and test.

   ```python
   val_evaluator = dict(type='Accuracy', topk=(1, 5))
   test_evaluator = val_evaluator
   ```

2. Calculate top-1 accuracy, top-5 accuracy, precision and recall during both validation and test.

   ```python
   val_evaluator = [
       dict(type='Accuracy', topk=(1, 5)),
       dict(type='SingleLabelMetric', items=['precision', 'recall']),
   ]
   test_evaluator = val_evaluator
   ```

3. Calculate mAP (mean AveragePrecision), CP (Class-wise mean Precision), CR (Class-wise mean Recall), CF (Class-wise mean F1-score), OP (Overall mean Precision), OR (Overall mean Recall) and OF1 (Overall mean F1-score).

   ```python
   val_evaluator = [
       dict(type='AveragePrecision'),
       dict(type='MultiLabelMetric', average='macro'),  # class-wise mean
       dict(type='MultiLabelMetric', average='micro'),  # overall mean
   ]
   test_evaluator = val_evaluator
   ```

## Add new metrics

MMPretrain supports implementing customized evaluation metrics for users who need further customization.

You need to create a new file under `mmpretrain/evaluation/metrics`, and implement the new metric in the file, for example, in `mmpretrain/evaluation/metrics/my_metric.py`. In that file, define a customized evaluation metric class `MyMetric` which inherits [`BaseMetric` in MMEngine](mmengine.evaluator.BaseMetric).

The data processing method `process` and the metric calculation method `compute_metrics` need to be overridden. Register the class in the `METRICS` registry so it can be referenced from config files.

```python
from typing import Dict, List, Sequence

from mmengine.evaluator import BaseMetric

from mmpretrain.registry import METRICS


@METRICS.register_module()
class MyMetric(BaseMetric):

    def process(self, data_batch: Sequence[Dict], data_samples: Sequence[Dict]):
        """The processed results should be stored in ``self.results``, which will
        be used to compute the metrics when all batches have been processed.

        ``data_batch`` stores the batch data from the dataloader,
        and ``data_samples`` stores the batch outputs from the model.
        """
        ...

    def compute_metrics(self, results: List) -> Dict:
        """Compute the metrics from the processed results and return the
        evaluation results."""
        ...
```
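
To make the skeleton concrete, below is a minimal sketch of a filled-in `MyMetric` that reports a hypothetical top-1 `error_rate`. The `'pred_label'` and `'gt_label'` fields are assumptions modeled on what the built-in classification metrics read from data samples, and `default_prefix` is only an illustrative choice; adapt both to what your model actually produces.

```python
from typing import Dict, List, Sequence

from mmengine.evaluator import BaseMetric

from mmpretrain.registry import METRICS


@METRICS.register_module()
class MyMetric(BaseMetric):

    # Prepended to the metric names in logs, e.g. 'my_metric/error_rate'.
    default_prefix = 'my_metric'

    def process(self, data_batch: Sequence[Dict], data_samples: Sequence[Dict]):
        # Keep only what `compute_metrics` needs, one record per sample.
        # NOTE: 'pred_label' / 'gt_label' are assumed field names; adjust them
        # to whatever your model writes into the data samples.
        for data_sample in data_samples:
            self.results.append({
                'pred_label': data_sample['pred_label'].cpu(),
                'gt_label': data_sample['gt_label'].cpu(),
            })

    def compute_metrics(self, results: List) -> Dict:
        # Aggregate the per-sample records collected in `process`.
        correct = sum(
            int((res['pred_label'] == res['gt_label']).all()) for res in results)
        return {'error_rate': 1 - correct / len(results)}
```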

Then, import it in `mmpretrain/evaluation/metrics/__init__.py` to add it to the `mmpretrain.evaluation` package.

```python
# In mmpretrain/evaluation/metrics/__init__.py
...
from .my_metric import MyMetric
__all__ = [..., 'MyMetric']
```

Finally, use `MyMetric` in the `val_evaluator` and `test_evaluator` fields of config files.

```python
val_evaluator = dict(type='MyMetric', ...)
test_evaluator = val_evaluator
```
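
Outside a full training run, a custom metric can also be exercised directly through the `BaseMetric` interface as a quick sanity check. The snippet below is only a sketch: it assumes the filled-in `MyMetric` above and hand-crafted data samples using the assumed `'pred_label'`/`'gt_label'` fields.

```python
import torch

# Hand-crafted fake samples matching the assumed field names above.
fake_samples = [
    {'pred_label': torch.tensor([1]), 'gt_label': torch.tensor([1])},
    {'pred_label': torch.tensor([0]), 'gt_label': torch.tensor([2])},
]

metric = MyMetric()
metric.process(data_batch=None, data_samples=fake_samples)  # data_batch is unused here
print(metric.evaluate(size=len(fake_samples)))  # e.g. {'my_metric/error_rate': 0.5}
```
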
```{note}
More details can be found in {external+mmengine:doc}`MMEngine Documentation: Evaluation <design/evaluation>`.
```