mmsegmentation/configs/ddrnet/ddrnet_23_in1k-pre_2xb6-120k_cityscapes-1024x1024.py

_base_ = [
    '../_base_/datasets/cityscapes_1024x1024.py',
    '../_base_/default_runtime.py',
]

# The class_weight is borrowed from https://github.com/openseg-group/OCNet.pytorch/issues/14 # noqa
# Licensed under the MIT License
class_weight = [
    0.8373, 0.918, 0.866, 1.0345, 1.0166, 0.9969, 0.9754, 1.0489, 0.8786,
    1.0023, 0.9539, 0.9843, 1.1116, 0.9037, 1.0865, 1.0955, 1.0865, 1.1529,
    1.0507
]
checkpoint = 'https://download.openmmlab.com/mmsegmentation/v0.5/ddrnet/pretrain/ddrnet23-in1kpre_3rdparty-9ca29f62.pth'  # noqa
crop_size = (1024, 1024)
data_preprocessor = dict(
    type='SegDataPreProcessor',
    size=crop_size,
    mean=[123.675, 116.28, 103.53],
    std=[58.395, 57.12, 57.375],
    bgr_to_rgb=True,
    pad_val=0,
    seg_pad_val=255)
norm_cfg = dict(type='SyncBN', requires_grad=True)
model = dict(
    type='EncoderDecoder',
    data_preprocessor=data_preprocessor,
    backbone=dict(
        type='DDRNet',
        in_channels=3,
        channels=64,
        ppm_channels=128,
        norm_cfg=norm_cfg,
        align_corners=False,
        init_cfg=dict(type='Pretrained', checkpoint=checkpoint)),
    decode_head=dict(
        type='DDRHead',
        in_channels=64 * 4,
        channels=128,
        dropout_ratio=0.,
        num_classes=19,
        align_corners=False,
        norm_cfg=norm_cfg,
        loss_decode=[
            dict(
                type='OhemCrossEntropy',
                thres=0.9,
                min_kept=131072,
                class_weight=class_weight,
                loss_weight=1.0),
            dict(
                type='OhemCrossEntropy',
                thres=0.9,
                min_kept=131072,
                class_weight=class_weight,
                loss_weight=0.4),
        ]),

    # model training and testing settings
    train_cfg=dict(),
    test_cfg=dict(mode='whole'))

train_dataloader = dict(batch_size=6, num_workers=4)

iters = 120000
# optimizer
optimizer = dict(type='SGD', lr=0.01, momentum=0.9, weight_decay=0.0005)
optim_wrapper = dict(type='OptimWrapper', optimizer=optimizer, clip_grad=None)
# learning policy
param_scheduler = [
    dict(
        type='PolyLR',
        eta_min=0,
        power=0.9,
        begin=0,
        end=iters,
        by_epoch=False)
]

# training schedule for 120k
train_cfg = dict(
    type='IterBasedTrainLoop', max_iters=iters, val_interval=iters // 10)
val_cfg = dict(type='ValLoop')
test_cfg = dict(type='TestLoop')
default_hooks = dict(
    timer=dict(type='IterTimerHook'),
    logger=dict(type='LoggerHook', interval=50, log_metric_by_epoch=False),
    param_scheduler=dict(type='ParamSchedulerHook'),
    checkpoint=dict(
        type='CheckpointHook', by_epoch=False, interval=iters // 10),
    sampler_seed=dict(type='DistSamplerSeedHook'),
    visualization=dict(type='SegVisualizationHook'))

randomness = dict(seed=304)
[Feature] Support DDRNet (#2855) Thanks for your contribution and we appreciate it a lot. The following instructions would make your pull request more healthy and more easily get feedback. If you do not understand some items, don't worry, just make the pull request and seek help from maintainers. ## Motivation Support DDRNet Paper: [Deep Dual-resolution Networks for Real-time and Accurate Semantic Segmentation of Road Scenes](https://arxiv.org/pdf/2101.06085) official Code: https://github.com/ydhongHIT/DDRNet There is already a PR https://github.com/open-mmlab/mmsegmentation/pull/1722 , but it has been inactive for a long time. ## Current Result ### Cityscapes #### inference with converted official weights \| Method \| Backbone \| mIoU(official) \| mIoU(converted weight) \| \| ------ \| ------------- \| -------------- \| ---------------------- \| \| DDRNet \| DDRNet23-slim \| 77.8 \| 77.84 \| \| DDRNet \| DDRNet23 \| 79.5 \| 79.53 \| #### training with converted pretrained backbone \| Method \| Backbone \| Crop Size \| Lr schd \| Inf time(fps) \| Device \| mIoU \| mIoU(ms+flip) \| config \| download \| \| ------ \| ------------- \| --------- \| ------- \| ------- \| -------- \| ----- \| ------------- \| ------------ \| ------------ \| \| DDRNet \| DDRNet23-slim \| 1024x1024 \| 120000 \| 85.85 \| RTX 8000 \| 77.85 \| 79.80 \| [config](https://github.com/whu-pzhang/mmsegmentation/blob/ddrnet/configs/ddrnet/ddrnet_23-slim_in1k-pre_2xb6-120k_cityscapes-1024x1024.py) \| model \\| log \| \| DDRNet \| DDRNet23 \| 1024x1024 \| 120000 \| 33.41 \| RTX 8000 \| 79.53 \| 80.98 \| [config](https://github.com/whu-pzhang/mmsegmentation/blob/ddrnet/configs/ddrnet/ddrnet_23_in1k-pre_2xb6-120k_cityscapes-1024x1024.py) \| model \\| log \| The converted pretrained backbone weights download link: 1. [ddrnet23s_in1k_mmseg.pth](https://drive.google.com/file/d/1Ni4F1PMGGjuld-1S9fzDTmneLfpMuPTG/view?usp=sharing) 2. [ddrnet23_in1k_mmseg.pth](https://drive.google.com/file/d/11rsijC1xOWB6B0LgNQkAG-W6e1OdbCyJ/view?usp=sharing) ## To do - [x] support inference with converted official weights - [x] support training on cityscapes dataset --------- Co-authored-by: xiexinch <xiexinch@outlook.com> 2023-04-27 09:44:30 +08:00			`_base_ = [`
			`'../_base_/datasets/cityscapes_1024x1024.py',`
			`'../_base_/default_runtime.py',`
			`]`

			`# The class_weight is borrowed from https://github.com/openseg-group/OCNet.pytorch/issues/14 # noqa`
			`# Licensed under the MIT License`
			`class_weight = [`
			`0.8373, 0.918, 0.866, 1.0345, 1.0166, 0.9969, 0.9754, 1.0489, 0.8786,`
			`1.0023, 0.9539, 0.9843, 1.1116, 0.9037, 1.0865, 1.0955, 1.0865, 1.1529,`
			`1.0507`
			`]`
[Fix] Update ddrnet readme (#3198) 2023-07-14 11:16:16 +08:00			`checkpoint = 'https://download.openmmlab.com/mmsegmentation/v0.5/ddrnet/pretrain/ddrnet23-in1kpre_3rdparty-9ca29f62.pth' # noqa`
[Feature] Support DDRNet (#2855) Thanks for your contribution and we appreciate it a lot. The following instructions would make your pull request more healthy and more easily get feedback. If you do not understand some items, don't worry, just make the pull request and seek help from maintainers. ## Motivation Support DDRNet Paper: [Deep Dual-resolution Networks for Real-time and Accurate Semantic Segmentation of Road Scenes](https://arxiv.org/pdf/2101.06085) official Code: https://github.com/ydhongHIT/DDRNet There is already a PR https://github.com/open-mmlab/mmsegmentation/pull/1722 , but it has been inactive for a long time. ## Current Result ### Cityscapes #### inference with converted official weights \| Method \| Backbone \| mIoU(official) \| mIoU(converted weight) \| \| ------ \| ------------- \| -------------- \| ---------------------- \| \| DDRNet \| DDRNet23-slim \| 77.8 \| 77.84 \| \| DDRNet \| DDRNet23 \| 79.5 \| 79.53 \| #### training with converted pretrained backbone \| Method \| Backbone \| Crop Size \| Lr schd \| Inf time(fps) \| Device \| mIoU \| mIoU(ms+flip) \| config \| download \| \| ------ \| ------------- \| --------- \| ------- \| ------- \| -------- \| ----- \| ------------- \| ------------ \| ------------ \| \| DDRNet \| DDRNet23-slim \| 1024x1024 \| 120000 \| 85.85 \| RTX 8000 \| 77.85 \| 79.80 \| [config](https://github.com/whu-pzhang/mmsegmentation/blob/ddrnet/configs/ddrnet/ddrnet_23-slim_in1k-pre_2xb6-120k_cityscapes-1024x1024.py) \| model \\| log \| \| DDRNet \| DDRNet23 \| 1024x1024 \| 120000 \| 33.41 \| RTX 8000 \| 79.53 \| 80.98 \| [config](https://github.com/whu-pzhang/mmsegmentation/blob/ddrnet/configs/ddrnet/ddrnet_23_in1k-pre_2xb6-120k_cityscapes-1024x1024.py) \| model \\| log \| The converted pretrained backbone weights download link: 1. [ddrnet23s_in1k_mmseg.pth](https://drive.google.com/file/d/1Ni4F1PMGGjuld-1S9fzDTmneLfpMuPTG/view?usp=sharing) 2. [ddrnet23_in1k_mmseg.pth](https://drive.google.com/file/d/11rsijC1xOWB6B0LgNQkAG-W6e1OdbCyJ/view?usp=sharing) ## To do - [x] support inference with converted official weights - [x] support training on cityscapes dataset --------- Co-authored-by: xiexinch <xiexinch@outlook.com> 2023-04-27 09:44:30 +08:00			`crop_size = (1024, 1024)`
			`data_preprocessor = dict(`
			`type='SegDataPreProcessor',`
			`size=crop_size,`
			`mean=[123.675, 116.28, 103.53],`
			`std=[58.395, 57.12, 57.375],`
			`bgr_to_rgb=True,`
			`pad_val=0,`
			`seg_pad_val=255)`
			`norm_cfg = dict(type='SyncBN', requires_grad=True)`
			`model = dict(`
			`type='EncoderDecoder',`
			`data_preprocessor=data_preprocessor,`
			`backbone=dict(`
			`type='DDRNet',`
			`in_channels=3,`
			`channels=64,`
			`ppm_channels=128,`
			`norm_cfg=norm_cfg,`
			`align_corners=False,`
[Fix] Update ddrnet readme (#3198) 2023-07-14 11:16:16 +08:00			`init_cfg=dict(type='Pretrained', checkpoint=checkpoint)),`
[Feature] Support DDRNet (#2855) Thanks for your contribution and we appreciate it a lot. The following instructions would make your pull request more healthy and more easily get feedback. If you do not understand some items, don't worry, just make the pull request and seek help from maintainers. ## Motivation Support DDRNet Paper: [Deep Dual-resolution Networks for Real-time and Accurate Semantic Segmentation of Road Scenes](https://arxiv.org/pdf/2101.06085) official Code: https://github.com/ydhongHIT/DDRNet There is already a PR https://github.com/open-mmlab/mmsegmentation/pull/1722 , but it has been inactive for a long time. ## Current Result ### Cityscapes #### inference with converted official weights \| Method \| Backbone \| mIoU(official) \| mIoU(converted weight) \| \| ------ \| ------------- \| -------------- \| ---------------------- \| \| DDRNet \| DDRNet23-slim \| 77.8 \| 77.84 \| \| DDRNet \| DDRNet23 \| 79.5 \| 79.53 \| #### training with converted pretrained backbone \| Method \| Backbone \| Crop Size \| Lr schd \| Inf time(fps) \| Device \| mIoU \| mIoU(ms+flip) \| config \| download \| \| ------ \| ------------- \| --------- \| ------- \| ------- \| -------- \| ----- \| ------------- \| ------------ \| ------------ \| \| DDRNet \| DDRNet23-slim \| 1024x1024 \| 120000 \| 85.85 \| RTX 8000 \| 77.85 \| 79.80 \| [config](https://github.com/whu-pzhang/mmsegmentation/blob/ddrnet/configs/ddrnet/ddrnet_23-slim_in1k-pre_2xb6-120k_cityscapes-1024x1024.py) \| model \\| log \| \| DDRNet \| DDRNet23 \| 1024x1024 \| 120000 \| 33.41 \| RTX 8000 \| 79.53 \| 80.98 \| [config](https://github.com/whu-pzhang/mmsegmentation/blob/ddrnet/configs/ddrnet/ddrnet_23_in1k-pre_2xb6-120k_cityscapes-1024x1024.py) \| model \\| log \| The converted pretrained backbone weights download link: 1. [ddrnet23s_in1k_mmseg.pth](https://drive.google.com/file/d/1Ni4F1PMGGjuld-1S9fzDTmneLfpMuPTG/view?usp=sharing) 2. [ddrnet23_in1k_mmseg.pth](https://drive.google.com/file/d/11rsijC1xOWB6B0LgNQkAG-W6e1OdbCyJ/view?usp=sharing) ## To do - [x] support inference with converted official weights - [x] support training on cityscapes dataset --------- Co-authored-by: xiexinch <xiexinch@outlook.com> 2023-04-27 09:44:30 +08:00			`decode_head=dict(`
			`type='DDRHead',`
			`in_channels=64 * 4,`
			`channels=128,`
			`dropout_ratio=0.,`
			`num_classes=19,`
			`align_corners=False,`
			`norm_cfg=norm_cfg,`
			`loss_decode=[`
			`dict(`
			`type='OhemCrossEntropy',`
			`thres=0.9,`
			`min_kept=131072,`
			`class_weight=class_weight,`
			`loss_weight=1.0),`
			`dict(`
			`type='OhemCrossEntropy',`
			`thres=0.9,`
			`min_kept=131072,`
			`class_weight=class_weight,`
			`loss_weight=0.4),`
			`]),`

			`# model training and testing settings`
			`train_cfg=dict(),`
			`test_cfg=dict(mode='whole'))`

			`train_dataloader = dict(batch_size=6, num_workers=4)`

			`iters = 120000`
			`# optimizer`
			`optimizer = dict(type='SGD', lr=0.01, momentum=0.9, weight_decay=0.0005)`
			`optim_wrapper = dict(type='OptimWrapper', optimizer=optimizer, clip_grad=None)`
			`# learning policy`
			`param_scheduler = [`
			`dict(`
			`type='PolyLR',`
			`eta_min=0,`
			`power=0.9,`
			`begin=0,`
			`end=iters,`
			`by_epoch=False)`
			`]`

			`# training schedule for 120k`
			`train_cfg = dict(`
			`type='IterBasedTrainLoop', max_iters=iters, val_interval=iters // 10)`
			`val_cfg = dict(type='ValLoop')`
			`test_cfg = dict(type='TestLoop')`
			`default_hooks = dict(`
			`timer=dict(type='IterTimerHook'),`
			`logger=dict(type='LoggerHook', interval=50, log_metric_by_epoch=False),`
			`param_scheduler=dict(type='ParamSchedulerHook'),`
			`checkpoint=dict(`
			`type='CheckpointHook', by_epoch=False, interval=iters // 10),`
			`sampler_seed=dict(type='DistSamplerSeedHook'),`
			`visualization=dict(type='SegVisualizationHook'))`

			`randomness = dict(seed=304)`