mmsegmentation/configs/dpt/dpt_vit-b16_512x512_160k_ad...

_base_ = [
    '../_base_/models/dpt_vit-b16.py', '../_base_/datasets/ade20k.py',
    '../_base_/default_runtime.py', '../_base_/schedules/schedule_160k.py'
]
crop_size = (512, 512)
data_preprocessor = dict(size=crop_size)
model = dict(data_preprocessor=data_preprocessor)
# AdamW optimizer; weight decay is disabled (decay_mult=0) for parameters whose
# names contain 'pos_embed', 'cls_token', or 'norm'

optim_wrapper = dict(
    _delete_=True,
    type='OptimWrapper',
    optimizer=dict(
        type='AdamW', lr=0.00006, betas=(0.9, 0.999), weight_decay=0.01),
    paramwise_cfg=dict(
        custom_keys={
            'pos_embed': dict(decay_mult=0.),
            'cls_token': dict(decay_mult=0.),
            'norm': dict(decay_mult=0.)
        }))
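
# Illustrative sketch only, not part of the upstream config: a simplified view
# of how the custom_keys above could translate into per-parameter weight decay,
# assuming keys are matched as substrings of parameter names (longest key
# first). The helper name and any parameter names passed to it are
# hypothetical; copy it into a separate script to experiment.
def sketch_weight_decay(param_name, base_wd=0.01, custom_keys=None):
    """Return the weight decay a parameter would get under this assumption."""
    if custom_keys is None:
        custom_keys = {
            'pos_embed': dict(decay_mult=0.),
            'cls_token': dict(decay_mult=0.),
            'norm': dict(decay_mult=0.),
        }
    # Try longer keys first so the most specific match wins.
    for key in sorted(custom_keys, key=len, reverse=True):
        if key in param_name:
            return base_wd * custom_keys[key].get('decay_mult', 1.0)
    # No key matched: the parameter keeps the optimizer-wide weight decay.
    return base_wd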

param_scheduler = [
    dict(
        type='LinearLR', start_factor=1e-6, by_epoch=False, begin=0, end=1500),
    dict(
        type='PolyLR',
        eta_min=0.0,
        power=1.0,
        begin=1500,
        end=160000,
        by_epoch=False,
    )
]
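
# Illustrative sketch only, not part of the upstream config: an idealized
# closed form of the schedule above (linear warmup for 1500 iterations, then
# polynomial decay with power 1.0 down to eta_min at iteration 160000). The
# helper name is hypothetical, and the real schedulers update the learning
# rate step by step, so exact values may differ slightly near the boundaries.
def sketch_lr(step, base_lr=0.00006, start_factor=1e-6, warmup_end=1500,
              max_iters=160000, eta_min=0.0, power=1.0):
    """Approximate learning rate at iteration `step` under these assumptions."""
    if step < warmup_end:
        # Warmup: the multiplier grows linearly from start_factor to 1.0.
        factor = start_factor + (1.0 - start_factor) * step / warmup_end
        return base_lr * factor
    # Decay: fraction of the post-warmup phase that has elapsed, in [0, 1].
    progress = (step - warmup_end) / (max_iters - warmup_end)
    return (base_lr - eta_min) * (1.0 - progress) ** power + eta_min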

# By default, models are trained on 8 GPUs with 2 images per GPU
# (total batch size 8 x 2 = 16)
train_dataloader = dict(batch_size=2, num_workers=2)
val_dataloader = dict(batch_size=1, num_workers=4)
test_dataloader = val_dataloader