mmsegmentation/configs/vit
谢昕辰 5245edb0a0 add configs for vit backbone plus decode_heads (#520)
* add config

* add cityscapes config

* add default value to docstring

* fix lint

* add deit-s and deit-b

* add readme

* add eps at norm_cfg

* add drop_path_rate experiment

* add deit case at init_weight

* add upernet result

* update result and add upernet 160k config

* update upernet result and fix settings

* Update iters number

* update result and delete some configs

* fix import error

* fix drop_path_rate

* update result and restore config

* update benchmark result

* remove cityscapes exp

* remove neck

* neck exp

* add more configs

* fix init error

* fix ffn setting

* update result

* update results

* update result

* update results and fill table

* delete or rename configs

* fix link delimiter

* rename configs and fix link

* rename neck to mln
2021-07-01 23:00:39 +08:00
..
README.md add configs for vit backbone plus decode_heads (#520) 2021-07-01 23:00:39 +08:00
upernet_deit-b16_512x512_80k_ade20k.py add configs for vit backbone plus decode_heads (#520) 2021-07-01 23:00:39 +08:00
upernet_deit-b16_512x512_160k_ade20k.py add configs for vit backbone plus decode_heads (#520) 2021-07-01 23:00:39 +08:00
upernet_deit-b16_ln_mln_512x512_160k_ade20k.py add configs for vit backbone plus decode_heads (#520) 2021-07-01 23:00:39 +08:00
upernet_deit-b16_mln_512x512_160k_ade20k.py add configs for vit backbone plus decode_heads (#520) 2021-07-01 23:00:39 +08:00
upernet_deit-s16_512x512_80k_ade20k.py add configs for vit backbone plus decode_heads (#520) 2021-07-01 23:00:39 +08:00
upernet_deit-s16_512x512_160k_ade20k.py add configs for vit backbone plus decode_heads (#520) 2021-07-01 23:00:39 +08:00
upernet_deit-s16_ln_mln_512x512_160k_ade20k.py add configs for vit backbone plus decode_heads (#520) 2021-07-01 23:00:39 +08:00
upernet_deit-s16_mln_512x512_160k_ade20k.py add configs for vit backbone plus decode_heads (#520) 2021-07-01 23:00:39 +08:00
upernet_vit-b16_ln_mln_512x512_160k_ade20k.py add configs for vit backbone plus decode_heads (#520) 2021-07-01 23:00:39 +08:00
upernet_vit-b16_mln_512x512_80k_ade20k.py add configs for vit backbone plus decode_heads (#520) 2021-07-01 23:00:39 +08:00
upernet_vit-b16_mln_512x512_160k_ade20k.py add configs for vit backbone plus decode_heads (#520) 2021-07-01 23:00:39 +08:00

README.md

Vision Transformer

Introduction

@article{dosoViTskiy2020,
  title={An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale},
  author={DosoViTskiy, Alexey and Beyer, Lucas and Kolesnikov, Alexander and Weissenborn, Dirk and Zhai, Xiaohua and Unterthiner, Thomas and  Dehghani, Mostafa and Minderer, Matthias and Heigold, Georg and Gelly, Sylvain and Uszkoreit, Jakob and Houlsby, Neil},
  journal={arXiv preprint arXiv:2010.11929},
  year={2020}
}

Results and models

ADE20K

Method Backbone Crop Size Lr schd Mem (GB) Inf time (fps) mIoU mIoU(ms+flip) config download
UPerNet ViT-B + MLN 512x512 80000 9.20 6.94 47.71 49.51 config model | log
UPerNet ViT-B + MLN 512x512 160000 9.20 7.58 46.75 48.46 config model | log
UPerNet ViT-B + LN + MLN 512x512 160000 9.21 6.82 47.73 49.95 config model | log
UPerNet DeiT-S 512x512 80000 4.68 29.85 42.96 43.79 config model | log
UPerNet DeiT-S 512x512 160000 4.68 29.19 42.87 43.79 config model | log
UPerNet DeiT-S + MLN 512x512 160000 5.69 11.18 43.82 45.07 config model | log
UPerNet DeiT-S + LN + MLN 512x512 160000 5.69 12.39 43.52 45.01 config model | log
UPerNet DeiT-B 512x512 80000 7.75 9.69 45.24 46.73 config model | log
UPerNet DeiT-B 512x512 160000 7.75 10.39 45.36 47.16 config model | log
UPerNet DeiT-B + MLN 512x512 160000 9.21 7.78 45.46 47.16 config model | log
UPerNet DeiT-B + LN + MLN 512x512 160000 9.21 7.75 45.37 47.23 config model | log