mmsegmentation/configs/setr
README.md [Feature] Official implementation of SETR (#531) 2021-06-23 09:39:29 -07:00
setr.yml [Enhancement] md2yml pre-commit hook (#732) 2021-07-31 09:31:58 -07:00
setr_mla_512x512_160k_b8_ade20k.py [Feature] Official implementation of SETR (#531) 2021-06-23 09:39:29 -07:00
setr_mla_512x512_160k_b16_ade20k.py [Feature] Official implementation of SETR (#531) 2021-06-23 09:39:29 -07:00
setr_naive_512x512_160k_b16_ade20k.py [Feature] Official implementation of SETR (#531) 2021-06-23 09:39:29 -07:00
setr_pup_512x512_160k_b16_ade20k.py [Feature] Official implementation of SETR (#531) 2021-06-23 09:39:29 -07:00
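
The config filenames above appear to follow a regular pattern: decoder head, crop size, training schedule, batch size, dataset. The following is a minimal sketch of decoding that pattern, assuming `160k` means 160000 iterations and `b8`/`b16` mean batch size 8/16, which matches the results table in the README. `SetrConfigName` and `parse_config_name` are hypothetical helpers written for illustration; they are not part of mmsegmentation.

```python
import re
from typing import NamedTuple, Tuple


class SetrConfigName(NamedTuple):
    """Fields decoded from a SETR config filename (hypothetical helper)."""
    head: str                    # decoder variant, e.g. "naive", "pup", "mla"
    crop_size: Tuple[int, int]   # the two numbers in the order written, e.g. (512, 512)
    iterations: int              # training schedule, e.g. 160000
    batch_size: int              # assumed meaning of the "b8" / "b16" field
    dataset: str                 # e.g. "ade20k"


# Assumed naming pattern, inferred only from the filenames listed in this directory.
_PATTERN = re.compile(
    r"^setr_(?P<head>[a-z]+)_(?P<a>\d+)x(?P<b>\d+)"
    r"_(?P<iters>\d+)k_b(?P<batch>\d+)_(?P<dataset>\w+)\.py$"
)


def parse_config_name(filename: str) -> SetrConfigName:
    """Split a filename such as setr_mla_512x512_160k_b8_ade20k.py into fields."""
    match = _PATTERN.match(filename)
    if match is None:
        raise ValueError(f"unrecognized SETR config name: {filename!r}")
    return SetrConfigName(
        head=match["head"],
        crop_size=(int(match["a"]), int(match["b"])),
        iterations=int(match["iters"]) * 1000,  # "160k" -> 160000
        batch_size=int(match["batch"]),
        dataset=match["dataset"],
    )


if __name__ == "__main__":
    print(parse_config_name("setr_mla_512x512_160k_b8_ade20k.py"))
```

A name that does not fit the assumed pattern raises `ValueError` rather than returning partially filled fields, so a silent mismatch with a differently named config is less likely.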

README.md

Rethinking Semantic Segmentation from a Sequence-to-Sequence Perspective with Transformers

Introduction

@article{zheng2020rethinking,
  title={Rethinking Semantic Segmentation from a Sequence-to-Sequence Perspective with Transformers},
  author={Zheng, Sixiao and Lu, Jiachen and Zhao, Hengshuang and Zhu, Xiatian and Luo, Zekun and Wang, Yabiao and Fu, Yanwei and Feng, Jianfeng and Xiang, Tao and Torr, Philip HS and others},
  journal={arXiv preprint arXiv:2012.15840},
  year={2020}
}

Results and models

ADE20K

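| Method | Backbone | Crop Size | Batch Size | Lr schd (iters) | Mem (GB) | Inf time (fps) | mIoU | mIoU (ms+flip) | config | download |
| --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- |
| SETR-Naive | ViT-L | 512x512 | 16 | 160000 | 18.40 | 4.72 | 48.28 | 49.56 | config | model \| log |
| SETR-PUP | ViT-L | 512x512 | 16 | 160000 | 19.54 | 4.50 | 48.24 | 49.99 | config | model \| log |
| SETR-MLA | ViT-L | 512x512 | 8 | 160000 | 10.96 | - | 47.34 | 49.05 | config | model \| log |
| SETR-MLA | ViT-L | 512x512 | 16 | 160000 | 17.30 | 5.25 | 47.54 | 49.37 | config | model \| log |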
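
To make the two mIoU columns easier to compare, the short sketch below computes the difference between them for each row of the ADE20K results. The numbers are copied from the table; reading "ms+flip" as multi-scale plus flip test-time augmentation is an assumption, and the row labels with batch sizes in parentheses are only there to keep the two SETR-MLA rows apart.

```python
# (mIoU, mIoU with ms+flip) per row, copied from the ADE20K results table.
results = {
    "SETR-Naive (b16)": (48.28, 49.56),
    "SETR-PUP (b16)": (48.24, 49.99),
    "SETR-MLA (b8)": (47.34, 49.05),
    "SETR-MLA (b16)": (47.54, 49.37),
}

# Gain in mIoU points between the two columns, rounded to two decimals
# to hide floating-point noise from the subtraction.
gains = {
    name: round(augmented - plain, 2)
    for name, (plain, augmented) in results.items()
}

# Row with the largest difference between the two columns.
largest = max(gains, key=gains.get)

if __name__ == "__main__":
    for name, gain in gains.items():
        print(f"{name}: +{gain:.2f} mIoU")
    print(f"largest gain: {largest}")
```

On these figures the gains range from 1.28 points (SETR-Naive) to 1.83 points (SETR-MLA, batch size 16), so every variant improves by more than one mIoU point in the ms+flip column.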