History

谢昕辰 119bbd838d [Enhancement] Delete convert function and add instruction to ViT/Swin README.md (#791 ) * delete convert function and add instruction to README.md * unified model convert and README * remove url * fix import error * fix unittest * rename pretrain * rename vit and deit pretrain * Update upernet_deit-b16_512x512_160k_ade20k.py * Update upernet_deit-b16_512x512_80k_ade20k.py * Update upernet_deit-b16_ln_mln_512x512_160k_ade20k.py * Update upernet_deit-b16_mln_512x512_160k_ade20k.py * Update upernet_deit-s16_512x512_160k_ade20k.py * Update upernet_deit-s16_512x512_80k_ade20k.py * Update upernet_deit-s16_ln_mln_512x512_160k_ade20k.py * Update upernet_deit-s16_mln_512x512_160k_ade20k.py Co-authored-by: Jiarui XU <xvjiarui0826@gmail.com> Co-authored-by: Junjun2016 <hejunjun@sjtu.edu.cn>		2021-08-25 15:00:41 -07:00
..
README.md	[Feature] Official implementation of SETR (#531 )	2021-06-23 09:39:29 -07:00
setr.yml	Fix random behavior of update_model_index in pre-commit hook (#784 )	2021-08-15 23:33:08 +08:00
setr_mla_512x512_160k_b8_ade20k.py	[Enhancement] Delete convert function and add instruction to ViT/Swin README.md (#791 )	2021-08-25 15:00:41 -07:00
setr_mla_512x512_160k_b16_ade20k.py	[Feature] Official implementation of SETR (#531 )	2021-06-23 09:39:29 -07:00
setr_naive_512x512_160k_b16_ade20k.py	[Enhancement] Delete convert function and add instruction to ViT/Swin README.md (#791 )	2021-08-25 15:00:41 -07:00
setr_pup_512x512_160k_b16_ade20k.py	[Enhancement] Delete convert function and add instruction to ViT/Swin README.md (#791 )	2021-08-25 15:00:41 -07:00

README.md

Rethinking Semantic Segmentation from a Sequence-to-Sequence Perspective with Transformers

Introduction

@article{zheng2020rethinking,
  title={Rethinking Semantic Segmentation from a Sequence-to-Sequence Perspective with Transformers},
  author={Zheng, Sixiao and Lu, Jiachen and Zhao, Hengshuang and Zhu, Xiatian and Luo, Zekun and Wang, Yabiao and Fu, Yanwei and Feng, Jianfeng and Xiang, Tao and Torr, Philip HS and others},
  journal={arXiv preprint arXiv:2012.15840},
  year={2020}
}

Results and models

ADE20K

Method	Backbone	Crop Size	Batch Size	Lr schd	Mem (GB)	Inf time (fps)	mIoU	mIoU(ms+flip)	config	download
SETR-Naive	ViT-L	512x512	16	160000	18.40	4.72	48.28	49.56	config	model \| log
SETR-PUP	ViT-L	512x512	16	160000	19.54	4.50	48.24	49.99	config	model \| log
SETR-MLA	ViT-L	512x512	8	160000	10.96	-	47.34	49.05	config	model \| log
SETR-MLA	ViT-L	512x512	16	160000	17.30	5.25	47.54	49.37	config	model \| log