History

Ze Liu b6c7c77a08 [WIP] Add Swin Transformer (#511 ) * add Swin Transformer * add Swin Transformer * fixed import * Add some swin training settings. * Fix some filename error. * Fix attribute name: pretrain -> pretrained * Upload mmcls implementation of swin transformer. * Refactor Swin Transformer to follow mmcls style. * Refactor init_weigths of swin_transformer.py * Fix lint * Match inference precision * Add some comments * Add swin_convert to load official style ckpt * Remove arg: auto_pad * 1. Complete comments for each block; 2. Correct weight convert function; 3. Fix the pad of Patch Merging; * Clean function args. * Fix vit unit test. * 1. Add swin transformer unit tests; 2. Fix some pad bug; 3. Modify config to adapt new swin implementation; * Modify config arg * Update readme.md of swin * Fix config arg error and Add some swin benchmark msg. * Add MeM and ms test content for readme.md of swin transformer. * Fix doc string of swin module * 1. Register swin transformer to model list; 2. Modify pth url which keep meta attribute; * Update swin.py * Merge config settings. * Modify config style. * Update README.md Add ViT link * Modify main readme.md Co-authored-by: Jiarui XU <xvjiarui0826@gmail.com> Co-authored-by: sennnnn <201730271412@mail.scut.edu.cn> Co-authored-by: Junjun2016 <hejunjun@sjtu.edu.cn>		2021-07-01 23:41:55 +08:00
..
README.md	[WIP] Add Swin Transformer (#511 )	2021-07-01 23:41:55 +08:00
upernet_swin_base_patch4_window7_512x512_160k_ade20k_pretrain_224x224_1K.py	[WIP] Add Swin Transformer (#511 )	2021-07-01 23:41:55 +08:00
upernet_swin_base_patch4_window7_512x512_160k_ade20k_pretrain_224x224_22K.py	[WIP] Add Swin Transformer (#511 )	2021-07-01 23:41:55 +08:00
upernet_swin_base_patch4_window12_512x512_160k_ade20k_pretrain_384x384_1K.py	[WIP] Add Swin Transformer (#511 )	2021-07-01 23:41:55 +08:00
upernet_swin_base_patch4_window12_512x512_160k_ade20k_pretrain_384x384_22K.py	[WIP] Add Swin Transformer (#511 )	2021-07-01 23:41:55 +08:00
upernet_swin_small_patch4_window7_512x512_160k_ade20k_pretrain_224x224_1K.py	[WIP] Add Swin Transformer (#511 )	2021-07-01 23:41:55 +08:00
upernet_swin_tiny_patch4_window7_512x512_160k_ade20k_pretrain_224x224_1K.py	[WIP] Add Swin Transformer (#511 )	2021-07-01 23:41:55 +08:00

README.md

Swin Transformer: Hierarchical Vision Transformer using Shifted Windows

Introduction

[ALGORITHM]

@article{liu2021Swin,
  title={Swin Transformer: Hierarchical Vision Transformer using Shifted Windows},
  author={Liu, Ze and Lin, Yutong and Cao, Yue and Hu, Han and Wei, Yixuan and Zhang, Zheng and Lin, Stephen and Guo, Baining},
  journal={arXiv preprint arXiv:2103.14030},
  year={2021}
}

Results and models

ADE20K

Method	Backbone	Crop Size	pretrain	pretrain img size	Batch Size	Lr schd	Mem (GB)	Inf time (fps)	mIoU	mIoU(ms+flip)	config	download
UperNet	Swin-T	512x512	ImageNet-1K	224x224	16	160000	5.02	21.06	44.41	45.79	config	model \| log
UperNet	Swin-S	512x512	ImageNet-1K	224x224	16	160000	6.17	14.72	47.72	49.24	config	model \| log
UperNet	Swin-B	512x512	ImageNet-1K	224x224	16	160000	7.61	12.65	47.99	49.57	config	model \| log
UperNet	Swin-B	512x512	ImageNet-22K	224x224	16	160000	-	-	50.31	51.9	config	model \| log
UperNet	Swin-B	512x512	ImageNet-1K	384x384	16	160000	8.52	12.10	48.35	49.65	config	model \| log
UperNet	Swin-B	512x512	ImageNet-22K	384x384	16	160000	-	-	50.76	52.4	config	model \| log