mmsegmentation/configs/swin
Ze Liu b6c7c77a08
[WIP] Add Swin Transformer (#511)
* add Swin Transformer

* add Swin Transformer

* fixed import

* Add some swin training settings.

* Fix some filename error.

* Fix attribute name: pretrain -> pretrained

* Upload mmcls implementation of swin transformer.

* Refactor Swin Transformer to follow mmcls style.

* Refactor init_weights of swin_transformer.py

* Fix lint

* Match inference precision

* Add some comments

* Add swin_convert to load official style ckpt

* Remove arg: auto_pad

* 1. Complete comments for each block;
  2. Correct weight convert function;
  3. Fix the pad of Patch Merging;

* Clean function args.

* Fix vit unit test.

* 1. Add swin transformer unit tests;
  2. Fix some pad bug;
  3. Modify config to adapt new swin implementation;

* Modify config arg

* Update readme.md of swin

* Fix config arg error and Add some swin benchmark msg.

* Add Mem and ms test content for readme.md of swin transformer.

* Fix doc string of swin module

* 1. Register swin transformer to model list;
  2. Modify pth url which keep meta attribute;

* Update swin.py

* Merge config settings.

* Modify config style.

* Update README.md
  Add ViT link

* Modify main readme.md

Co-authored-by: Jiarui XU <xvjiarui0826@gmail.com>
Co-authored-by: sennnnn <201730271412@mail.scut.edu.cn>
Co-authored-by: Junjun2016 <hejunjun@sjtu.edu.cn>
2021-07-01 23:41:55 +08:00
README.md
upernet_swin_base_patch4_window7_512x512_160k_ade20k_pretrain_224x224_1K.py
upernet_swin_base_patch4_window7_512x512_160k_ade20k_pretrain_224x224_22K.py
upernet_swin_base_patch4_window12_512x512_160k_ade20k_pretrain_384x384_1K.py
upernet_swin_base_patch4_window12_512x512_160k_ade20k_pretrain_384x384_22K.py
upernet_swin_small_patch4_window7_512x512_160k_ade20k_pretrain_224x224_1K.py
upernet_swin_tiny_patch4_window7_512x512_160k_ade20k_pretrain_224x224_1K.py
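
The config file names above appear to follow one pattern: decode head, backbone size, patch size, window size, crop size, schedule, dataset, then the pretraining resolution and dataset. The sketch below splits a name into those fields. The regular expression is inferred only from the six names in this directory, and `parse_config_name` is a hypothetical helper, not part of mmsegmentation.

```python
import re

# Pattern inferred from the file names in this directory (an assumption,
# not an official naming specification).
CONFIG_NAME = re.compile(
    r"(?P<head>[a-z]+)_swin_(?P<size>tiny|small|base)"
    r"_patch(?P<patch>\d+)_window(?P<window>\d+)"
    r"_(?P<crop>\d+x\d+)_(?P<schedule>\d+k)_(?P<dataset>[a-z0-9]+)"
    r"_pretrain_(?P<pretrain_size>\d+x\d+)_(?P<pretrain_data>1K|22K)\.py"
)


def parse_config_name(filename):
    """Split a Swin config file name into its named fields.

    Hypothetical helper for illustration; raises ValueError when the
    name does not follow the inferred pattern.
    """
    match = CONFIG_NAME.fullmatch(filename)
    if match is None:
        raise ValueError(f"unrecognised config name: {filename}")
    fields = match.groupdict()
    # Patch and window sizes are plain integers; convert them.
    fields["patch"] = int(fields["patch"])
    fields["window"] = int(fields["window"])
    return fields


if __name__ == "__main__":
    name = ("upernet_swin_base_patch4_window12_512x512_160k_ade20k"
            "_pretrain_384x384_22K.py")
    for key, value in parse_config_name(name).items():
        print(f"{key}: {value}")
```

In these names, window size 7 is always paired with 224x224 pretraining and window size 12 with 384x384 pretraining.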

README.md

Swin Transformer: Hierarchical Vision Transformer using Shifted Windows

Introduction

[ALGORITHM]

@article{liu2021Swin,
  title={Swin Transformer: Hierarchical Vision Transformer using Shifted Windows},
  author={Liu, Ze and Lin, Yutong and Cao, Yue and Hu, Han and Wei, Yixuan and Zhang, Zheng and Lin, Stephen and Guo, Baining},
  journal={arXiv preprint arXiv:2103.14030},
  year={2021}
}

Results and models

ADE20K

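| Method | Backbone | Crop Size | pretrain | pretrain img size | Batch Size | Lr schd | Mem (GB) | Inf time (fps) | mIoU | mIoU(ms+flip) | config | download |
| ------ | -------- | --------- | -------- | ----------------- | ---------- | ------- | -------- | -------------- | ---- | ------------- | ------ | -------- |
| UperNet | Swin-T | 512x512 | ImageNet-1K | 224x224 | 16 | 160000 | 5.02 | 21.06 | 44.41 | 45.79 | config | model \| log |
| UperNet | Swin-S | 512x512 | ImageNet-1K | 224x224 | 16 | 160000 | 6.17 | 14.72 | 47.72 | 49.24 | config | model \| log |
| UperNet | Swin-B | 512x512 | ImageNet-1K | 224x224 | 16 | 160000 | 7.61 | 12.65 | 47.99 | 49.57 | config | model \| log |
| UperNet | Swin-B | 512x512 | ImageNet-22K | 224x224 | 16 | 160000 | - | - | 50.31 | 51.9 | config | model \| log |
| UperNet | Swin-B | 512x512 | ImageNet-1K | 384x384 | 16 | 160000 | 8.52 | 12.10 | 48.35 | 49.65 | config | model \| log |
| UperNet | Swin-B | 512x512 | ImageNet-22K | 384x384 | 16 | 160000 | - | - | 50.76 | 52.4 | config | model \| log |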
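
To compare the single-scale and ms+flip columns, the sketch below computes the mIoU gain from multi-scale plus flip testing for each row. The numbers are copied from the ADE20K results above. `ROWS` and `ms_flip_gains` are hypothetical names used only for this illustration.

```python
# (backbone, pretrain data, pretrain img size, mIoU, mIoU with ms+flip),
# copied from the ADE20K results table.
ROWS = [
    ("Swin-T", "ImageNet-1K", "224x224", 44.41, 45.79),
    ("Swin-S", "ImageNet-1K", "224x224", 47.72, 49.24),
    ("Swin-B", "ImageNet-1K", "224x224", 47.99, 49.57),
    ("Swin-B", "ImageNet-22K", "224x224", 50.31, 51.9),
    ("Swin-B", "ImageNet-1K", "384x384", 48.35, 49.65),
    ("Swin-B", "ImageNet-22K", "384x384", 50.76, 52.4),
]


def ms_flip_gains(rows):
    """Return (label, gain) pairs, where gain is ms+flip mIoU minus mIoU."""
    gains = []
    for backbone, pretrain, img_size, miou, miou_ms in rows:
        label = f"{backbone} / {pretrain} / {img_size}"
        # Round to two decimals to match the precision of the table.
        gains.append((label, round(miou_ms - miou, 2)))
    return gains


if __name__ == "__main__":
    for label, gain in ms_flip_gains(ROWS):
        print(f"{label}: +{gain:.2f} mIoU")
```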