* Support DeiT backbone.
* Use hook to automatically resize pos embed
* Update ViT training setting
* Add deit configs and update docs
* Fix vit arch assertion
* Remove useless init function
* Add unit tests.
* Fix resize_pos_embed for DeiT
* Improve according to comments.
* [Squash] Refactor ViT (from #295)
* Use base variable to simplify auto_aug setting
* Use common PatchEmbed, remove HybridEmbed and refactor ViT init
  structure.
* Add `output_cls_token` option and change the output format of ViT and
  input format of ViT head.
* Update unit tests and add test for `output_cls_token`.
* Support out_indices.
* Standardize config files
* Support resize position embedding.
* Add readme file of vit
* Rename config file
* Improve docs about ViT.
* Update docstring
* Use local version `MultiheadAttention` instead of mmcv version.
* Fix MultiheadAttention
* Support `qk_scale` argument in `MultiheadAttention`
* Improve docs and change `layer_cfg` to `layer_cfgs` and support
  sequence.
* Use init_cfg to init Linear layer in VisionTransformerHead
* update metafile
* Update checkpoints and configs
* Improve docstring.
* Update README
* Revert GAP modification.
* Add API inference in the docs and fix readthedocs config.
* Replace some relative link in docs.
* Format docstring for reStructuredText syntax.
* Fix vit paper link
* Fix docstring of `show_results` function in `BaseClassifier`.
* add imagenet bs 4096
* add vit_base_patch16_224_finetune
* add vit_base_patch16_224_pretrain
* add vit_base_patch16_384_finetune
* add vit_b_p16_224_finetune_imagenet
* add vit_b_p16_224_pretrain_imagenet
* add vit_b_p16_384_finetune_imagenet
* add vit
* add vit head
* vit unittest
* keep up with ClsHead
* test vit
* add flag to determine whether to calculate acc during training
* Changes related to mmcv1.3.0
* change checkpoint saving interval to 10
* add label smooth
* default_runtime.py recovery
* docformatter
* delete 2 lines of comments
* delete configs/_base_/schedules/imagenet_bs4096.py
* add configs/_base_/schedules/imagenet_bs2048_AdamW.py
* rename imagenet_bs4096.py to imagenet_bs2048_AdamW.py
* add AutoAugment
* fix weight decay in vit
* change eval interval to 10
* add mytrain.py for test
* test before layers
* test attr in layers
* test classifier
* delete mytrain.py
* delete @torch.jit.ignore
* change eval interval back to 1
* add some comments to imagenet_bs2048_AdamW
* add some comments