diff --git a/configs/selfsup/mae/README.md b/configs/selfsup/mae/README.md index 41599b7a..caeec51c 100644 --- a/configs/selfsup/mae/README.md +++ b/configs/selfsup/mae/README.md @@ -29,12 +29,118 @@ methods that use only ImageNet-1K data. Transfer performance in downstream tasks ## Models and Benchmarks -Here, we report the results of the model, which is pre-trained on ImageNet-1k -for 400 epochs, the details are below: - -| Backbone | Pre-train epoch | Fine-tuning Top-1 | Pre-train Config | Fine-tuning Config | Download | -| :------: | :-------------: | :---------------: | :-----------------------------------------------------------------------------------------------------------------------: | :------------------------------------------------------------------------------------------------------------------------------------------------: | :-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------: | -| ViT-B/16 | 400 | 83.1 | [config](https://github.com/open-mmlab/mmselfsup/blob/master/configs/selfsup/mae/mae_vit-b-p16_8xb512-coslr-400e_in1k.py) | [config](https://github.com/open-mmlab/mmselfsup/blob/master/configs/benchmarks/classification/imagenet/vit-base-p16_ft-8xb128-coslr-100e_in1k.py) | [model](https://download.openmmlab.com/mmselfsup/mae/mae_vit-base-p16_8xb512-coslr-400e_in1k-224_20220223-85be947b.pth) \| [log](https://download.openmmlab.com/mmselfsup/mae/mae_vit-base-p16_8xb512-coslr-300e_in1k-224_20220210_140925.log.json) | +
Algorithm | +Backbone | +Epoch | +Batch Size | +Results (Top-1 %) | +Links | +|||
---|---|---|---|---|---|---|---|---|
Linear Eval | +Fine-tuning | +Pretrain | +Linear Eval | +Fine-tuning | +||||
MAE | +ViT-base | +300 | +4096 | +60.8 | +83.1 | +config | model | log | +config | model | log | +config | model | log | +
ViT-base | +400 | +4096 | +62.5 | +83.3 | +config | model | log | +config | model | log | +config | model | log | +|
ViT-base | +800 | +4096 | +65.1 | +83.3 | +config | model | log | +config | model | log | +config | model | log | +|
ViT-base | +1600 | +4096 | +67.1 | +83.5 | +config | model | log | +config | model | log | +config | model | log | +|
ViT-large | +400 | +4096 | +70.7 | +85.2 | +config | model | log | +config | model | log | +config | model | log | +|
ViT-large | +800 | +4096 | +73.7 | +85.4 | +config | model | log | +config | model | log | +config | model | log | +|
ViT-large | +1600 | +4096 | +75.5 | +85.7 | +config | model | log | +config | model | log | +config | model | log | +|
ViT-huge-FT-224 | +1600 | +4096 | +/ | +86.9 | +config | model | log | +/ | +config | model | log | +|
ViT-huge-FT-448 | +1600 | +4096 | +/ | +87.3 | +config | model | log | +/ | +config | model | log | +