* fix the resolution problem for clip-vision transformer part and swim transformer
fix the resolution problem for clip-vision transformer part and swim transformer
* Revert "Revert "fix resolution problem for swin transformer and clip vit (#3021)""
This reverts commit 174db431a82fb168c01b0be03fbb1d822314bbb1.
Update foundation_vit.py
Update foundation_vit.py
Revert "fix resolution problem for swin transformer and clip vit (#3021)"
This reverts commit 61f748de67.
* add backbone function
* fix static graph problem
* remove text encoder framework and add classifier header directly
* fix bug in clip when using classifier header
* updated
* support embeding
Note, support embed is only for check the model since there is no related text encoder
* compatible with param transfer
* update setting
* Update foundation_vit.py
Update .gitignore
fix time cost problem
Update swin_transformer.py
fix the speed and memory problem
reduce the unnecessary calculation when patch matches resolution
fix conflict
remove check resolution function
Revert "fix conflict"
This reverts commit d7a7dade71.
fix conflict
remove the conflict checkpoint function
【Hackathon 5th No.69】 分类大模型--人体视觉任务SOLIDER (#2995)
* add_solider
* add_solider
* add_solider
* add_solider
* add_solider
* add_solider
* add_solider
* add_solider
* add_solider
* add_solider
* add_solider
update doc about PPHGNetV2 (#3002)
fix clip patch embedding resolution problem
support non 224 resolution
integrate the pading function to one
adjust function name
fix the resolution problem for clip-vision transformer part and swim transformer
fix the resolution problem for clip-vision transformer part and swim transformer
* fix cache problem
using the huggingface plan and drop the cache
* Revert "fix cache problem"
This reverts commit 8f7ab55c75.
* fix resolution problem
* update big model backbone
* Revert "update big model backbone"
This reverts commit 04a39f701b.
* fix the resolution problem for clip-vision transformer part and swim transformer
fix the resolution problem for clip-vision transformer part and swim transformer
* adjust function name
* integrate the pading function to one
* support non 224 resolution
* fix clip patch embedding resolution problem
* fix conflict
remove the conflict checkpoint function
* Revert "fix conflict"
This reverts commit d7a7dade71.
* fix conflict
remove check resolution function
1. fix bias_attr to False for conv of PatchEmbed;
2. support return_tokens_mean for Head of CLIP;
3. support remove_cls_token_in_forward for CLIP;
4. support head_init_scale argument for ViT backbone;
5. support get_num_layers() and no_weight_decay() for ViT backbone.