177 Commits

Author SHA1 Message Date
Sunflower7788
ddda8b688e
add 3 new models : MV4 FasterNet StarNet (#3251) 2024-09-21 00:26:38 +08:00
cuicheng01
de0f57521d
update CLIP configs for PP-ShiTuV2-rec (#3239) 2024-09-05 10:49:04 +08:00
Tingquan Gao
91e8eb3632
update to be compatible with V100 (#3178) 2024-07-05 14:52:32 +08:00
Tingquan Gao
b1ee8f911b
update to be compatible with V100 (#3177) 2024-07-04 16:56:04 +08:00
gaotingquan
0bfed92cb2 perf:
1. use nn.GELU instead of QuickGELU
2. support FusedLinear
2024-05-29 11:24:48 +08:00
gaotingquan
40042f89fa dbg: support fused attn 2024-05-24 14:27:29 +08:00
Tingquan Gao
e3aaa3cefb
support fused attn (#3131) 2024-05-16 13:33:46 +08:00
Wang Huan
5b06e18b24 refine drop_path 2024-05-07 14:20:10 +08:00
wanghuancoder
80abf9f789
use tensor.shape but not paddle.shape(tensor) (#3120) 2024-04-12 15:34:44 +08:00
sky
1ded6d1cbd
Update svtrnet.py (#3037)
* Update svtrnet.py

* Update svtrnet.py

remove test part

* isolated model

* Update svtrnet.py

* support static graph

* try to fix acc problem by removing torch format

* fix secret bug for losing accuracy

* Revert "fix secret bug for losing accuracy"

This reverts commit c8226e739005c7ebd0c0ca3772c568bccd958c4f.

* fix secret bug for losing accuracy metric

* Delete launch.json

* Update svtrnet.py

* fix dynamic-to-static conversion

* Update svtrnet.py
2024-02-06 21:50:38 +08:00
sky
276e90d9a1
Bigmodel (#3032)
* fix the resolution problem for clip-vision transformer part and swin transformer

fix the resolution problem for clip-vision transformer part and swin transformer

* Revert "Revert "fix resolution problem for swin transformer and clip vit  (#3021)""

This reverts commit 174db431a82fb168c01b0be03fbb1d822314bbb1.

Update foundation_vit.py

Update foundation_vit.py

Revert "fix resolution problem for swin transformer and clip vit  (#3021)"

This reverts commit 61f748de6767daa2ee8c243146b2204689ecd1ba.

* add backbone function

* fix static graph problem

* remove text encoder framework and add classifier header directly

* fix bug in clip when using classifier header

* updated

* support embedding

Note: embedding support is only for checking the model, since there is no related text encoder

* compatible with param transfer

* update setting
2024-02-06 21:46:25 +08:00
zhangyubo0722
adedfe603c
[WIP] add svtr (#3035)
* [WIP] add svtr

* add svtr_base and svtr_tiny
2023-11-06 19:15:55 +08:00
sky
61f748de67
fix resolution problem for swin transformer and clip vit (#3021)
* Update foundation_vit.py

Update .gitignore

fix time cost problem

Update swin_transformer.py

fix the speed and memory problem

reduce the unnecessary calculation when patch matches resolution

fix conflict

remove check resolution function

Revert "fix conflict"

This reverts commit d7a7dade71ea5f9a6ac178e113dbcbd65d5a64b3.

fix conflict

remove the conflict checkpoint function

【Hackathon 5th No.69】 Large classification model: human vision task SOLIDER (#2995)

* add_solider

* add_solider

* add_solider

* add_solider

* add_solider

* add_solider

* add_solider

* add_solider

* add_solider

* add_solider

* add_solider

update doc about PPHGNetV2 (#3002)

fix clip patch embedding resolution problem

support non 224 resolution

integrate the padding functions into one

adjust function name

fix the resolution problem for clip-vision transformer part and swin transformer

fix the resolution problem for clip-vision transformer part and swin transformer

* fix cache problem

using the huggingface plan and drop the cache

* Revert "fix cache problem"

This reverts commit 8f7ab55c75d8c16443506a975c21967de8c038f2.

* fix resolution problem

* update big model backbone

* Revert "update big model backbone"

This reverts commit 04a39f701b62120bbfa20f6526a0589c33c21e47.
2023-10-31 10:11:46 +08:00
zhangyubo0722
aae1e9543f
del load pretrained from url for resnet (#2997)
* del load pretrained from url for resnet

* del load_dygraph_pretrain_from_url

* del load_dygraph_pretrain_from_url

* modify save_load
2023-10-30 13:44:16 +08:00
sky
e1a7840816
【Feature】fix the resolution problem for clip-vision transformer part and swin … (#3001)
* fix the resolution problem for clip-vision transformer part and swin transformer

fix the resolution problem for clip-vision transformer part and swin transformer

* adjust function name

* integrate the padding functions into one

* support non 224 resolution

* fix clip patch embedding resolution problem

* fix conflict

remove the conflict checkpoint function

* Revert "fix conflict"

This reverts commit d7a7dade71ea5f9a6ac178e113dbcbd65d5a64b3.

* fix conflict

remove check resolution function
2023-10-18 20:55:37 +08:00
xiongkun
2200f30052
fix sot-slow problem (#2976)
* fix sot-slow problem

* fix format

* add comment
2023-09-18 17:27:01 +08:00
feifei-111
9bcb71ab81
Fix resnext model, split main logic with guard (#2956)
* update

* update

* update
2023-09-18 17:25:57 +08:00
zhangyubo0722
ed67436647
del head_init_scale (#2947) 2023-09-01 20:15:54 +08:00
gaotingquan
cf5d629a64 fix 2023-06-06 11:19:01 +08:00
gaotingquan
4643fdee09 update pretrained url 2023-06-06 11:19:01 +08:00
gaotingquan
2823e48be5 fix head_init_scale 2023-05-26 15:40:48 +08:00
gaotingquan
6d924f85ee fix for clip
1. fix bias_attr to False for conv of PatchEmbed;
2. support return_tokens_mean for Head of CLIP;
3. support remove_cls_token_in_forward for CLIP;
4. support head_init_scale argument for ViT backbone;
5. support get_num_layers() and no_weight_decay() for ViT backbone.
2023-05-26 15:40:48 +08:00
gaotingquan
bb831c3baa code style 2023-05-17 15:19:13 +08:00
gaotingquan
07b9162bc0 fix pretrained url 2023-05-17 15:19:13 +08:00
gaotingquan
2091a59ff5 fix reference url 2023-05-17 15:19:13 +08:00
gaotingquan
fc9c59c4b1 update pretrained url 2023-05-17 15:19:13 +08:00
Yang Nie
c351dac67e add tinynet 2023-05-06 19:04:37 +08:00
zh-hike
d7bd275379 update foundation_vit from EVA_vit_huge to EVA_vit_giant 2023-04-23 10:16:08 +08:00
Yang Nie
cdd3c3a05c clear type hint 2023-04-19 21:21:06 +08:00
Yang Nie
692204eee6 fix code style 2023-04-19 21:21:06 +08:00
Yang Nie
deb8e98779 rename v2 to V2 2023-04-19 21:21:06 +08:00
Yang Nie
be6a22be18 add MobileViTv2 2023-04-19 21:21:06 +08:00
Yang Nie
b8a1589377 update data augment and init method for MobileViTv3-v2 2023-04-06 15:33:30 +08:00
Yang Nie
de4129baa6 update 2023-04-06 15:33:30 +08:00
Yang Nie
dc4fdba0ab add MobileViTv3 2023-04-06 15:33:30 +08:00
Yang Nie
beca8b2c1b add mobilenext
add cooldown config

update optimizer

fix ParamAttr & update  test_tipc

fix tipc

update tipc config

remove docs of `_make_divisible`

refactor the implementation of "no weight decay"

fix model name

remove cooldown config
2023-04-05 00:41:19 +08:00
Yang Nie
e0daf82dc0 rename micronet_m(\d) to MicroNet_M(\d) 2023-04-04 20:37:22 +08:00
Yang Nie
8a578a083e remove the comma at the end 2023-04-04 20:37:22 +08:00
Yang Nie
4962f71289 remove ChannelShuffle2 2023-04-04 20:37:22 +08:00
Yang Nie
a881c7a7fa remove useless comments 2023-04-04 20:37:22 +08:00
Yang Nie
d76defdefc fix import bug 2023-04-04 20:37:22 +08:00
Yang Nie
e262a5f64d add micronet 2023-04-04 20:37:22 +08:00
gaotingquan
6fdaf94a0d fix concat error when fp16 2023-04-04 19:49:00 +08:00
Yang Nie
1433161edd fix typo 2023-04-04 18:44:44 +08:00
Yang Nie
a2052232e6 add support for CvT_21_224, CvT_13_384, CvT_21_384 and CvT_W24_384 2023-04-04 18:44:44 +08:00
Yang Nie
4cfd2159e5 rename cvt_{depth}_{size}x{size} to CvT_{depth}_{size} 2023-04-04 18:44:44 +08:00
Yang Nie
d7a1127559 add CvT 2023-04-04 18:44:44 +08:00
gaotingquan
5c39dfa6ba rename gvt.py -> twins.py & twins-svt -> twins-alt-gvt 2023-03-30 17:29:49 +08:00
gaotingquan
0b3b621a81 fix concat error when fp16 2023-03-21 14:23:09 +08:00
gaotingquan
4e988692dd fix concat error when fp16 2023-03-21 14:23:09 +08:00