Commit Graph

1065 Commits (17a47c0e351e1ca432c9a967fcd4f30b02e02118)

Author SHA1 Message Date
Yassine 5c504b4ded flip these two 2023-09-27 10:24:12 -07:00
Yassine 8ba2038e6b fast_vit: propagate act_layer argument 2023-09-27 10:24:12 -07:00
Nguyen Nhat Hoang 95ba90157f Update tiny_vit.py to fix bug 2023-09-23 10:05:52 -07:00
belfner 245ad4f41a Added missing RuntimeError to builder functions of models that do not currently support feature extraction 2023-09-19 08:19:14 -07:00
Thorsten Hempel d4c21b95f4 Update repghost.py 2023-09-15 11:41:56 -07:00
Thorsten Hempel 7eb7d13845 Fix in_features for linear layer in reset_classifier. 2023-09-13 09:29:38 -07:00
Ross Wightman 0d124ffd4f Update README. Fine-grained layer-wise lr decay working for tiny_vit and both efficientvits. Minor fixes. 2023-09-01 15:05:29 -07:00
Ross Wightman 2f0fbb59b3 TinyViT weights on HF hub 2023-09-01 11:05:56 -07:00
Ross Wightman 507cb08acf TinyVitBlock needs adding as leaf for FX now, tweak a few dim names 2023-09-01 11:05:56 -07:00
Ross Wightman 9caf32b93f Move levit style pos bias resize with other rel pos bias utils 2023-09-01 11:05:56 -07:00
Ross Wightman 63417b438f TinyViT adjustments
* keep most of net in BCHW layout, performance appears same, can remove static resolution attribs and features easier to use
* add F.sdpa, decent gains in pt 2.1
* tweak crop pct based on eval
2023-09-01 11:05:56 -07:00
方曦 39aa44b192 Fixing tinyvit trace issue 2023-09-01 11:05:56 -07:00
方曦 aea3b9c854 Fixing tinyvit input_size issue 2023-09-01 11:05:56 -07:00
方曦 fabc4e5bcd Fixing tinyvit torchscript issue 2023-09-01 11:05:56 -07:00
方曦 bae949f830 fix attention_bias_cache in tinyvit 2023-09-01 11:05:56 -07:00
方曦 170a5b6e27 add tinyvit 2023-09-01 11:05:56 -07:00
Ross Wightman 983310d6a2 Fix #1935 , torch.amp.autocast -> torch.autocast (namespace issue for 1.10 pt compat) 2023-08-30 15:03:28 -07:00
Ross Wightman c8b2f28096 Fix a few typos, fix fastvit proj_drop, add code link 2023-08-28 21:26:29 -07:00
Ross Wightman fc5d705b83 dynamic_size -> dynamic_img_size, add dynamic_img_pad for padding option 2023-08-27 15:58:35 -07:00
Ross Wightman 1f4512fca3 Support dynamic_resize in eva.py models 2023-08-27 15:58:35 -07:00
Ross Wightman ea3519a5f0 Fix dynamic_resize for deit models (distilled or no_embed_cls) and vit w/o class tokens 2023-08-27 15:58:35 -07:00
Ross Wightman 4d8ecde6cc Fix torchscript for vit-hybrid dynamic_resize 2023-08-27 15:58:35 -07:00
Ross Wightman fdd8c7c2da Initial impl of dynamic resize for existing vit models (incl vit-resnet hybrids) 2023-08-27 15:58:35 -07:00
Ross Wightman 5d599a6a10 RepViT weights on HF hub 2023-08-25 10:39:02 -07:00
Ross Wightman 56c285445c Wrong pool size for 384x384 inception_next_base 2023-08-24 18:31:44 -07:00
Ross Wightman af9f56f3bf inception_next dilation support, weights on hf hub, classifier reset / global pool / no head fixes 2023-08-24 18:31:44 -07:00
Ross Wightman 2d33b9df6c Add features_only support to inception_next 2023-08-24 18:31:44 -07:00
Ross Wightman 3d8d7450ad InceptionNeXt using timm builder, more cleanup 2023-08-24 18:31:44 -07:00
Ross Wightman f4cf9775c3 Adding InceptionNeXt 2023-08-24 18:31:44 -07:00
Ross Wightman d6c348765a Fix first_conv for mobileone and fastvit 2023-08-23 22:50:37 -07:00
Ross Wightman 16334e4bec Fix two fastvit issues 2023-08-23 22:50:37 -07:00
Ross Wightman 5242ba6edc MobileOne and FastViT weights on HF hub, more code cleanup and tweaks, features_only working. Add reparam flag to validate and benchmark, support reparm of all models with fuse(), reparameterize() or switch_to_deploy() methods on modules 2023-08-23 22:50:37 -07:00
Ross Wightman 40dbaafef5 Stagify FastViT /w downsample to top of stage 2023-08-23 22:50:37 -07:00
Ross Wightman 8470eb1cb5 More fastvit & mobileone updates, ready for weight upload 2023-08-23 22:50:37 -07:00
Ross Wightman 8474508d07 More work on FastViT, use own impl of MobileOne, validation working with remapped weight, more refactor TODO 2023-08-23 22:50:37 -07:00
Ross Wightman c7a20cec13 Begin adding FastViT 2023-08-23 22:50:37 -07:00
Ross Wightman 7fd3674d0d Add mobileone and update repvgg 2023-08-23 22:50:37 -07:00
Ross Wightman 3055411c1b
Fix samvit bug, add F.sdpa support and ROPE option (#1920)
* Fix a bug I introduced in samvit, add F.sdpa support and ROPE option to samvit, neck is LayerNorm if not used and standard classifier used

* Add attn dropout to F.sdpa

* Fix fx trace for sam vit

* Fixing torchscript issues in samvit

* Another torchscript fix

* samvit head fc name fix
2023-08-20 21:22:59 -07:00
Ross Wightman 300f54a96f Another effcientvit (mit) tweak, fix torchscript/fx conflict with autocast disable 2023-08-20 15:07:25 -07:00
Ross Wightman dc18cda2e7 efficientvit (mit) msa attention q/k/v ops need to be in float32 to train w/o NaN 2023-08-20 11:49:36 -07:00
Ross Wightman be4e0d8f76 Update attrib comment to include v2 2023-08-19 23:39:09 -07:00
Ross Wightman 126a58e563 Combine ghostnetv2 with ghostnet, reduec redundancy, add weights to hf hub. 2023-08-19 23:33:43 -07:00
Ross Wightman 3f320a9e57 Merge branch 'Add-GhostNetV2' of github.com:yehuitang/pytorch-image-models into yehuitang-Add-GhostNetV2 2023-08-19 22:07:54 -07:00
Ross Wightman 7c2728c6fe
Merge pull request #1919 from ChengpengChen/main
Add RepGhost models and weights
2023-08-19 16:26:45 -07:00
Ross Wightman 69e0ca2e36 Weights on hf hub, bicubic yields slightly better eval 2023-08-19 16:25:45 -07:00
Ross Wightman b8011565bd
Merge pull request #1894 from seefun/master
add two different EfficientViT models
2023-08-19 09:24:14 -07:00
Ross Wightman 7d7589e8da Fixing efficient_vit torchscript, fx, default_cfg issues 2023-08-18 23:23:11 -07:00
Ross Wightman 58ea1c02c4 Add fixed_input_size flag to msra efficient_vit 2023-08-18 16:48:17 -07:00
Ross Wightman c28324a150 Update efficient_vit (msra), hf hub weights 2023-08-18 16:45:37 -07:00
Ross Wightman e700a32626 Cleanup of efficient_vit (mit), tweak eps for better AMP behaviour, formatting/cleanup, weights on hf hub 2023-08-18 16:06:07 -07:00
方曦 00f670fa69 fix bug in ci for efficientvits 2023-08-17 14:40:17 +08:00
Chengpeng Chen e7f97cb5ce Fix typos RepGhost models 2023-08-16 14:27:45 +08:00
Chengpeng Chen d1d0193615 Add RepGhost models and weights 2023-08-16 11:54:53 +08:00
Minseo Kang 7938f28542 Fix typo in efficientformer_v2 2023-08-16 03:29:01 +09:00
yehuitang b407794e3a
Add GhostNetV2 2023-08-13 18:20:27 +08:00
yehuitang fc865282e5
Add ghostnetv2.py 2023-08-13 18:16:26 +08:00
Ross Wightman da75cdd212
Merge pull request #1900 from huggingface/swin_maxvit_resize
Add support for resizing swin transformer, maxvit, coatnet at creation time
2023-08-11 15:05:28 -07:00
Ross Wightman 78a04a0e7d
Merge pull request #1911 from dsuess/1910-fixes-batchnormact-fx
Register norm_act layers as leaf modules
2023-08-11 14:34:16 -07:00
Yonghye Kwon 2048f6f20f
set self.num_features to neck_chans if neck_chans > 0 2023-08-11 13:45:06 +09:00
Ross Wightman 3a44e6c602 Fix #1912 CoaT model not loading w/ return_interm_layers 2023-08-10 11:15:58 -07:00
Daniel Suess 986de90360
Register orm_act layers as leaf modules 2023-08-10 15:37:26 +10:00
Ross Wightman c692715388 Some RepVit tweaks
* add head dropout to RepVit as all models have that arg
* default train to non-distilled head output via distilled_training flag (set_distilled_training) so fine-tune works by default w/o distillation script
* camel case naming tweaks to match other models
2023-08-09 12:41:12 -07:00
Ross Wightman c153cd4a3e Add more advanced interpolation method from BEiT and support non-square window & image size adaptation for
* beit/beit-v2
* maxxvit/coatnet
* swin transformer
And non-square windows for swin-v2
2023-08-08 16:41:16 -07:00
alec.tu bb2b6b5f09 fix num_classes not found 2023-08-07 15:16:03 +08:00
Ross Wightman 1dab536cb1 Fix torch.fx for swin padding change 2023-08-05 13:09:55 -07:00
Ross Wightman 7c0f492dbb Fix type annotation for torchscript 2023-08-04 23:03:52 -07:00
Ross Wightman 7790ea709b Add support for resizing swin transformer img_size and window_size on init and load from pretrained weights. Add support for non-square window_size to both swin v1/v2 2023-08-04 22:10:46 -07:00
Ross Wightman 81089b10a2 Remove unecessary LongTensor in EfficientFormer. Possibly maybe fix #1878 2023-08-03 16:38:53 -07:00
Ross Wightman d138a9bf88 Add gluon hrnet small weights, fix #1895 2023-08-03 12:15:04 -07:00
Ross Wightman 150356c493 Fix unfortunate selecsls case bug caused by aggressive IDE rename 2023-08-03 10:37:06 -07:00
Ross Wightman 6e8c53d0d3 Comment out beit url, no longer valid as now require long query string, leave for reference, must use HF hub now. 2023-08-03 10:00:46 -07:00
方曦 a56e2bbf19 fix efficientvit_msra pretrained load 2023-08-03 18:44:38 +08:00
方曦 e94c60b546 efficientvit_msra refactor 2023-08-03 17:45:50 +08:00
方曦 047bab6ab2 efficientvit_mit stage refactor 2023-08-03 14:59:35 +08:00
方曦 e8fb866ccf fix efficientvit_msra pool 2023-08-02 14:40:01 +08:00
方曦 43443f64eb fix efficientvits 2023-08-02 14:12:37 +08:00
方曦 82d1e99e1a add efficientvit(msra) 2023-08-01 18:51:08 +08:00
方曦 b91a77fab7 add EfficientVit (MIT) 2023-08-01 12:42:21 +08:00
Ross Wightman a9d0615f42 Fix ijepa vit issue with 448 model, minor formatting fixes 2023-07-26 20:46:27 -07:00
Ross Wightman 5874d1bfc7
Merge pull request #1876 from jameslahm/main
Add RepViT models
2023-07-26 14:38:41 -07:00
Ross Wightman b10310cc27 Add proper pool size for new resnexts 2023-07-26 14:36:03 -07:00
Ross Wightman b71d60cdb7 Two small fixes, num_classes in base class, add model tag 2023-07-26 13:18:49 -07:00
Ross Wightman 3561f8e885 Add seresnextaa201d_32x8d 12k and 1k weights 2023-07-26 13:17:05 -07:00
jameslahm 3318e7614d Add RepViT models 2023-07-21 14:56:53 +08:00
Ruslan Baikulov 158bf129c4 Replace deprecated NumPy aliases of builtin types 2023-07-03 22:24:25 +03:00
Ross Wightman c241081251
Merge pull request #1850 from huggingface/effnet_improve_features_only
Support other features only modes for EfficientNet. Fix #1848 fix #1849
2023-06-23 22:56:08 -07:00
Ross Wightman 47517dbefd Clean more feature extract issues
* EfficientNet/MobileNetV3/HRNetFeatures cls and FX mode support -ve index
* MobileNetV3 allows feature_cfg mode to bypass MobileNetV3Features
2023-06-14 14:46:22 -07:00
Ross Wightman a09c88ed0f Support other features only modes for EfficientNet 2023-06-14 12:57:39 -07:00
SeeFun c3f24a5ae5
‘add ViT weight from I-JEPA pretrain’ 2023-06-14 22:30:31 +08:00
Lengyue c308dbc6f2
update dinov2 layerscale init values 2023-05-24 12:20:17 -04:00
Ross Wightman e9373b1b92 Cleanup before samvit merge. Resize abs posembed on the fly, undo some line-wraps, remove redundant unbind, fix HF hub weight load 2023-05-18 16:43:48 -07:00
方曦 c1c6eeb909 fix loading pretrained weight for samvit 2023-05-18 08:49:29 +08:00
方曦 15de561f2c fix unit test for samvit 2023-05-17 12:51:12 +08:00
方曦 ea1f52df3e add ViT for Segment-Anything Model 2023-05-17 11:39:29 +08:00
Ross Wightman c5d3ee47f3 Add B/16 datacompxl CLIP weights 2023-05-16 11:27:20 -07:00
Philip Keller fc77e9ecc5
Update hub.py
fixed import of _hub modules
2023-05-12 21:48:46 +02:00
Ross Wightman f744bda994 use torch.jit.Final instead of Final for beit, eva 2023-05-12 09:12:14 -07:00
Ross Wightman 2e99bcaedd Update README, prep for version 0.9.0 release 2023-05-11 15:22:50 -07:00
Ross Wightman 3eaf729f3f F.sdpa for visformer fails w/o contiguous on qkv, make experimental 2023-05-11 11:37:37 -07:00
Ross Wightman cf1884bfeb Add 21k maxvit tf weights 2023-05-10 18:23:32 -07:00