1632 Commits

Author SHA1 Message Date
Ross Wightman
d88bed6535 Bit more Hiera fiddling 2024-04-21 09:36:57 -07:00
Ross Wightman
8a54d2a930 WIP Hiera implementation. Fix #2083. Trying to get image size adaptation to work. 2024-04-20 09:47:17 -07:00
Ross Wightman
de15b8b828 Next release will be 1.0 :o 2024-04-11 08:55:27 -07:00
Ross Wightman
c8da47a773
Update version.py 2024-04-11 08:45:50 -07:00
Ross Wightman
d6b95520f1
Merge pull request #2136 from huggingface/vit_features_only
Exploring vit features_only via new forward_intermediates() API, inspired by #2131
2024-04-11 08:38:20 -07:00
Ross Wightman
24f6d4f7f8 Fix #2127 move to ema device 2024-04-10 21:29:09 -07:00
Ross Wightman
4b2565e4cb More forward_intermediates() / FeatureGetterNet work
* include relpos vit
* refactor reduction / size calcs so hybrid vits work and dynamic_img_size works
* fix -ve feature indices when pruning
* fix mvitv2 w/ class token
* refine naming
* add tests
2024-04-10 15:11:34 -07:00
Ross Wightman
ef9c6fb846 forward_head(), consistent pre_logits handling to reduce likelihood of people manually replacing .head module having issues 2024-04-09 21:54:59 -07:00
Ross Wightman
679daef76a More forward_intermediates() & features_only work
* forward_intermediates() added to beit, deit, eva, mvitv2, twins, vit, vit_sam
* add features_only to forward intermediates to allow just intermediate features
* fix #2060
* fix #1374
* fix #657
2024-04-09 21:29:16 -07:00
Ross Wightman
c28ee2e904
Merge pull request #2145 from huggingface/fix_imagenet22k_ms_mapping
Add teddy-bear class back to first 1000 classes of imagenet22k_ms_synsets (line 851, index 850)
2024-04-09 14:56:31 -07:00
Ross Wightman
f5ea076a46
Merge pull request #2143 from huggingface/fix_asymm_set_grad_enable
Fix #2132, remove use of _C.set_grad_enable. Line endings were messed up too
2024-04-09 10:14:13 -07:00
Ross Wightman
286d941923 Add teddy-bear class back to first 1000 classes of imagenet22k_ms_synsets (index 851) 2024-04-09 09:33:08 -07:00
Ross Wightman
5c5ae8d401 Fix #2132, remove use of _C.set_grad_enable. Line endings were messed up too 2024-04-09 09:00:23 -07:00
Ross Wightman
17b892f703 Fix #2139, disable strict weight loading when head changes from classification 2024-04-09 08:41:37 -07:00
Ross Wightman
5fdc0b4e93 Exploring vit features_only using get_intermediate_layers() as per #2131 2024-04-07 11:24:45 -07:00
fzyzcjy
b44e4e45a2 more 2024-04-02 10:25:30 +08:00
fzyzcjy
8880a5cd5c
Update scheduler.py 2024-03-23 11:27:33 +08:00
Ross Wightman
34b41b143c Fiddling with efficientnet x/h defs, is it worth adding & training any? 2024-03-22 17:55:02 -07:00
Ross Wightman
c559c3911f Improve vit conversions. OpenAI convert pass through main convert for patch & pos resize. Fix #2120 2024-03-21 10:00:43 -07:00
Ross Wightman
256cf19148 Rename tinyclip models to fit existing 'clip' variants, use consistently mapped OpenCLIP compatible checkpoint on hf hub 2024-03-20 15:21:46 -07:00
Thien Tran
1a1d07d479 add other tinyclip 2024-03-19 07:27:09 +08:00
Thien Tran
dfffffac55 add tinyclip 8m 2024-03-19 07:02:17 +08:00
Ross Wightman
6ccb7d6a7c
Merge pull request #2111 from jamesljlster/enhance_vit_get_intermediate_layers
Vision Transformer (ViT) get_intermediate_layers: enhanced to support dynamic image size and saved computational costs from unused blocks
2024-03-18 13:41:18 -07:00
Cheng-Ling Lai
db06b56d34
Saved computational costs of get_intermediate_layers() from unused blocks 2024-03-17 21:34:06 +08:00
Cheng-Ling Lai
4731e4efc4
Modified ViT get_intermediate_layers() to support dynamic image size 2024-03-16 23:07:21 +08:00
Ross Wightman
ba641e07ae Add support for dynamo based onnx export 2024-03-13 12:05:26 -07:00
SmilingWolf
59cb0be595 SwinV2: add configurable act_layer argument
Defaults to "gelu", but makes it possible to pass "gelu_tanh".
Makes it easier to port weights from JAX/Flax, where the tanh
approximation is the default.
2024-03-05 22:04:17 +01:00
Ross Wightman
49992b0dc7
Update version.py
Update to 0.9.16 for release
2024-02-19 11:08:17 -08:00
Ross Wightman
35d6eef0df Version bump, add test markers back to toml 2024-02-16 09:04:00 -08:00
Ross Wightman
31e0dc0a5d Tweak hgnet before merge 2024-02-12 15:00:32 -08:00
Ross Wightman
3e03491e49 Merge branch 'master' of https://github.com/seefun/pytorch-image-models into seefun-master 2024-02-12 14:59:54 -08:00
Ross Wightman
958938845a
Update version.py 2024-02-10 23:10:50 -08:00
Ross Wightman
47c9bc4dc6 Fix device idx split 2024-02-10 21:41:14 -08:00
Ross Wightman
59239d9df5 Cleanup imports for vit relpos 2024-02-10 21:40:57 -08:00
Ross Wightman
ac1b08deb6 fix_init on vit & relpos vit 2024-02-10 20:15:37 -08:00
Ross Wightman
935950cc11 Fix F.sdpa attn drop prob 2024-02-10 20:14:47 -08:00
Ross Wightman
0737cf231d Add Next-ViT 2024-02-10 17:05:16 -08:00
Ross Wightman
d6c2cc91af Make NormMlpClassifier head reset args consistent with ClassifierHead 2024-02-10 16:25:33 -08:00
Ross Wightman
87fec3dc14 Update experimental vit model configs 2024-02-10 16:05:58 -08:00
Ross Wightman
7d3c2dc993 Add group_matcher for DaViT 2024-02-10 14:58:45 -08:00
Ross Wightman
7bc7798d0e Type annotation correctness for create_act 2024-02-10 14:57:58 -08:00
Ross Wightman
7d121ac2ef Small tweak of timm ToTensor for clarity 2024-02-10 14:57:40 -08:00
Ross Wightman
a08b57e801 Fix distributed flag bug w/ flex device handling 2024-02-03 16:26:15 -08:00
Ross Wightman
bee0471f91 forward() pass through for ema model, flag for ema warmup, comment about warmup 2024-02-03 16:24:45 -08:00
Ross Wightman
5e4a4b2adc Merge branch 'device_flex' into mesa_ema 2024-02-02 09:45:30 -08:00
Ross Wightman
dd84ef2cd5 ModelEmaV3 and MESA experiments 2024-02-02 09:45:04 -08:00
Ross Wightman
d0ff315eed Merge remote-tracking branch 'emav3/faster_ema' into mesa_ema 2024-01-27 14:52:10 -08:00
Ross Wightman
88889de923 Fix meshgrid deprecation warnings and backward compat with explicit 'ndgrid' and 'meshgrid' fn w/o indexing arg 2024-01-27 13:48:33 -08:00
Ross Wightman
d4386219c6 Improve type handling for arange & rel pos embeds, keep calculations in float32 until application (may change to apply in float32 in future). Prevent arange type hijacking by DeepSpeed Zero 2024-01-26 16:35:51 -08:00
Ross Wightman
3234daf783 Add missing deprecation mapping for a densenet and xcit model. Fix #2086. Tweak xcit pos embed use of arange for better low prec safety. 2024-01-24 22:04:04 -08:00