Ross Wightman
e700a32626
Cleanup of efficient_vit (mit), tweak eps for better AMP behaviour, formatting/cleanup, weights on hf hub
2023-08-18 16:06:07 -07:00
方曦
00f670fa69
fix bug in ci for efficientvits
2023-08-17 14:40:17 +08:00
Chengpeng Chen
e7f97cb5ce
Fix typos RepGhost models
2023-08-16 14:27:45 +08:00
Chengpeng Chen
d1d0193615
Add RepGhost models and weights
2023-08-16 11:54:53 +08:00
Minseo Kang
7938f28542
Fix typo in efficientformer_v2
2023-08-16 03:29:01 +09:00
yehuitang
b407794e3a
Add GhostNetV2
2023-08-13 18:20:27 +08:00
yehuitang
fc865282e5
Add ghostnetv2.py
2023-08-13 18:16:26 +08:00
Ross Wightman
da75cdd212
Merge pull request #1900 from huggingface/swin_maxvit_resize
...
Add support for resizing swin transformer, maxvit, coatnet at creation time
2023-08-11 15:05:28 -07:00
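For reference, a minimal sketch of what this merge enables, assuming a standard timm install; the model name and size below are illustrative, with pretrained pos embeds / relative position tables adapted to the new geometry at creation time:
```python
import timm

# Illustrative: request a non-default input resolution at creation time;
# pretrained weights are resized to fit rather than failing to load.
model = timm.create_model(
    'swin_base_patch4_window7_224',
    pretrained=True,
    img_size=288,
)
```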
Ross Wightman
78a04a0e7d
Merge pull request #1911 from dsuess/1910-fixes-batchnormact-fx
...
Register norm_act layers as leaf modules
2023-08-11 14:34:16 -07:00
Yonghye Kwon
2048f6f20f
set self.num_features to neck_chans if neck_chans > 0
2023-08-11 13:45:06 +09:00
Ross Wightman
3a44e6c602
Fix #1912 CoaT model not loading w/ return_interm_layers
2023-08-10 11:15:58 -07:00
Daniel Suess
986de90360
Register norm_act layers as leaf modules
2023-08-10 15:37:26 +10:00
Ross Wightman
c692715388
Some RepVit tweaks
...
* add head dropout to RepVit as all models have that arg
* default train to non-distilled head output via distilled_training flag (set_distilled_training) so fine-tune works by default w/o distillation script
* camel case naming tweaks to match other models
2023-08-09 12:41:12 -07:00
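A short sketch of the resulting default (variant name illustrative): fine-tuning uses the non-distilled head unless distillation is explicitly enabled via the flag named above:
```python
import timm

model = timm.create_model('repvit_m1', pretrained=True)  # illustrative variant
# Opt back into distilled-head training only when driven by a distillation script:
model.set_distilled_training(True)
```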
Ross Wightman
c153cd4a3e
Add more advanced interpolation method from BEiT and support non-square window & image size adaptation for
...
* beit/beit-v2
* maxxvit/coatnet
* swin transformer
And non-square windows for swin-v2
2023-08-08 16:41:16 -07:00
alec.tu
bb2b6b5f09
fix num_classes not found
2023-08-07 15:16:03 +08:00
Ross Wightman
1dab536cb1
Fix torch.fx for swin padding change
2023-08-05 13:09:55 -07:00
Ross Wightman
7c0f492dbb
Fix type annotation for torchscript
2023-08-04 23:03:52 -07:00
Ross Wightman
7790ea709b
Add support for resizing swin transformer img_size and window_size on init and load from pretrained weights. Add support for non-square window_size to both swin v1/v2
2023-08-04 22:10:46 -07:00
Ross Wightman
81089b10a2
Remove unnecessary LongTensor in EfficientFormer. Possibly fix #1878
2023-08-03 16:38:53 -07:00
Ross Wightman
4224529ebe
Version 0.9.5 prep for release. README update
2023-08-03 15:16:46 -07:00
Ross Wightman
d138a9bf88
Add gluon hrnet small weights, fix #1895
2023-08-03 12:15:04 -07:00
Ross Wightman
76d166981d
Fix missing norm call in Mlp forward (not used by default, but can be enabled for normformer MLP scale). Fix #1851 fix #1852
2023-08-03 11:36:30 -07:00
Ross Wightman
8e4480e4b6
Patch and pos embed resample done in float32 always (cast to float and back). Fix #1811
2023-08-03 11:32:17 -07:00
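A sketch of the cast-to-float-and-back pattern described here, assuming an NCHW embedding grid; the function name is hypothetical:
```python
import torch
import torch.nn.functional as F

def resample_in_float32(pos_embed: torch.Tensor, new_hw) -> torch.Tensor:
    # Interpolate in float32 regardless of the working dtype (e.g. bf16 under AMP),
    # then cast back; avoids precision and op-support issues in F.interpolate.
    orig_dtype = pos_embed.dtype
    out = F.interpolate(pos_embed.float(), size=new_hw, mode='bicubic', align_corners=False)
    return out.to(orig_dtype)
```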
Ross Wightman
150356c493
Fix unfortunate selecsls case bug caused by aggressive IDE rename
2023-08-03 10:37:06 -07:00
Ross Wightman
6e8c53d0d3
Comment out beit url, no longer valid as it now requires a long query string; left for reference, must use HF hub now.
2023-08-03 10:00:46 -07:00
方曦
a56e2bbf19
fix efficientvit_msra pretrained load
2023-08-03 18:44:38 +08:00
方曦
e94c60b546
efficientvit_msra refactor
2023-08-03 17:45:50 +08:00
方曦
047bab6ab2
efficientvit_mit stage refactor
2023-08-03 14:59:35 +08:00
方曦
e8fb866ccf
fix efficientvit_msra pool
2023-08-02 14:40:01 +08:00
方曦
43443f64eb
fix efficientvits
2023-08-02 14:12:37 +08:00
方曦
82d1e99e1a
add efficientvit(msra)
2023-08-01 18:51:08 +08:00
方曦
b91a77fab7
add EfficientVit (MIT)
2023-08-01 12:42:21 +08:00
Sepehr Sameni
40a518c194
use float in resample_abs_pos_embed_nhwc
...
since F.interpolate doesn't always support BFloat16
2023-07-28 16:01:42 -07:00
Ross Wightman
8cb0ddac45
Update README, version 0.9.4dev0
2023-07-27 17:07:31 -07:00
Ross Wightman
a9d0615f42
Fix ijepa vit issue with 448 model, minor formatting fixes
2023-07-26 20:46:27 -07:00
alec.tu
942726db31
import lion in __init__.py
2023-07-27 09:26:57 +08:00
Ross Wightman
5874d1bfc7
Merge pull request #1876 from jameslahm/main
...
Add RepViT models
2023-07-26 14:38:41 -07:00
Ross Wightman
b10310cc27
Add proper pool size for new resnexts
2023-07-26 14:36:03 -07:00
Ross Wightman
b71d60cdb7
Two small fixes, num_classes in base class, add model tag
2023-07-26 13:18:49 -07:00
Ross Wightman
3561f8e885
Add seresnextaa201d_32x8d 12k and 1k weights
2023-07-26 13:17:05 -07:00
jameslahm
3318e7614d
Add RepViT models
2023-07-21 14:56:53 +08:00
Ruslan Baikulov
158bf129c4
Replace deprecated NumPy aliases of builtin types
2023-07-03 22:24:25 +03:00
Ross Wightman
c241081251
Merge pull request #1850 from huggingface/effnet_improve_features_only
...
Support other features only modes for EfficientNet. Fix #1848 fix #1849
2023-06-23 22:56:08 -07:00
Ross Wightman
47517dbefd
Clean more feature extract issues
...
* EfficientNet/MobileNetV3/HRNetFeatures cls and FX mode support negative index
* MobileNetV3 allows feature_cfg mode to bypass MobileNetV3Features
2023-06-14 14:46:22 -07:00
Ross Wightman
a09c88ed0f
Support other features only modes for EfficientNet
2023-06-14 12:57:39 -07:00
SeeFun
c3f24a5ae5
Add ViT weight from I-JEPA pretrain
2023-06-14 22:30:31 +08:00
Ross Wightman
2d597b126d
Missed extra nadam algo step for capturable path
2023-06-13 20:51:31 -07:00
Ross Wightman
4790c0fa16
Missed nadamw.py
2023-06-13 20:45:58 -07:00
Ross Wightman
dab0360e00
Add NadamW based on mlcommons algorithm, added multi-tensor step
2023-06-13 20:45:17 -07:00
Ross Wightman
700aebcdc4
Fix Pytorch 2.0 breakage for Lookahead optimizer adapter
2023-06-02 08:39:07 -07:00
Lengyue
c308dbc6f2
update dinov2 layerscale init values
2023-05-24 12:20:17 -04:00
Ross Wightman
7cea88e2c4
Pop eps for lion optimizer
2023-05-21 15:20:03 -07:00
Ross Wightman
e9373b1b92
Cleanup before samvit merge. Resize abs posembed on the fly, undo some line-wraps, remove redundant unbind, fix HF hub weight load
2023-05-18 16:43:48 -07:00
方曦
c1c6eeb909
fix loading pretrained weight for samvit
2023-05-18 08:49:29 +08:00
方曦
15de561f2c
fix unit test for samvit
2023-05-17 12:51:12 +08:00
方曦
ea1f52df3e
add ViT for Segment-Anything Model
2023-05-17 11:39:29 +08:00
Ross Wightman
960202cfcc
Dev version 0.9.3 for main
2023-05-16 11:28:00 -07:00
Ross Wightman
c5d3ee47f3
Add B/16 datacompxl CLIP weights
2023-05-16 11:27:20 -07:00
Ross Wightman
3d05c0e86f
Version 0.9.2
2023-05-14 08:03:04 -07:00
Philip Keller
fc77e9ecc5
Update hub.py
...
fixed import of _hub modules
2023-05-12 21:48:46 +02:00
Ross Wightman
cc77096350
Version 0.9.1
2023-05-12 09:47:47 -07:00
Ross Wightman
f744bda994
use torch.jit.Final instead of Final for beit, eva
2023-05-12 09:12:14 -07:00
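The pattern, sketched with an illustrative class: annotating with torch.jit.Final rather than typing.Final keeps TorchScript treating the flag as a compile-time constant across PyTorch versions:
```python
import torch
from torch import nn

class Attention(nn.Module):
    fused_attn: torch.jit.Final[bool]  # scripted as a constant, enabling branch pruning

    def __init__(self, fused_attn: bool = True):
        super().__init__()
        self.fused_attn = fused_attn
```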
Ross Wightman
2e99bcaedd
Update README, prep for version 0.9.0 release
2023-05-11 15:22:50 -07:00
Ross Wightman
3eaf729f3f
F.sdpa for visformer fails w/o contiguous on qkv, make experimental
2023-05-11 11:37:37 -07:00
Ross Wightman
cf1884bfeb
Add 21k maxvit tf weights
2023-05-10 18:23:32 -07:00
Ross Wightman
6c2edf4d74
Missed hub_id entries for byoanet models
2023-05-10 15:58:55 -07:00
Ross Wightman
cf101b0097
Version 0.8.23dev0 and README update
2023-05-10 14:41:22 -07:00
Ross Wightman
850ab4931f
Missed a few pretrained tags...
2023-05-10 12:16:30 -07:00
Ross Wightman
ff2464e2a0
Throw when pretrained weights not available and pretrained=True (principle of least surprise).
2023-05-10 10:44:34 -07:00
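The behaviour change, sketched (model name hypothetical, and the exact exception type is an assumption): requesting unavailable pretrained weights now raises instead of silently returning a randomly initialized model:
```python
import timm

try:
    model = timm.create_model('model_without_released_weights', pretrained=True)
except RuntimeError as err:
    print(f'no pretrained weights available: {err}')
```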
Ross Wightman
8ce9a2c00a
Merge pull request #1222 from Leoooo333/master
...
Fix mixup/one_hot device problem
2023-05-10 08:59:15 -07:00
Ross Wightman
fd592ec86c
Fix an issue with FastCollateMixup still using device
2023-05-10 08:55:38 -07:00
Ross Wightman
e0ec0f7252
Merge pull request #1643 from nateraw/docstrings-update
...
Update Docstring for create_model
2023-05-09 21:33:20 -07:00
Ross Wightman
627b6315ba
Add typing to dinov2 entrypt fns, use hf hub for mae & dinov2 weights
2023-05-09 20:42:11 -07:00
Ross Wightman
b9d43c7dca
Version 0.8.22dev0
2023-05-09 20:38:10 -07:00
Ross Wightman
960a882510
Remove label offsets and remove old weight url for 1001 class (background + in1k) TF origin weights
2023-05-09 18:00:41 -07:00
Ross Wightman
a01d8f86f4
Tweak DinoV2 add, add MAE ViT weights, add initial intermediate layer getter experiment
2023-05-09 17:59:22 -07:00
Ross Wightman
59bea4c306
Merge branch 'main' into dot_nine_cleanup
2023-05-09 12:27:32 -07:00
Leng Yue
5cc87e6485
Add dinov2 pretrained models (#1797)
...
* add dinov2 small, base, and large
* fix input size
* fix swiglu & dinov2 vit giant
* use SwiGLUPacked to replace GluMlp
* clean up & add ffn_layer placeholder for ParallelScalingBlock
2023-05-09 12:24:47 -07:00
Ross Wightman
e3363a7159
Support bitsandbytes optimizers in factory
2023-05-09 11:33:51 -07:00
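A usage sketch, assuming the factory routes 'bnb'-prefixed opt names to bitsandbytes (which must be installed separately); the exact name string is an assumption:
```python
import timm
from timm.optim import create_optimizer_v2

model = timm.create_model('resnet50')
# 8-bit Adam from bitsandbytes, selected through the timm optimizer factory.
optimizer = create_optimizer_v2(model, opt='bnbadam8bit', lr=1e-3)
```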
Ross Wightman
21e57c0b9e
Add missing beitv2 in1k -> in1k models
2023-05-08 17:03:51 -07:00
Ross Wightman
8c6fccb879
Allow passing state_dict directly via pretrained cfg mechanism as an override
2023-05-08 15:15:44 -07:00
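A sketch of the override, assuming it is reachable via the pretrained_cfg_overlay kwarg; the checkpoint path is hypothetical:
```python
import torch
import timm

sd = torch.load('my_checkpoint.pth', map_location='cpu')  # hypothetical path
model = timm.create_model(
    'resnet50',
    pretrained=True,
    pretrained_cfg_overlay=dict(state_dict=sd),  # use sd instead of a hub/url download
)
```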
Ross Wightman
af48246a9a
Add SwiGLUPacked to layers __init__
2023-05-08 13:52:34 -07:00
Ross Wightman
3fdb31de2e
Small SwiGLU tweak, remove default LN arg in unpacked variant, add packed alias for GluMLP
2023-05-08 12:28:00 -07:00
Ross Wightman
e4e43190ce
Add typing to all model entrypoint fns, add old cache check env var to builder
2023-05-08 08:52:38 -07:00
Ross Wightman
cb3f9c23bb
Metaformer baselines for vision (final PR with cleanup) (#1793)
...
* Update metaformers.py (many repeated incremental updates)
* merge with poolformer, initial version
* Update metaformers.py (many repeated incremental updates)
* Revert several "Update metaformers.py" commits (2916f37f8d, 1d882eb494, 2209d0830e, 32bede4e27, 4ed934e000, 3f0b075367, 2fef9006d7)
* rename model
* Update metaformers.py (many repeated incremental updates)
* Stem/Downsample rework
* try NHWC
* Update metaformers.py (many repeated incremental updates)
* Squashed commit of the following: incremental metaformers.py updates by Fredo Guan (Feb 1-10, 2023), including "try NHWC" and "Stem/Downsample rework"
* channels first for whole network
* Channels first
* Use buffer for randformer
* Remove einsum
* don't test randformer for feature extraction
* arbitrary input sizes for randformer
* Squashed commit of the following: incremental metaformers.py updates and merges by Fredo Guan (Mar 6, 2023), including Metaformer baselines for vision (#12):
  * formatting, cleanup, fix dropout
  * fix regression, pass kwargs
  * fix poolformerv1 weights, formatting
  * some cleanup
  * SDPA from ViT, fix imports
  * fix head reset
  * fast norm bias patch for metaformers
  * Metaformer refactor, remove rand/ident models, fix issues, remove old poolformer
  * Switch to hub weights
---------
Co-authored-by: Fredo Guan <fredo.guan@hotmail.com>
2023-05-05 11:18:26 -07:00
Ross Wightman
320bf9c469
Remove redundant types, kwargs back in own section (lesser of many evils?)
2023-05-01 14:21:48 -07:00
Ross Wightman
8fa86a28a8
Add datacomp L/14 (79.2 zs) image tower weights
2023-05-01 10:24:08 -07:00
Ross Wightman
5e64777804
0.8.21dev0
2023-04-28 13:46:59 -07:00
Ross Wightman
493c730ffc
Fix pit regression
2023-04-26 23:16:06 -07:00
Ross Wightman
437d344e03
Always some torchscript issues
2023-04-26 20:42:34 -07:00
Ross Wightman
528faa0e04
Some fixes
2023-04-26 17:46:20 -07:00
Ross Wightman
3386af8c86
Final push to get remaining models using multi-weight pretrained configs, almost all weights on HF hub
2023-04-26 15:52:13 -07:00
Ross Wightman
c0560cbf22
version 0.8.20dev0
2023-04-21 16:57:32 -07:00
Ross Wightman
7ad7ddb7ad
DenseNet, DPN, VoVNet, Aligned Xception weights on HF hub. DenseNet grad_checkpointing using timm API
2023-04-21 16:56:44 -07:00
Ross Wightman
864bfd43d0
HardCoRe-NAS weights on hf hub
2023-04-21 14:35:10 -07:00
Ross Wightman
6e4529ae35
TResNet weights now on HF hub, modified to remove InplaceABN dependency
2023-04-21 14:20:48 -07:00
Ross Wightman
04dcbc02ec
Fix weight remap for tresnet_v2_l
2023-04-21 09:05:04 -07:00
Ross Wightman
a08e5aed1d
More models w/ multi-weight support, moving to HF hub. Removing inplace_abn from all models including TResNet
2023-04-20 22:44:49 -07:00
Ross Wightman
2aabaef039
Merge pull request #1784 from huggingface/wip-voidbag-accumulate-grad
...
Accumulate gradients (adding to #1659)
2023-04-20 08:15:28 -07:00
Ross Wightman
f4825a09ef
Merge pull request #212 from bryant1410/patch-1
...
Fix MultiEpochsDataLoader when there's no batching
2023-04-20 07:09:27 -07:00
Ross Wightman
4cd7fb88b2
clip gradients with update
2023-04-19 23:36:20 -07:00
Ross Wightman
df81d8d85b
Cleanup gradient accumulation, fix a few issues, a few other small cleanups in related code.
2023-04-19 23:11:00 -07:00
Ross Wightman
ab7ca62a6e
Merge branch 'main' of github.com:rwightman/pytorch-image-models into wip-voidbag-accumulate-grad
2023-04-19 11:08:12 -07:00
Ross Wightman
34df125be6
cait, volo, xvit hub weights
2023-04-14 10:13:13 -07:00
Ross Wightman
f6d5767551
cspnet models on HF hub w/ multi-weight support
2023-04-12 14:02:38 -07:00
Ross Wightman
aef6e562e4
Add onnx utils and export code, tweak padding and conv2d_same for better dynamic export with recent PyTorch
2023-04-11 17:03:57 -07:00
Ross Wightman
80b247d843
Update swin_v2 attn_mask buffer change in #1790 to apply to updated checkpoints in hub
2023-04-11 14:40:32 -07:00
Ross Wightman
1a1aca0cee
Merge pull request #1761 from huggingface/patch_drop_refactor
...
Implement patch dropout for eva / vision_transformer, refactor dropout args
2023-04-11 14:37:36 -07:00
Ross Wightman
c0670822d2
Small factory handling fix for pretrained tag vs cfg
2023-04-11 07:42:13 -07:00
Ross Wightman
2f25f73b90
Missed a fused_attn update in relpos vit
2023-04-10 23:30:50 -07:00
Ross Wightman
0b65b5c0ac
Add finalized eva CLIP weights pointing to remapped timm hub models
2023-04-10 23:13:12 -07:00
Ross Wightman
965d0a2d36
fast_attn -> fused_attn, implement global config for enable/disable fused_attn, add to more models. vit clip openai 336 weights.
2023-04-10 12:04:33 -07:00
Ross Wightman
4d135421a3
Implement patch dropout for eva / vision_transformer, refactor / improve consistency of dropout args across all vit based models
2023-04-07 20:27:23 -07:00
Marco Forte
c76818a592
skip attention mask buffers
...
Allows more flexibility in the resolutions accepted by SwinV2.
2023-04-07 18:50:02 +02:00
Ross Wightman
1bb3989b61
Improve kwarg passthrough for swin, vit, deit, beit, eva
2023-04-05 21:37:16 -07:00
Ross Wightman
35c94b836c
Update warning message for deprecated model names
2023-04-05 17:24:17 -07:00
Ross Wightman
9eaab795c2
Add some vit model deprecations
2023-04-05 17:21:03 -07:00
Ross Wightman
b17abd35b2
Version 0.8.19dev0
2023-04-05 16:37:16 -07:00
Ross Wightman
abff3f12ec
Wrong pool_size for 288 ft
2023-04-05 16:07:51 -07:00
Ross Wightman
356309959c
ResNet models on HF hub, multi-weight support, add torchvision v2 weights, new 12k pretrained and fine-tuned timm anti-aliased weights
2023-04-05 14:19:42 -07:00
Ross Wightman
7501972cd6
Version 0.8.18dev0
2023-03-31 16:51:26 -07:00
Ross Wightman
beef7f0a22
Add ImageNet-12k intermediate fine-tunes of convnext base & large CLIP models, add first 1k fine-tune of xxlarge
2023-03-31 16:45:01 -07:00
Ross Wightman
9aa1133bd2
Fix #1750, uncomment weight that exists on HF hub, add FIXME to 3 others that are still on local storage
2023-03-31 14:49:30 -07:00
Ross Wightman
7326470514
Merge pull request #1746 from huggingface/eva02
...
Adding EVA02 weights and model defs
2023-03-31 12:17:00 -07:00
Ross Wightman
adeb9de7c6
Mismatch in eva pretrained_cfg vs model for one of the clip variants
2023-03-31 10:30:30 -07:00
Ross Wightman
0737bd3ec8
eva02 non-CLIP weights on HF hub, add initial eva02 clip model configs w/ postnorm variant & attn LN
2023-03-30 23:43:59 -07:00
Ross Wightman
ac67098147
Add final attr for fast_attn on beit / eva
2023-03-28 08:40:40 -07:00
Ross Wightman
1885bdc431
Merge pull request #1745 from huggingface/mw-mlp_mixer
...
MLP-Mixer multi-weight support, HF hub push
2023-03-28 07:55:17 -07:00
Ross Wightman
e9f427b953
Add hf hub entries for mlp_mixer
2023-03-27 22:50:43 -07:00
Ross Wightman
cff81deb78
multi-weight and hf hub for deit / deit3
2023-03-27 22:47:16 -07:00
Ross Wightman
3863d63516
Adding EVA02 weights and model defs, move beit based eva_giant to same eva.py file. Cleanup rotary pos, add lang oriented freq bands to be compat with eva design choice. Fix #1738
2023-03-27 17:16:07 -07:00
Ross Wightman
b12060996c
MLP-Mixer multi-weight support, hf hub push
2023-03-27 16:42:13 -07:00
Ross Wightman
d196fa536d
Fix last min torchscript regression in nfnet changes
2023-03-24 00:10:17 -07:00
Ross Wightman
33ada0cbca
Add group_matcher to focalnet for proper layer-wise LR decay
2023-03-23 23:21:49 -07:00
Ross Wightman
b271dc0e16
NFNet multi-weight support + HF hub push
2023-03-23 23:20:38 -07:00
Ross Wightman
a089bfba2d
Version 0.8.17dev0
2023-03-22 15:40:23 -07:00
Ross Wightman
dbd33e4b62
Update crop settings for new rexnet weights
2023-03-22 15:39:49 -07:00
Ross Wightman
da6bdd4560
Update resnetv2.py for multi-weight and HF hub weights
2023-03-22 15:38:04 -07:00
Ross Wightman
b3e816d6d7
Improve filtering behaviour for tag + non-tagged model wildcard consistency.
2023-03-22 10:21:22 -07:00
Ross Wightman
7aba64ebdb
Update byobnet.py w/ models pushed to HF hub
2023-03-22 10:00:00 -07:00
Ross Wightman
e7ef8335bf
regnet.py multi-weight conversion, new ImageNet-12k pretrain/ft from timm for y_120 and y_160, also new tv v2, swag, & seer weights for push to Hf hub.
2023-03-21 15:51:49 -07:00
Ross Wightman
c78319adce
Add ImageNet-12k ReXNet-R 200 & 300 weights, and push existing ReXNet models to HF hub. Dilation support added to rexnet
2023-03-20 13:48:17 -07:00
Ross Wightman
8db20dc240
Fix #1726, dropout not used in NormMlpClassifierHead. Make dropout more consistent across both classifier heads (nn.Dropout)
2023-03-20 09:37:05 -07:00
Ross Wightman
041de79f9e
Fix numel use in helpers for checkpoint remap
2023-03-20 09:36:48 -07:00
Ross Wightman
49b9c3be80
Include pretrained tag in deprecated mapping warning
2023-03-19 21:21:19 -07:00
Ross Wightman
fafac3317c
Version 0.8.16dev0
2023-03-18 15:09:20 -07:00
Ross Wightman
9fcfb8bcc1
Add Microsoft FocalNet specific ('ms') ImageNet-22k classifier layout
2023-03-18 14:57:34 -07:00
Ross Wightman
572f05096a
Swin and FocalNet weights on HF hub. Add model deprecation functionality w/ some registry tweaks.
2023-03-18 14:55:09 -07:00
Ross Wightman
5aebad3fbc
return_map back to out_map for _feature helpers
2023-03-16 14:50:55 -07:00
Ross Wightman
acfd85ad68
All swin models support spatial output, add output_fmt to v1/v2 and use ClassifierHead.
...
* update ClassifierHead to allow different input format
* add output format support to patch embed
* fix some flatten issues for a few conv head models
* add Format enum and helpers for tensor format (layout) choices
2023-03-15 23:21:51 -07:00
Ross Wightman
c30a160d3e
Merge remote-tracking branch 'origin/main' into focalnet_and_swin_refactor
2023-03-15 15:58:39 -07:00
Ross Wightman
ad94d737b7
Add support to ConvNextBlock for downsample and channel expansion to improve stand alone use. Fix #1699
2023-03-13 14:06:24 -07:00
Ross Wightman
3a636eee71
Fix #1713 missed assignment in 3-aug level fn, fix a few other minor lint complaints in auto_augment.py
2023-03-11 14:32:23 -08:00
Piotr Sebastian Kluska
992bf7c3d4
chore: Modify the MobileVitV2Block to be coreml exportable
...
Based on is_exportable(), set a variable controlling the behaviour of the block.
CoreMLTools supports im2col from version 6.2; unfortunately col2im
is still not supported.
Tested with exporting to ONNX, Torchscript, CoreML, and TVM.
2023-03-03 09:38:24 +01:00
Ross Wightman
4b8cfa6c0a
Add convnext_xxlarge CLIP image tower weights, version 0.8.15dev0
2023-02-26 21:51:48 -08:00
Ross Wightman
f9b56a1bfa
Version 0.8.14dev0
2023-02-26 13:38:51 -08:00
Ross Wightman
1c13ef7b46
Add default norm_eps=1e-5 for convnext_xxlarge, improve kwarg merging for all convnext models
2023-02-26 12:11:49 -08:00
Benjamin Bossan
a5b01ec04e
Add type annotations to _registry.py
...
Description
Add type annotations to _registry.py so that they will pass mypy
--strict.
Comment
I was reading the code and felt that this module would be easier to
understand with type annotations. Therefore, I went ahead and added the
annotations.
The idea with this PR is to start small to see if we can align on _how_
to annotate types. I've seen people in the past disagree on how strictly
to annotate the code base, so before spending too much time on this, I
wanted to check if you agree, Ross.
Most of the added types should be straightforward. Some notes on the
non-trivial changes:
- I made no assumption about the fn passed to register_model, but maybe
the type could be stricter. Are all models nn.Modules?
- If I'm not mistaken, the type hint for get_arch_name was incorrect
- I had to add a # type: ignore to model.__all__ = ...
- I made some minor code changes to list_models to facilitate the
typing. I think the changes should not affect the logic of the function.
- I removed list from list(sorted(...)) because sorted returns always a
list.
2023-02-22 09:19:30 -08:00
Ross Wightman
47f1de9bec
Version bump
2023-02-20 10:17:10 -08:00
Ross Wightman
4d9c3ae2fb
Add laion2b 320x320 ConvNeXt-Large CLIP weights
2023-02-18 16:34:03 -08:00
Ross Wightman
d0b45c9b4d
Make safetensors import optional for now. Improve avg/clean checkpoints ext handling a bit (more consistent).
2023-02-18 16:06:42 -08:00
Ross Wightman
947c1d757a
Merge branch 'main' into focalnet_and_swin_refactor
2023-02-17 16:28:52 -08:00
Ross Wightman
cf324ea38f
Fix grad checkpointing in focalnet
2023-02-17 16:26:26 -08:00
Ross Wightman
848d200767
Overhaul FocalNet implementation
2023-02-17 16:24:59 -08:00
Ross Wightman
7266c5c716
Merge branch 'main' into focalnet_and_swin_refactor
2023-02-17 09:20:14 -08:00
Ross Wightman
7d9e321b76
Improve tracing of window attn models with simpler reshape logic
2023-02-17 07:59:06 -08:00
Ross Wightman
2e38d53dca
Remove dead line
2023-02-16 16:57:42 -08:00
Ross Wightman
f77c04ff36
Torchscript fixes/hacks for rms_norm, refactor ParallelScalingBlock with manual combination of input projections, closer paper match
2023-02-16 16:57:42 -08:00
Ross Wightman
122621daef
Add Final annotation to attn_fas to avoid symbol lookup of new scaled_dot_product_attn fn on old PyTorch in jit
2023-02-16 16:57:42 -08:00
Ross Wightman
621e1b2182
Add ideas from 'Scaling ViT to 22-B Params', testing PyTorch 2.0 fused F.scaled_dot_product_attention impl in vit, vit_relpos, maxxvit / coatnet.
2023-02-16 16:57:42 -08:00
Ross Wightman
a3d528524a
Version 0.8.12dev0
2023-02-16 16:27:29 -08:00
testbot
a09d403c24
changed warning to info
2023-02-16 16:20:31 -08:00
testbot
8470e29541
Add support to load safetensors weights
2023-02-16 16:20:31 -08:00
Ross Wightman
f35d6ea57b
Add multi-tensor (foreach) version of Lion in style of upcoming PyTorch 2.0 optimizers
2023-02-16 15:48:00 -08:00
Ross Wightman
709d5e0d9d
Add Lion optimizer
2023-02-14 23:55:05 -08:00
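Usage through the optimizer factory, sketched; Lion generally wants a smaller lr and larger weight decay than AdamW, and the values below are illustrative only:
```python
import timm
from timm.optim import create_optimizer_v2

model = timm.create_model('resnet50')
optimizer = create_optimizer_v2(model, opt='lion', lr=1e-4, weight_decay=0.1)
```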
Ross Wightman
624266148d
Remove unused imports from _hub helpers
2023-02-09 17:47:26 -08:00
Ross Wightman
2cfff0581b
Add grad_checkpointing support to features_only, test in EfficientDet.
2023-02-09 17:45:40 -08:00
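Sketch of the combination this enables (backbone name illustrative, assuming the features_only wrapper exposes the same toggle as the underlying model):
```python
import timm

backbone = timm.create_model('efficientnet_b0', features_only=True)
backbone.set_grad_checkpointing(True)  # trade recompute for activation memory
```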
Ross Wightman
45af496197
Version 0.8.11dev0
2023-02-08 08:29:29 -08:00
Ross Wightman
9c14654a0d
Improve support for custom dataset label name/description through HF hub export, via pretrained_cfg
2023-02-08 08:29:20 -08:00
Ross Wightman
497be8343c
Update README and version
2023-02-06 23:43:14 -08:00
Ross Wightman
0d33127df2
Add 384x384 convnext_large_mlp laion2b fine-tune on in1k
2023-02-06 22:01:04 -08:00
Ross Wightman
7a0bd095cb
Update model prune loader to use pkgutil
2023-02-06 17:45:16 -08:00
Ross Wightman
0f2803de7a
Move ImageNet metadata (aka info) files to timm/data/_info. Add helper classes to make info available for labelling. Update inference.py for first use.
2023-02-06 17:45:03 -08:00
Taeksang Kim
7f29a46d44
Add gradient accumulation option to train.py
...
option: iters-to-accum (iterations to accumulate)
Gradient accumulation improves training throughput (samples/s).
It can reduce the amount of parameter synchronization between nodes.
This option can be helpful when the network is the bottleneck.
Signed-off-by: Taeksang Kim <voidbag@puzzle-ai.com>
2023-02-06 09:24:48 +09:00
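A generic sketch of the technique this option enables, not the train.py implementation itself; all names are illustrative:
```python
import torch

def train_epoch(model, loader, optimizer, loss_fn, accum_steps: int = 4):
    optimizer.zero_grad()
    for i, (x, y) in enumerate(loader):
        loss = loss_fn(model(x), y) / accum_steps  # scale so accumulated grads average
        loss.backward()                            # grads accumulate into .grad
        if (i + 1) % accum_steps == 0:
            optimizer.step()                       # one update per accum_steps batches
            optimizer.zero_grad()
```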
Ross Wightman
7a13be67a5
Update version.py
2023-02-05 10:06:15 -08:00
Ross Wightman
13acac8c5e
Update head metadata for effformerv2
2023-02-04 23:11:51 -08:00
Ross Wightman
8682528096
Add first conv metadata for efficientformer_v2
2023-02-04 23:02:02 -08:00
Ross Wightman
72fba669a8
is_scripting() guard on checkpoint_seq
2023-02-04 14:21:49 -08:00
Ross Wightman
95ec255f7f
Finish timm mode api for efficientformer_v2, add grad checkpointing support to both efficientformers
2023-02-03 21:21:23 -08:00
Ross Wightman
9d03c6f526
Merge remote-tracking branch 'origin/main' into levit_efficientformer_redux
2023-02-03 14:47:01 -08:00
Ross Wightman
086bd55a94
Add EfficientFormer-V2, refactor EfficientFormer and Levit for more uniformity across the 3 related arch. Add features_out support to levit conv models and efficientformer_v2. All weights on hub.
2023-02-03 14:12:29 -08:00
Ross Wightman
2cb2699dc8
Apply fix from #1649 to main
2023-02-03 11:28:57 -08:00
Ross Wightman
b3042081b4
Add laion -> in1k fine-tuned base and large_mlp weights for convnext
2023-02-03 10:58:02 -08:00
Ross Wightman
316bdf8955
Add mlp head support for convnext_large, add laion2b CLIP weights, prep fine-tuned weight tags
2023-02-01 08:27:02 -08:00
Ross Wightman
6f28b562c6
Factor NormMlpClassifierHead from MaxxViT and use across MaxxViT / ConvNeXt / DaViT, refactor some type hints & comments
2023-01-27 14:57:01 -08:00
Ross Wightman
9a53c3f727
Finalize DaViT, some formatting and modelling simplifications (separate PatchEmbed to Stem + Downsample), weights on HF hub.
2023-01-27 13:54:04 -08:00
Fredo Guan
fb717056da
Merge remote-tracking branch 'upstream/main'
2023-01-26 10:49:15 -08:00
nateraw
14b84e8895
📝 update docstrings
2023-01-26 00:49:44 -05:00
nateraw
f0dc8a8267
📝 update docstrings for create_model
2023-01-25 21:10:41 -05:00
Ross Wightman
2bbc26dd82
version 0.8.8dev0
2023-01-25 18:02:48 -08:00
Ross Wightman
64667bfa0e
Add 'gigantic' vit clip variant for feature extraction and future fine-tuning
2023-01-25 18:02:10 -08:00
Ross Wightman
c2822568ec
Update version to 0.8.7dev0
2023-01-20 15:01:10 -08:00
Ross Wightman
36989cfae4
Factor out readme generation in hub helper, add more readme fields
2023-01-20 14:49:40 -08:00
Ross Wightman
32f252381d
Change order of checkpoint filtering fn application in builder, try dict, model variant first
2023-01-20 14:48:54 -08:00
Ross Wightman
e9f1376cde
Cleanup resolve data config fns, add 'model' variant that takes model as first arg, make 'args' arg optional in original fn
2023-01-20 14:47:55 -08:00
Ross Wightman
bed350f5e5
Push all MaxxViT weights to HF hub, cleanup impl, add feature map extraction support and promote to 'std' architecture. Fix norm head for proper embedding / feat map output. Add new in12k + ft 1k weights.
2023-01-20 14:45:25 -08:00
Ross Wightman
ca38e1e73f
Update ClassifierHead module, add reset() method, update in_chs -> in_features for consistency
2023-01-20 14:44:05 -08:00
Ross Wightman
8ab573cd26
Add convnext_tiny and convnext_small 384x384 fine-tunes of in12k weights, fix pool size for laion CLIP convnext weights
2023-01-20 14:40:16 -08:00
Fredo Guan
81ca323751
Davit update formatting and fix grad checkpointing (#7)
...
fixed head to gap->norm->fc as per convnext, along with option for norm->gap->fc
failed tests due to clip convnext models, davit tests passed
2023-01-15 14:34:56 -08:00
Ross Wightman
e9aac412de
Correct mean/std for CLIP convnexts
2023-01-14 22:53:56 -08:00
Ross Wightman
42bd8f7bcb
Add convnext_base CLIP image tower weights for fine-tuning / features
2023-01-14 21:16:29 -08:00
Ross Wightman
e520553e3e
Update batchnorm freezing to handle NormAct variants, Add GroupNorm1Act, update BatchNormAct2d tracing change from PyTorch
2023-01-12 16:55:47 -08:00
Ross Wightman
a2c14c2064
Add tiny/small in12k pretrained and fine-tuned ConvNeXt models
2023-01-11 14:50:39 -08:00
Ross Wightman
c061d5e401
Allow using class_map functionality w/ IterableDataset (TFDS/WDS) to remap class labels
2023-01-09 16:28:47 -08:00
Ross Wightman
01fdf44438
Initial focalnet import, more refactoring needed for timm.
2023-01-09 16:18:19 -08:00
Ross Wightman
01aea8c1bf
Version 0.8.6dev0
2023-01-09 13:38:31 -08:00
Ross Wightman
2e83bba142
Revert head norm changes to ConvNeXt as it broke some downstream use, alternate workaround for fcmae weights
2023-01-09 13:37:40 -08:00
Ross Wightman
1825b5e314
maxxvit type
2023-01-09 08:57:31 -08:00
Ross Wightman
5078b28f8a
More kwarg handling tweaks, maxvit_base_rw def added
2023-01-09 08:57:31 -08:00
Ross Wightman
c0d7388a1b
Improving kwarg merging in more models
2023-01-09 08:57:31 -08:00
Ross Wightman
ae9153052f
Update version.py
2023-01-06 17:17:35 -08:00
Ross Wightman
60ebb6cefa
Re-order vit pretrained entries for more sensible default weights (no .tag specified)
2023-01-06 16:12:33 -08:00
Ross Wightman
e861b74cf8
Pass through --model-kwargs (and --opt-kwargs for train) from command line through to model __init__. Update some models to improve arg overlay. Cleanup along the way.
2023-01-06 16:12:33 -08:00
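The flags forward extra kwargs to model/optimizer construction; e.g. `train.py --model vit_base_patch16_224 --model-kwargs drop_path_rate=0.1` roughly amounts to the following (value illustrative):
```python
import timm

# What the --model-kwargs passthrough boils down to under the hood:
model = timm.create_model('vit_base_patch16_224', drop_path_rate=0.1)
```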
Ross Wightman
add3fb864e
Working on improved model card template for push_to_hf_hub
2023-01-06 16:12:33 -08:00
Ross Wightman
dd0bb327e9
Update version.py
...
Ver 0.8.4dev0
2023-01-05 07:55:18 -08:00
Ross Wightman
6e5553da5f
Add ConvNeXt-V2 support (model additions and weights) (#1614)
...
* Add ConvNeXt-V2 support (model additions and weights)
* ConvNeXt-V2 weights on HF Hub, tweaking some tests
* Update README, fixing convnextv2 tests
2023-01-05 07:53:32 -08:00
Ross Wightman
6902c48a5f
Fix ResNet based models to work w/ norm layers w/o affine params. Reformat long arg lists into vertical form.
2022-12-29 16:32:26 -08:00
Ross Wightman
d5aa17e415
Remove print from auto_augment
2022-12-28 17:11:35 -08:00
Ross Wightman
7c846d9970
Better vmap compat across recent torch versions
2022-12-24 14:37:04 -08:00
Ross Wightman
4e24f75289
Merge pull request #1593 from rwightman/multi-weight_effnet_convnext
...
Update efficientnet.py and convnext.py to multi-weight, add new 12k pretrained weights
2022-12-23 10:09:08 -08:00
Ross Wightman
8ece53e194
Switch BEiT to HF hub weights
2022-12-22 21:43:04 -08:00
Ross Wightman
d1bfa9a000
Support HF datasets and TFDS w/ a sub-path by fixing split, fix #1598 ... add class mapping support to HF datasets in case class label isn't in info.
2022-12-22 21:34:13 -08:00
Ross Wightman
e2fc43bc63
Version 0.8.2dev0
2022-12-22 17:34:09 -08:00
Ross Wightman
9a51e4ea2e
Add FlexiViT models and weights, refactoring, push more weights
...
* push all vision_transformer*.py weights to HF hub
* finalize more pretrained tags for pushed weights
* refactor pos_embed files and module locations, move some pos embed modules to layers
* tweak hf hub helpers to aid bulk uploading and updating
2022-12-22 17:23:09 -08:00
Fredo Guan
10b3f696b4
Davit std (#6)
...
Separate patch_embed module
2022-12-16 21:50:28 -08:00
Ross Wightman
656e1776de
Convert mobilenetv3 to multi-weight, tweak PretrainedCfg metadata
2022-12-16 09:29:13 -08:00
Fredo Guan
546590c5f5
Merge branch 'rwightman:main' into main
2022-12-14 23:44:15 -08:00
Ross Wightman
6a01101905
Update efficientnet.py and convnext.py to multi-weight, add ImageNet-12k pretrained EfficientNet-B5 and ConvNeXt-Nano.
2022-12-14 20:33:23 -08:00
alec.tu
74d6afb4cd
Add Adan to __init__.py
2022-12-15 11:37:29 +08:00
Fredo Guan
84178fca60
Merge branch 'rwightman:main' into main
2022-12-12 23:13:58 -08:00
Fredo Guan
c43340ddd4
Davit std (#5)
...
* Update davit.py (many repeated incremental updates)
* starting point
* Update davit.py (many repeated incremental updates)
* Update test_models.py
* Update davit.py (many repeated incremental updates)
* Davit revised (#4): more incremental davit.py updates, clean up
* Update test_models.py
* Update davit.py (many repeated incremental updates)
* Update test_models.py
* Update davit.py
2022-12-11 03:03:22 -08:00
Ross Wightman
e7da205345
Fix aa min_max level clamp
2022-12-10 16:43:28 -08:00
Ross Wightman
e3b2f5be0a
Add 3-Augment support to auto_augment.py, clean up weighted choice handling, and allow adjust per op prob via arg string
2022-12-10 16:25:50 -08:00
Ross Wightman
d5e7d6b27e
Merge remote-tracking branch 'origin/main' into refactor-imports
2022-12-09 14:49:44 -08:00
Ross Wightman
cda39b35bd
Add a deprecation phase to module re-org
2022-12-09 14:39:45 -08:00
Fredo Guan
edea013dd1
Davit std (#3)
...
Davit with all features working
2022-12-09 02:53:21 -08:00
Ross Wightman
7c4ed4d5a4
Add EVA-large models
2022-12-08 16:21:30 -08:00
Fredo Guan
434a03937d
Merge branch 'rwightman:main' into main
2022-12-08 08:05:16 -08:00
Ross Wightman
98047ef5e3
Add EVA FT results, hopefully fix BEiT test failures
2022-12-07 08:54:06 -08:00
Ross Wightman
3cc4d7a894
Fix missing register for 224 eva model
2022-12-07 08:54:06 -08:00
Ross Wightman
eba07b0de7
Add eva models to beit.py
2022-12-07 08:54:06 -08:00
Fredo Guan
3bd96609c8
Davit (#1)
...
Implement the davit model from https://arxiv.org/abs/2204.03645 and https://github.com/dingmyu/davit
2022-12-06 17:19:25 -08:00
Ross Wightman
927f031293
Major module / path restructure, timm.models.layers -> timm.layers, add _ prefix to all non model modules in timm.models
2022-12-06 15:00:06 -08:00
Ross Wightman
3785c234d7
Remove clip vit models that won't be ft and comment two that aren't uploaded yet
2022-12-05 10:21:34 -08:00
Ross Wightman
f82239b30e
multi-weight branch version -> 0.8.0dev
2022-12-05 10:21:34 -08:00
Ross Wightman
755570e2d6
Rename _pretrained.py -> pretrained.py, not feasible to change the other files to same scheme without breaking uses
2022-12-05 10:21:34 -08:00
Ross Wightman
72cfa57761
Add ported Tensorflow MaxVit weights. Add a few more CLIP ViT fine-tunes. Tweak some model tag names. Improve model tag name sorting. Update HF hub push config layout.
2022-12-05 10:21:34 -08:00
Ross Wightman
4d5c395160
MaxVit, ViT, ConvNeXt, and EfficientNet-v2 updates
...
* Add support for TF weights and modelling specifics to MaxVit (testing ported weights)
* More fine-tuned CLIP ViT configs
* ConvNeXt and MaxVit updated to new pretrained cfgs use
* EfficientNetV2, MaxVit and ConvNeXt high res models use squash crop/resize
2022-12-05 10:21:34 -08:00
Ross Wightman
3db4e346e0
Switch TFDS dataset to use INTEGER_ACCURATE jpeg decode by default
2022-12-05 10:21:34 -08:00
Ross Wightman
9da7e3a799
Add crop_mode for pretraind config / image transforms. Add support for dynamo compilation to benchmark/train/validate
2022-12-05 10:21:34 -08:00
Ross Wightman
b2b6285af7
Add two more FT clip weights
2022-12-05 10:21:34 -08:00
Ross Wightman
5895056dc4
Add openai b32 ft
2022-12-05 10:21:34 -08:00
Ross Wightman
9dea5143d5
Adding more clip ft variants
2022-12-05 10:21:34 -08:00
Ross Wightman
444dcba4ad
CLIP B16 12k weights added
2022-12-05 10:21:34 -08:00
Ross Wightman
dff4717cbf
Add clip b16 384x384 finetunes
2022-12-05 10:21:34 -08:00
Ross Wightman
883fa2eeaa
Add fine-tuned B/16 224x224 in1k clip models
2022-12-05 10:21:34 -08:00
Ross Wightman
9a3d2ac2d5
Add latest CLIP ViT fine-tune pretrained configs / model entrypt updates
2022-12-05 10:21:34 -08:00
Ross Wightman
42bbbddee9
Add missing model config
2022-12-05 10:21:34 -08:00
Ross Wightman
def68befa7
Updating vit model defs for multi-weight support trial (vit first). Prepping for CLIP (laion2b and openai) fine-tuned weights.
2022-12-05 10:21:34 -08:00
Ross Wightman
0dadb4a6e9
Initial multi-weight support, handled so old pretraining config handling co-exists with new tags.
2022-12-05 10:21:34 -08:00
Jerome Rony
3491506fec
Add foreach option for faster EMA
2022-11-30 14:06:58 -05:00
hongxin xiang
653bdc7105
Fix comment: https://github.com/rwightman/pytorch-image-models/pull/1564#issuecomment-1326743424
2022-11-25 09:52:52 +08:00
hongxin xiang
bdc9fad638
Fix compatibility bug: QMNIST and ImageNet datasets do not exist in torchvision 0.10.1.
2022-11-24 14:37:44 +08:00
Jerome Rony
6ec5cd6a99
Use in-place operations for EMA
2022-11-17 11:53:29 -05:00
Wauplin
9b114754db
refactor push_to_hub helper
2022-11-16 12:03:34 +01:00
Wauplin
ae0a0db7de
Create repo before cloning with Repository.clone_from
2022-11-15 15:17:20 +01:00
Ross Wightman
803254bb40
Fix spacing misalignment for fast norm path in LayerNorm modules
2022-10-24 21:43:49 -07:00
Ross Wightman
475ecdfa3d
cast env var args for dataset readers to int
2022-10-17 14:40:11 -07:00
Hoan Nguyen
39190f5f44
Remove inplace operators when calculating the loss
...
Remove inplace operators to overcome the following error when using `asymmetric_loss`
```
RuntimeError: one of the variables needed for gradient computation has been modified by an inplace operation
```
2022-10-17 10:41:21 +02:00
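The class of fix, sketched generically (names illustrative): replace in-place tensor ops on the loss path with out-of-place equivalents so tensors saved for backward aren't mutated:
```python
import torch

def weighted_loss(loss: torch.Tensor, w: torch.Tensor) -> torch.Tensor:
    # `loss *= w` would mutate a tensor autograd may still need for backward;
    # the out-of-place form keeps the graph intact.
    return loss * w
```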
Ross Wightman
6635bc3f7d
Merge pull request #1479 from rwightman/script_cleanup
...
Train / val script enhancements, non-GPU (ie CPU) device support, HF datasets support, TFDS/WDS dataloading improvements
2022-10-15 09:29:39 -07:00
Ross Wightman
0e6023f032
Merge pull request #1381 from ChristophReich1996/master
...
Fix typo in PositionalEncodingFourier
2022-10-14 18:34:33 -07:00
Ross Wightman
66f4af7090
Merge remote-tracking branch 'origin/master' into script_cleanup
2022-10-14 15:54:00 -07:00
Ross Wightman
d3961536c9
comment some debug logs for WDS dataset
2022-10-14 15:39:00 -07:00
Ross Wightman
e9dccc918c
Rename dataset/parsers -> dataset/readers, create_parser to create_reader, etc
2022-10-14 15:14:38 -07:00
Ross Wightman
8c28363dc9
Version 0.7.dev0 for master
2022-10-14 09:38:02 -07:00
nateraw
30bafd7347
🔖 add dev suffix to version tag
2022-10-13 17:08:33 -04:00
Ross Wightman
f67a7ee8bd
Set num_workers in Iterable WDS/TFDS datasets early so sample estimate is correct
2022-10-11 15:11:18 -07:00
Ross Wightman
cea8df3d0c
Version 0.6.12
2022-10-10 21:49:52 -07:00
Ross Wightman
9914f744dc
Add more maxxvit weights including ConvNeXt conv block based experiments.
2022-10-10 21:49:18 -07:00
Ross Wightman
b1b024dfed
Scheduler update, add v2 factory method, support scheduling on updates instead of just epochs. Add LR to summary csv. Add lr_base scaling calculations to train script. Fix #1168
2022-10-07 10:43:04 -07:00
Ross Wightman
4f18d6dc5f
Fix logs in WDS parser
2022-10-07 10:06:17 -07:00
Mohamed Rashad
8fda68aff6
Fix repo id bug
...
This fixes issue #1482
2022-10-05 16:26:06 +02:00
Ross Wightman
b8c8550841
Data improvements. Improve train support for in_chans != 3. Add wds dataset support from bits_and_tpu branch w/ fixes and tweaks. TFDS tweaks.
2022-09-29 16:42:58 -07:00
Alex Fafard
7327792f39
update to support pickle based dictionaries
2022-09-27 11:13:48 -04:00
Ross Wightman
1199c5a1a4
clip_laion2b models need 1e-5 eps for LayerNorm
2022-09-25 10:36:54 -07:00
Ross Wightman
87939e6fab
Refactor device handling in scripts, distributed init to be less 'cuda' centric. More device args passed through where needed.
2022-09-23 16:08:59 -07:00
Ross Wightman
c88947ad3d
Add initial Hugging Face Datasets parser impl.
2022-09-23 16:08:19 -07:00
Ross Wightman
e858912e0c
Add brute-force checkpoint remapping option
2022-09-23 16:07:03 -07:00
Ross Wightman
b293dfa595
Add CL SE module
2022-09-23 16:06:09 -07:00
Ross Wightman
2a296412be
Add Adan optimizer
2022-09-23 16:05:52 -07:00
Ross Wightman
5dc4343308
version 0.6.11
2022-09-23 13:54:56 -07:00
Ross Wightman
a383ef99f5
Make huggingface_hub necessary if it's the only source for a pretrained weight
2022-09-23 13:54:21 -07:00
Ross Wightman
33e30f8c8b
Remove layer-decay print
2022-09-18 21:33:03 -07:00
Ross Wightman
e069249a2d
Add hf hub entries for laion2b clip models, add huggingface_hub dependency, update some setup/reqs, torch >= 1.7
2022-09-16 21:39:05 -07:00
Ross Wightman
9d65557be3
Fix errant import
2022-09-15 17:47:23 -07:00
Ross Wightman
9709dbaaa9
Adding support for fine-tune CLIP LAION-2B image tower weights for B/32, L/14, H/14 and g/14. Still WIP
2022-09-15 17:25:59 -07:00
Ross Wightman
a520da9b49
Update tresnet features_info for v2
2022-09-13 20:54:54 -07:00
Ross Wightman
c8ab747bf4
BEiT-V2 checkpoints didn't remove 'module' from weights, adapt checkpoint filter
2022-09-13 17:56:49 -07:00
Ross Wightman
73049dc2aa
Fix type in dla weight update
2022-09-13 17:52:45 -07:00
Ross Wightman
3599c7e6a4
version 0.6.10
2022-09-13 16:37:02 -07:00
Ross Wightman
e11efa872d
Update a bunch of weights with external links to timm release assets. Fixes issue with *aliyuncs.com returning forbidden. Did pickle scan / verify and re-hash. Add TresNet-V2-L weights.
2022-09-13 16:35:26 -07:00
Ross Wightman
fa8c84eede
Update maxvit_tiny_256 weight to better iter, add coatnet / maxvit / maxxvit model defs for future runs
2022-09-07 12:37:37 -07:00
Ross Wightman
c1b3cea19d
Add maxvit_rmlp_tiny_rw_256 model def and weights w/ 84.2 top-1 @ 256, 84.8 @ 320
2022-09-07 10:27:11 -07:00
Ross Wightman
914544fc81
Add beitv2 224x224 checkpoints from https://github.com/microsoft/unilm/tree/master/beit2
2022-09-06 20:25:18 -07:00
Ross Wightman
dc90816f26
Add `maxvit_tiny_rw_224` weights 83.5 @ 224 and `maxvit_rmlp_pico_rw_256` relpos weights, 80.5 @ 256, 81.3 @ 320
2022-09-06 16:14:41 -07:00
Ross Wightman
f489f02ad1
Make gcvit window size ratio based to improve resolution changing support #1449. Change default init to original.
2022-09-06 16:14:00 -07:00
Ross Wightman
7f1b223c02
Add maxvit_rmlp_nano_rw_256 model def & weights, make window/grid size dynamic wrt img_size by default
2022-08-29 15:49:32 -07:00
Ross Wightman
e6a4361306
pretrained_cfg entry for mvitv2_small_cls
2022-08-28 15:27:01 -07:00
Ross Wightman
f66e5f0e35
Fix class token support in MViT-V2, add small_class variant to ensure it's tested. Fix #1443
2022-08-28 15:24:04 -07:00
Ross Wightman
f1d2160d85
Update a few maxxvit comments, rename PartitionAttention -> PartitionAttentionCl for consistency with other blocks
2022-08-26 12:53:49 -07:00
Ross Wightman
eca6f0a25c
Fix syntax error (extra dataclass comma) in maxxvit.py
2022-08-26 11:29:09 -07:00
Ross Wightman
ff6a919cf5
Add --fast-norm arg to benchmark.py, train.py, validate.py
2022-08-25 17:20:46 -07:00
Ross Wightman
769ab4b98a
Clean up no_grad for trunc normal weight inits
2022-08-25 16:29:52 -07:00
Ross Wightman
48e1df8b37
Add norm/norm_act header comments
2022-08-25 16:29:34 -07:00
Ross Wightman
7c2660576d
Tweak init for convnext block using maxxvit/coatnet.
2022-08-25 15:30:59 -07:00
Ross Wightman
1d8d6f6072
Fix two default args in DenseNet blocks... fix #1427
2022-08-25 15:00:35 -07:00
Ross Wightman
527f9a4cb2
Updated to correct maxvit_nano weights...
2022-08-24 12:42:11 -07:00
Ross Wightman
b2e8426fca
Make k=stride=2 ('avg2') pooling default for coatnet/maxvit. Add weight links. Rename 'combined' partition to 'parallel'.
2022-08-24 11:01:20 -07:00
Ross Wightman
837c68263b
For ConvNeXt, use timm internal LayerNorm for fast_norm in non conv_mlp mode
2022-08-23 15:17:12 -07:00
Ross Wightman
cac0a4570a
More test fixes, pool size for 256x256 maxvit models
2022-08-23 13:38:26 -07:00
Ross Wightman
e939ed19b9
Rename internal creation fn for maxvit, has not been just coatnet for a while...
2022-08-22 17:44:51 -07:00
Ross Wightman
ffaf97f813
MaxxVit! A very configurable MaxVit and CoAtNet impl with lots of goodies..
2022-08-22 17:42:10 -07:00
Ross Wightman
8c9696c9df
More model and test fixes
2022-08-22 17:40:31 -07:00
Ross Wightman
ca52108c2b
Fix some model support functions
2022-08-19 10:20:51 -07:00
Ross Wightman
f332fc2db7
Fix some test failures, torchscript issues
2022-08-18 16:19:46 -07:00
Ross Wightman
6e559e9b5f
Add MViT (Multi-Scale) V2
2022-08-17 15:12:31 -07:00
Ross Wightman
43aa84e861
Add 'fast' layer norm that doesn't cast to float32, support APEX LN impl for slight speed gain, update norm and act factories, tweak SE for ability to disable bias (needed by GCVit)
2022-08-17 14:32:58 -07:00
Ross Wightman
c486aa71f8
Add GCViT
2022-08-17 14:29:18 -07:00
Ross Wightman
fba6ecd39b
Add EfficientFormer
2022-08-17 14:08:53 -07:00
Ross Wightman
ff4a38e2c3
Add PyramidVisionTransformerV2
2022-08-17 12:06:05 -07:00
Ross Wightman
1d8ada359a
Add timm ConvNeXt 'atto' weights, change test resolution for FB ConvNeXt 224x224 weights, add support for different dw kernel_size
2022-08-15 17:56:08 -07:00
Ross Wightman
2544d3b80f
ConvNeXt pico, femto, and nano models, plus pico and femto ols (overlapping stem) weights and model defs
2022-08-05 17:05:50 -07:00
Ross Wightman
13565aad50
Add edgenext_base model def & weight link, update to improve ONNX export #1385
2022-08-05 16:58:34 -07:00
Ross Wightman
8ad4bdfa06
Allow ntuple to be used with string values
2022-07-28 16:18:18 -07:00
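Sketched effect, assuming the helper is importable as to_2tuple (from timm.layers in current releases): strings previously counted as iterables and passed through unchanged; after this change they repeat like scalars:
```python
from timm.layers import to_2tuple

to_2tuple(3)       # -> (3, 3)
to_2tuple('same')  # -> ('same', 'same'), enabling string padding modes and similar
```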
Christoph Reich
faae93e62d
Fix typo in PositionalEncodingFourier
2022-07-28 19:08:08 -04:00
Ross Wightman
7430a85d07
Update README, bump version to 0.6.8
2022-07-28 15:07:11 -07:00
Ross Wightman
ec6a28830f
Add DeiT-III 'medium' model defs and weights
2022-07-28 15:03:20 -07:00
Ross Wightman
d875a1d3f6
version 0.6.7
2022-07-27 12:41:06 -07:00
Ross Wightman
6f103a442b
Add convnext_nano weights, 80.8 @ 224, 81.5 @ 288
2022-07-26 16:40:27 -07:00
Ross Wightman
4042a94f8f
Add weights for two 'Edge' block (3x3->1x1) variants of CS3 networks.
2022-07-26 16:40:27 -07:00