Ross Wightman
|
9d65557be3
|
Fix errant import
|
2022-09-15 17:47:23 -07:00 |
Ross Wightman
|
9709dbaaa9
|
Adding support for fine-tune CLIP LAION-2B image tower weights for B/32, L/14, H/14 and g/14. Still WIP
|
2022-09-15 17:25:59 -07:00 |
Ross Wightman
|
a520da9b49
|
Update tresnet features_info for v2
|
2022-09-13 20:54:54 -07:00 |
Ross Wightman
|
c8ab747bf4
|
BEiT-V2 checkpoints didn't remove 'module' from weights, adapt checkpoint filter
|
2022-09-13 17:56:49 -07:00 |
Ross Wightman
|
73049dc2aa
|
Fix type in dla weight update
|
2022-09-13 17:52:45 -07:00 |
Ross Wightman
|
3599c7e6a4
|
version 0.6.10
|
2022-09-13 16:37:02 -07:00 |
Ross Wightman
|
e11efa872d
|
Update a bunch of weights with external links to timm release assets. Fixes issue with *aliyuncs.com returning forbidden. Did pickle scan / verify and re-hash. Add TresNet-V2-L weights.
|
2022-09-13 16:35:26 -07:00 |
Ross Wightman
|
fa8c84eede
|
Update maxvit_tiny_256 weight to better iter, add coatnet / maxvit / maxxvit model defs for future runs
|
2022-09-07 12:37:37 -07:00 |
Ross Wightman
|
c1b3cea19d
|
Add maxvit_rmlp_tiny_rw_256 model def and weights w/ 84.2 top-1 @ 256, 84.8 @ 320
|
2022-09-07 10:27:11 -07:00 |
Ross Wightman
|
914544fc81
|
Add beitv2 224x224 checkpoints from https://github.com/microsoft/unilm/tree/master/beit2
|
2022-09-06 20:25:18 -07:00 |
Ross Wightman
|
dc90816f26
|
Add `maxvit_tiny_rw_224` weights 83.5 @ 224 and `maxvit_rmlp_pico_rw_256` relpos weights, 80.5 @ 256, 81.3 @ 320
|
2022-09-06 16:14:41 -07:00 |
Ross Wightman
|
f489f02ad1
|
Make gcvit window size ratio based to improve resolution changing support #1449. Change default init to original.
|
2022-09-06 16:14:00 -07:00 |
Ross Wightman
|
7f1b223c02
|
Add maxvit_rmlp_nano_rw_256 model def & weights, make window/grid size dynamic wrt img_size by default
|
2022-08-29 15:49:32 -07:00 |
Ross Wightman
|
e6a4361306
|
pretrained_cfg entry for mvitv2_small_cls
|
2022-08-28 15:27:01 -07:00 |
Ross Wightman
|
f66e5f0e35
|
Fix class token support in MViT-V2, add small_class variant to ensure it's tested. Fix #1443
|
2022-08-28 15:24:04 -07:00 |
Ross Wightman
|
f1d2160d85
|
Update a few maxxvit comments, rename PartitionAttention -> PartitionAttenionCl for consistency with other blocks
|
2022-08-26 12:53:49 -07:00 |
Ross Wightman
|
eca6f0a25c
|
Fix syntax error (extra dataclass comma) in maxxvit.py
|
2022-08-26 11:29:09 -07:00 |
Ross Wightman
|
ff6a919cf5
|
Add --fast-norm arg to benchmark.py, train.py, validate.py
|
2022-08-25 17:20:46 -07:00 |
Ross Wightman
|
769ab4b98a
|
Clean up no_grad for trunc normal weight inits
|
2022-08-25 16:29:52 -07:00 |
Ross Wightman
|
48e1df8b37
|
Add norm/norm_act header comments
|
2022-08-25 16:29:34 -07:00 |
Ross Wightman
|
7c2660576d
|
Tweak init for convnext block using maxxvit/coatnext.
|
2022-08-25 15:30:59 -07:00 |
Ross Wightman
|
1d8d6f6072
|
Fix two default args in DenseNet blocks... fix #1427
|
2022-08-25 15:00:35 -07:00 |
Ross Wightman
|
527f9a4cb2
|
Updated to correct maxvit_nano weights...
|
2022-08-24 12:42:11 -07:00 |
Ross Wightman
|
b2e8426fca
|
Make k=stride=2 ('avg2') pooling default for coatnet/maxvit. Add weight links. Rename 'combined' partition to 'parallel'.
|
2022-08-24 11:01:20 -07:00 |
Ross Wightman
|
837c68263b
|
For ConvNeXt, use timm internal LayerNorm for fast_norm in non conv_mlp mode
|
2022-08-23 15:17:12 -07:00 |
Ross Wightman
|
cac0a4570a
|
More test fixes, pool size for 256x256 maxvit models
|
2022-08-23 13:38:26 -07:00 |
Ross Wightman
|
e939ed19b9
|
Rename internal creation fn for maxvit, has not been just coatnet for a while...
|
2022-08-22 17:44:51 -07:00 |
Ross Wightman
|
ffaf97f813
|
MaxxVit! A very configurable MaxVit and CoAtNet impl with lots of goodies..
|
2022-08-22 17:42:10 -07:00 |
Ross Wightman
|
8c9696c9df
|
More model and test fixes
|
2022-08-22 17:40:31 -07:00 |
Ross Wightman
|
ca52108c2b
|
Fix some model support functions
|
2022-08-19 10:20:51 -07:00 |
Ross Wightman
|
f332fc2db7
|
Fix some test failures, torchscript issues
|
2022-08-18 16:19:46 -07:00 |
Ross Wightman
|
6e559e9b5f
|
Add MViT (Multi-Scale) V2
|
2022-08-17 15:12:31 -07:00 |
Ross Wightman
|
43aa84e861
|
Add 'fast' layer norm that doesn't cast to float32, support APEX LN impl for slight speed gain, update norm and act factories, tweak SE for ability to disable bias (needed by GCVit)
|
2022-08-17 14:32:58 -07:00 |
Ross Wightman
|
c486aa71f8
|
Add GCViT
|
2022-08-17 14:29:18 -07:00 |
Ross Wightman
|
fba6ecd39b
|
Add EfficientFormer
|
2022-08-17 14:08:53 -07:00 |
Ross Wightman
|
ff4a38e2c3
|
Add PyramidVisionTransformerV2
|
2022-08-17 12:06:05 -07:00 |
Ross Wightman
|
1d8ada359a
|
Add timm ConvNeXt 'atto' weights, change test resolution for FB ConvNeXt 224x224 weights, add support for different dw kernel_size
|
2022-08-15 17:56:08 -07:00 |
Ross Wightman
|
2544d3b80f
|
ConvNeXt pico, femto, and nano, pico, femto ols (overlapping stem) weights and model defs
|
2022-08-05 17:05:50 -07:00 |
Ross Wightman
|
13565aad50
|
Add edgenext_base model def & weight link, update to improve ONNX export #1385
|
2022-08-05 16:58:34 -07:00 |
Ross Wightman
|
8ad4bdfa06
|
Allow ntuple to be used with string values
|
2022-07-28 16:18:18 -07:00 |
Ross Wightman
|
7430a85d07
|
Update README, bump version to 0.6.8
|
2022-07-28 15:07:11 -07:00 |
Ross Wightman
|
ec6a28830f
|
Add DeiT-III 'medium' model defs and weights
|
2022-07-28 15:03:20 -07:00 |
Ross Wightman
|
d875a1d3f6
|
version 0.6.7
|
2022-07-27 12:41:06 -07:00 |
Ross Wightman
|
6f103a442b
|
Add convnext_nano weights, 80.8 @ 224, 81.5 @ 288
|
2022-07-26 16:40:27 -07:00 |
Ross Wightman
|
4042a94f8f
|
Add weights for two 'Edge' block (3x3->1x1) variants of CS3 networks.
|
2022-07-26 16:40:27 -07:00 |
Ross Wightman
|
c8f69e04a9
|
Merge pull request #1365 from veritable-tech/fix-resize-pos-embed
Take `no_emb_class` into account when calling `resize_pos_embed`
|
2022-07-24 21:03:01 -07:00 |
Ceshine Lee
|
0b64117592
|
Take `no_emb_class` into account when calling `resize_pos_embed`
|
2022-07-24 19:11:45 +08:00 |
Jasha10
|
56c3a84db3
|
Update type hint for `register_notrace_module`
register_notrace_module is used to decorate types (i.e. subclasses of nn.Module).
It is not called on module instances.
|
2022-07-22 16:59:55 -05:00 |
Ross Wightman
|
1b278136c3
|
Change models with mean 0,0,0 std 1,1,1 from int to float for consistency as mentioned in #1355
|
2022-07-21 17:36:15 -07:00 |
Ross Wightman
|
909705e7ff
|
Remove some redundant requires_grad=True from nn.Parameter in third party code
|
2022-07-20 12:37:41 -07:00 |