Ross Wightman
|
7cfaeced67
|
Change adafactor_bv epsilon default
|
2024-11-12 20:49:01 -08:00 |
Ross Wightman
|
0b5ae49251
|
Remove adafactorbv numpy dep, hack fix for loading optimizer state w/ half prec momentum (need better one)
|
2024-11-12 20:49:01 -08:00 |
Ross Wightman
|
19090ea966
|
Need to init momentum with correct dtype
|
2024-11-12 20:49:01 -08:00 |
Ross Wightman
|
484a88f4b4
|
Remove unused beta2 fn, make eps grad^2 handling same across factorized and non-factorized cases
|
2024-11-12 20:49:01 -08:00 |
Ross Wightman
|
7c16adca83
|
An impl of adafactor as per big vision (scaling vit) changes
|
2024-11-12 20:49:01 -08:00 |
Ross Wightman
|
363b043c13
|
Extend train epoch schedule by warmup_epochs if warmup_prefix enabled, allows schedule to reach end w/ prefix enabled
|
2024-11-08 11:01:11 -08:00 |
Augustin Godinot
|
7f0c1b1f30
|
Add trust_remote_code argument to ReaderHfds
|
2024-11-08 08:16:36 -08:00 |
Wojtek Jasiński
|
eb94efb218
|
fix pos embed dynamic resampling for eva
|
2024-11-06 16:03:27 -08:00 |
Wojtek Jasiński
|
3c7822c621
|
fix pos embed dynamic resampling for deit
|
2024-11-06 16:03:27 -08:00 |
Wojtek Jasiński
|
3ae3f44288
|
Fix positional embedding resampling for non-square inputs in ViT
|
2024-11-06 16:03:27 -08:00 |
Ross Wightman
|
d4dde48dd5
|
Missed first_conv from resnet18d
|
2024-10-31 19:29:53 -07:00 |
Ross Wightman
|
e6263bf64d
|
Add resnet and resnet-v2 18/34 weights trained with mnv4 small based recipe
|
2024-10-31 16:39:35 -07:00 |
Ross Wightman
|
f5b58e31a2
|
Allow non train mode for wds reader to operate w/o sample count, exhaust iterator
|
2024-10-31 16:39:35 -07:00 |
Ross Wightman
|
f689c850b9
|
One more small c&p issue
|
2024-10-23 21:51:09 -07:00 |
Ross Wightman
|
baa7242dd3
|
Fix c&p error, slight reformat
|
2024-10-23 21:51:09 -07:00 |
Ross Wightman
|
1b5cae681c
|
Update some clip pretrained weights to point to new hub locations, add a few missing weights
|
2024-10-23 21:51:09 -07:00 |
Ross Wightman
|
310ffa32c5
|
Update version.py
dev version 1.0.12.dev0
|
2024-10-19 09:56:17 -07:00 |
Ross Wightman
|
015fbe457a
|
Merge branch 'MengqingCao-npu_support' into device_amp_cleanup
|
2024-10-18 14:50:44 -07:00 |
Ross Wightman
|
81b59faf77
|
Merge branch 'npu_support' of github.com:MengqingCao/pytorch-image-models into MengqingCao-npu_support
|
2024-10-18 14:50:00 -07:00 |
Ross Wightman
|
1766a01f96
|
Cleanup some amp related behaviour to better support different (non-cuda) devices
|
2024-10-18 13:54:16 -07:00 |
MengqingCao
|
37c731ca37
|
fix device check
|
2024-10-17 12:38:02 +00:00 |
Feraidoon Mehri
|
ca20e102fe
|
mambaout.py: fixed bug
|
2024-10-17 01:03:28 +03:30 |
Ross Wightman
|
8cb2548962
|
Version 1.0.11
|
2024-10-16 14:14:44 -07:00 |
Ross Wightman
|
89dffc5ff0
|
Another small fix for original mambaout models, no classifier nn.Linear when num_classes=0 on init
|
2024-10-16 12:36:36 -07:00 |
Ross Wightman
|
fad4538801
|
Elevate import deprecation warnings from DeprecationWarning to FutureWarning so messages are now seen
|
2024-10-16 11:30:01 -07:00 |
Ross Wightman
|
a1f379e712
|
Add intern300m vit w/ converted timm weights. Fix #2300
|
2024-10-16 10:29:06 -07:00 |
MengqingCao
|
234f975787
|
add npu support
|
2024-10-16 07:13:45 +00:00 |
Ross Wightman
|
60f517c883
|
Fix wrong name in __all__ for models._registry
|
2024-10-15 07:39:46 -07:00 |
Ross Wightman
|
b4a9a166c3
|
Version 1.0.10
|
2024-10-14 21:40:30 -07:00 |
Ross Wightman
|
c3052fa19e
|
Merge pull request #2298 from huggingface/preact_resnet18
Add resnet18/18d pre-act model configs for potential training.
|
2024-10-14 19:39:04 -07:00 |
Ross Wightman
|
abdf33145c
|
Add 34/34d pre-act resnet variants
|
2024-10-14 13:23:50 -07:00 |
Ross Wightman
|
c82ce86f8f
|
Add 384x384 mambaout_base_plus model weights
|
2024-10-14 12:28:57 -07:00 |
Ross Wightman
|
82ae247879
|
MambaOut weights on hub, configs finalized
|
2024-10-11 11:07:40 -07:00 |
Ross Wightman
|
7efb60c299
|
Add first_conv for mambaout
|
2024-10-09 14:11:40 -07:00 |
Ross Wightman
|
5dc5ee5b42
|
Add global_pool to mambaout __init__ and pass to heads
|
2024-10-09 14:11:40 -07:00 |
Ross Wightman
|
9d1dfe8dbe
|
Incorrectly named head_hidden_size
|
2024-10-09 14:11:40 -07:00 |
Ross Wightman
|
91e743f2dd
|
Mambaout tweaks
|
2024-10-09 14:11:40 -07:00 |
Ross Wightman
|
4542cf03f9
|
Add features_only, other bits to mambaout, define different base alternatives
|
2024-10-09 14:11:40 -07:00 |
Ross Wightman
|
c2da12c7e1
|
Update rw models, fix heads
|
2024-10-09 14:11:40 -07:00 |
Ross Wightman
|
f2086f51a0
|
Add mambaout builder support, pretrained weight remap
|
2024-10-09 14:11:40 -07:00 |
Ross Wightman
|
c6ef54eefa
|
Initial mambaout work
|
2024-10-09 14:11:40 -07:00 |
Ross Wightman
|
d9321b0e10
|
Add weights for fine-tuned siglip so400m. Add webli_i18n pretrained tags for the multi-lingual model variants (incl older base)
|
2024-10-09 09:04:44 -07:00 |
Ross Wightman
|
01b62264af
|
Add i18n variant of so400m model w/ weights. Add two in1k fine-tunes of original so400m 384x384 but at 378x378 (better matches patch14)
|
2024-10-08 23:40:24 -07:00 |
Ross Wightman
|
72f0edb7e8
|
missed first_conv for rnv2 18d
|
2024-10-08 12:38:54 -07:00 |
Ross Wightman
|
3ed603a2ce
|
Add resnet18/18d pre-act model configs for potential training. Fix #2289
|
2024-10-08 11:28:07 -07:00 |
Ross Wightman
|
41a79e0fcb
|
Add overlapped stem convnext zepto weights
|
2024-10-08 11:26:34 -07:00 |
Ross Wightman
|
545bd4056c
|
Tag along test_vit3 weights
|
2024-09-30 12:03:32 -07:00 |
Ross Wightman
|
69b687d4cc
|
Add zepto weights
|
2024-09-30 11:43:23 -07:00 |
Ross Wightman
|
c6e5557a5a
|
Mismatch pretrained_cfg
|
2024-09-30 11:43:23 -07:00 |
Ross Wightman
|
5d7bd2973e
|
convnext zepto, rmsnorm experiments
|
2024-09-30 11:43:23 -07:00 |