Ross Wightman
|
2aabaef039
|
Merge pull request #1784 from huggingface/wip-voidbag-accumulate-grad
Accumulate gradients (adding to #1659)
|
2023-04-20 08:15:28 -07:00 |
Ross Wightman
|
a83e9f2d3b
|
forward & backward in same no_sync context, slightly easier to read that splitting
|
2023-04-20 08:14:05 -07:00 |
Ross Wightman
|
f4825a09ef
|
Merge pull request #212 from bryant1410/patch-1
Fix MultiEpochsDataLoader when there's no batching
|
2023-04-20 07:09:27 -07:00 |
Ross Wightman
|
4cd7fb88b2
|
clip gradients with update
|
2023-04-19 23:36:20 -07:00 |
Ross Wightman
|
df81d8d85b
|
Cleanup gradient accumulation, fix a few issues, a few other small cleanups in related code.
|
2023-04-19 23:11:00 -07:00 |
Ross Wightman
|
ab7ca62a6e
|
Merge branch 'main' of github.com:rwightman/pytorch-image-models into wip-voidbag-accumulate-grad
|
2023-04-19 11:08:12 -07:00 |
Ross Wightman
|
ec6cca4b37
|
Add head-init-scale and head-init-bias args that works for all models, fix #1718
|
2023-04-14 17:59:23 -07:00 |
Ross Wightman
|
34df125be6
|
cait, volo, xvit hub weights
|
2023-04-14 10:13:13 -07:00 |
Ross Wightman
|
f6d5767551
|
cspnet models on HF hub w/ multi-weight support
|
2023-04-12 14:02:38 -07:00 |
Ross Wightman
|
21b1c2f6a1
|
Update README.md
|
2023-04-12 09:24:35 -07:00 |
Ross Wightman
|
3bd35c7004
|
Merge pull request #1766 from huggingface/onnx_export
Add long missing onnx utils and export code
|
2023-04-12 09:06:16 -07:00 |
Ross Wightman
|
aef6e562e4
|
Add onnx utils and export code, tweak padding and conv2d_same for better dynamic export with recent PyTorch
|
2023-04-11 17:03:57 -07:00 |
Ross Wightman
|
80b247d843
|
Update swin_v2 attn_mask buffer change in #1790 to apply to updated checkpoints in hub
|
2023-04-11 14:40:32 -07:00 |
Ross Wightman
|
1a1aca0cee
|
Merge pull request #1761 from huggingface/patch_drop_refactor
Implement patch dropout for eva / vision_transformer, refactor dropout args
|
2023-04-11 14:37:36 -07:00 |
Ross Wightman
|
5fa53c5f31
|
Merge pull request #1760 from MarcoForte/patch-1
skip SwinV2 attention mask buffers
|
2023-04-11 14:37:02 -07:00 |
Ross Wightman
|
c0670822d2
|
Small factory handling fix for pretrained tag vs cfg
|
2023-04-11 07:42:13 -07:00 |
Ross Wightman
|
2f25f73b90
|
Missed a fused_attn update in relpos vit
|
2023-04-10 23:30:50 -07:00 |
Ross Wightman
|
0b65b5c0ac
|
Add finalized eva CLIP weights pointing to remapped timm hub models
|
2023-04-10 23:13:12 -07:00 |
Ross Wightman
|
965d0a2d36
|
fast_attn -> fused_attn, implement global config for enable/disable fused_attn, add to more models. vit clip openai 336 weights.
|
2023-04-10 12:04:33 -07:00 |
Ross Wightman
|
4d135421a3
|
Implement patch dropout for eva / vision_transformer, refactor / improve consistency of dropout args across all vit based models
|
2023-04-07 20:27:23 -07:00 |
Marco Forte
|
c76818a592
|
skip attention mask buffers
Allows more flexibility in the resolutions accepted by SwinV2.
|
2023-04-07 18:50:02 +02:00 |
Ross Wightman
|
1bb3989b61
|
Improve kwarg passthrough for swin, vit, deit, beit, eva
|
2023-04-05 21:37:16 -07:00 |
Ross Wightman
|
a09e240cd6
|
Merge pull request #1756 from huggingface/mw-resnet
ResNet models on HF hub, multi-weight support
|
2023-04-05 17:42:48 -07:00 |
Ross Wightman
|
35c94b836c
|
Update warning message for deprecated model names
|
2023-04-05 17:24:17 -07:00 |
Ross Wightman
|
9eaab795c2
|
Add some vit model deprecations
|
2023-04-05 17:21:03 -07:00 |
Ross Wightman
|
b17abd35b2
|
Version 0.8.19dev0
|
2023-04-05 16:37:16 -07:00 |
Ross Wightman
|
647ba98d23
|
Update README
|
2023-04-05 16:37:07 -07:00 |
Ross Wightman
|
abff3f12ec
|
Wrong pool_size for 288 ft
|
2023-04-05 16:07:51 -07:00 |
Ross Wightman
|
356309959c
|
ResNet models on HF hub, multi-weight support, add torchvision v2 weights, new 12k pretrained and fine-tuned timm anti-aliased weights
|
2023-04-05 14:19:42 -07:00 |
Ross Wightman
|
7501972cd6
|
Version 0.8.18dev0
|
2023-03-31 16:51:26 -07:00 |
Ross Wightman
|
a84bf11887
|
Update README
|
2023-03-31 16:51:02 -07:00 |
Ross Wightman
|
beef7f0a22
|
Add ImageNet-12k intermediate fine-tunes of convnext base & large CLIP models, add first 1k fine-tune of xxlarge
|
2023-03-31 16:45:01 -07:00 |
Ross Wightman
|
9aa1133bd2
|
Fix #1750, uncomment weight that exists on HF hub, add FIXME to 3 others that are still on local storage
|
2023-03-31 14:49:30 -07:00 |
Ross Wightman
|
3e45db4853
|
Update README.md
|
2023-03-31 12:24:10 -07:00 |
Ross Wightman
|
7326470514
|
Merge pull request #1746 from huggingface/eva02
Adding EVA02 weights and model defs
|
2023-03-31 12:17:00 -07:00 |
Ross Wightman
|
adeb9de7c6
|
Mismatch in eva pretrained_cfg vs model for one of the clip variants
|
2023-03-31 10:30:30 -07:00 |
Ross Wightman
|
85f6cb6637
|
Update README
|
2023-03-31 08:42:51 -07:00 |
Ross Wightman
|
067c7281e2
|
Another test filter adjustment
|
2023-03-31 08:33:26 -07:00 |
Ross Wightman
|
3825812f1a
|
Update test filtering for enormoous
|
2023-03-31 00:04:56 -07:00 |
Ross Wightman
|
0737bd3ec8
|
eva02 non-CLIP weights on HF hub, add initial eva02 clip model configs w/ postnorm variant & attn LN
|
2023-03-30 23:43:59 -07:00 |
Ross Wightman
|
ac67098147
|
Add final attr for fast_attn on beit / eva
|
2023-03-28 08:40:40 -07:00 |
Ross Wightman
|
1885bdc431
|
Merge pull request #1745 from huggingface/mw-mlp_mixer
MLP-Mixer multi-weight support, HF hub push
|
2023-03-28 07:55:17 -07:00 |
Ross Wightman
|
2362f79062
|
Merge pull request #1748 from huggingface/mw-deit
Multi-weight and HF hub for deit / deit3
|
2023-03-28 07:54:58 -07:00 |
Ross Wightman
|
a84abe6656
|
Add eva02 to non-std test models
|
2023-03-27 22:56:52 -07:00 |
Ross Wightman
|
e9f427b953
|
Add hf hub entries for mlp_mixer
|
2023-03-27 22:50:43 -07:00 |
Ross Wightman
|
cff81deb78
|
multi-weight and hf hub for deit / deit3
|
2023-03-27 22:47:16 -07:00 |
Ross Wightman
|
3863d63516
|
Adding EVA02 weights and model defs, move beit based eva_giant to same eva.py file. Cleanup rotary pos, add lang oriented freq bands to be compat with eva design choice. Fix #1738
|
2023-03-27 17:16:07 -07:00 |
Ross Wightman
|
b12060996c
|
MLP-Mixer multi-weight support, hf hub push
|
2023-03-27 16:42:13 -07:00 |
Ross Wightman
|
56b90317cd
|
Change torchrun args to use _ instead of -, - is the new format, but looks like _ still works for backward compat with old versions. Fix #1742
|
2023-03-26 20:23:55 -07:00 |
Ross Wightman
|
d196fa536d
|
Fix last min torchscript regression in nfnet changes
|
2023-03-24 00:10:17 -07:00 |