Ross Wightman
6ccb7d6a7c
Merge pull request #2111 from jamesljlster/enhance_vit_get_intermediate_layers
Vision Transformer (ViT) get_intermediate_layers: enhanced to support dynamic image size and saved computational costs from unused blocks
2024-03-18 13:41:18 -07:00
Cheng-Ling Lai
db06b56d34
Saved computational costs of get_intermediate_layers() from unused blocks
2024-03-17 21:34:06 +08:00
Cheng-Ling Lai
4731e4efc4
Modified ViT get_intermediate_layers() to support dynamic image size
2024-03-16 23:07:21 +08:00
Ross Wightman
70ccf00c95
Merge pull request #2108 from huggingface/onnx_dynamo
Add support for dynamo based onnx export
2024-03-13 13:24:03 -07:00
Ross Wightman
ba641e07ae
Add support for dynamo based onnx export
2024-03-13 12:05:26 -07:00
Ross Wightman
2ec2f1aa73
Merge pull request #2105 from SmilingWolf/main
SwinV2: add configurable act_layer argument
2024-03-05 22:17:10 -08:00
SmilingWolf
59cb0be595
SwinV2: add configurable act_layer argument
Defaults to "gelu", but makes it possible to pass "gelu_tanh".
Makes it easier to port weights from JAX/Flax, where the tanh
approximation is the default.
2024-03-05 22:04:17 +01:00
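For context on the "gelu" versus "gelu_tanh" distinction named in the commit above, here is a minimal sketch of the two formulas in plain Python. The function names `gelu_exact`, `gelu_tanh`, and `max_abs_gap` are hypothetical helpers written for illustration; this is not timm's or SwinV2's actual code, only the standard erf-based GELU and its common tanh approximation.

```python
import math


def gelu_exact(x: float) -> float:
    """Erf-based GELU: x * Phi(x), where Phi is the standard normal CDF."""
    return 0.5 * x * (1.0 + math.erf(x / math.sqrt(2.0)))


def gelu_tanh(x: float) -> float:
    """Tanh approximation of GELU (illustrative helper, not a timm function)."""
    inner = math.sqrt(2.0 / math.pi) * (x + 0.044715 * x ** 3)
    return 0.5 * x * (1.0 + math.tanh(inner))


def max_abs_gap(lo: float = -6.0, hi: float = 6.0, steps: int = 1201) -> float:
    """Largest absolute difference between the two variants on an even grid."""
    step = (hi - lo) / (steps - 1)
    return max(
        abs(gelu_exact(lo + i * step) - gelu_tanh(lo + i * step))
        for i in range(steps)
    )


if __name__ == "__main__":
    # The gap is small but nonzero, so weights trained with one variant
    # may not reproduce outputs exactly when run with the other.
    print(f"max |exact - tanh| on [-6, 6]: {max_abs_gap():.2e}")
```

The two curves differ slightly, which is presumably why matching the activation variant matters when porting weights between frameworks.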
Ross Wightman
6e6f3686a7
Update README.md
2024-02-19 11:32:03 -08:00
Ross Wightman
49992b0dc7
Update version.py
Update to 0.9.16 for release
2024-02-19 11:08:17 -08:00
Ross Wightman
79f2ce99aa
Update changes.md
2024-02-19 11:07:28 -08:00
Ross Wightman
9926be4706
Update README.md
2024-02-19 11:06:32 -08:00
Ross Wightman
3df4ffe914
Delete requirements-docs.txt
2024-02-16 09:15:38 -08:00
Ross Wightman
155b32a1c2
Merge pull request #2096 from huggingface/pyproject_pdm
Remove setup.py, replace with pyproject.toml and pdm helpers
2024-02-16 09:04:46 -08:00
Ross Wightman
35d6eef0df
Version bump, add test markers back to toml
2024-02-16 09:04:00 -08:00
Ross Wightman
01616aa314
Remove setup.py, replace with pyproject.toml and pdm helpers
2024-02-15 17:39:22 -08:00
Ross Wightman
8a713b09e5
Merge branch 'seefun-master'
2024-02-12 15:01:45 -08:00
Ross Wightman
31e0dc0a5d
Tweak hgnet before merge
2024-02-12 15:00:32 -08:00
Ross Wightman
3e03491e49
Merge branch 'master' of https://github.com/seefun/pytorch-image-models into seefun-master
2024-02-12 14:59:54 -08:00
Ross Wightman
958938845a
Update version.py
2024-02-10 23:10:50 -08:00
Ross Wightman
1b50b15145
Merge pull request #2092 from huggingface/mesa_ema
ModelEMAV3 + MESA experiments
2024-02-10 23:10:27 -08:00
Ross Wightman
47c9bc4dc6
Fix device idx split
2024-02-10 21:41:14 -08:00
Ross Wightman
59239d9df5
Cleanup imports for vit relpos
2024-02-10 21:40:57 -08:00
Ross Wightman
ac1b08deb6
fix_init on vit & relpos vit
2024-02-10 20:15:37 -08:00
Ross Wightman
935950cc11
Fix F.sdpa attn drop prob
2024-02-10 20:14:47 -08:00
Ross Wightman
0737cf231d
Add Next-ViT
2024-02-10 17:05:16 -08:00
Ross Wightman
d6c2cc91af
Make NormMlpClassifier head reset args consistent with ClassifierHead
2024-02-10 16:25:33 -08:00
Ross Wightman
87fec3dc14
Update experimental vit model configs
2024-02-10 16:05:58 -08:00
Ross Wightman
7d3c2dc993
Add group_matcher for DaViT
2024-02-10 14:58:45 -08:00
Ross Wightman
7bc7798d0e
Type annotation correctness for create_act
2024-02-10 14:57:58 -08:00
Ross Wightman
7d121ac2ef
Small tweak of timm ToTensor for clarity
2024-02-10 14:57:40 -08:00
Ross Wightman
5a58f4d3dc
Remove test MESA support, no signal that it's helpful so far
2024-02-10 14:38:01 -08:00
Ross Wightman
c7ac37693d
Add device arg to validate() calls in train.py
2024-02-04 10:14:57 -08:00
Ross Wightman
a08b57e801
Fix distributed flag bug w/ flex device handling
2024-02-03 16:26:15 -08:00
Ross Wightman
bee0471f91
forward() pass through for ema model, flag for ema warmup, comment about warmup
2024-02-03 16:24:45 -08:00
Ross Wightman
5e4a4b2adc
Merge branch 'device_flex' into mesa_ema
2024-02-02 09:45:30 -08:00
Ross Wightman
dd84ef2cd5
ModelEmaV3 and MESA experiments
2024-02-02 09:45:04 -08:00
Ross Wightman
d0ff315eed
Merge remote-tracking branch 'emav3/faster_ema' into mesa_ema
2024-01-27 14:52:10 -08:00
Ross Wightman
88889de923
Fix meshgrid deprecation warnings and backward compat with explicit 'ndgrid' and 'meshgrid' fn w/o indexing arg
2024-01-27 13:48:33 -08:00
Ross Wightman
fa247fd9ba
Update README.md
2024-01-27 10:54:10 -08:00
Ross Wightman
dea2dd5d25
Update README.md
Update optimizer list and references
2024-01-27 10:46:41 -08:00
Ross Wightman
2633fd4c81
Update changes.md
2024-01-27 10:31:12 -08:00
Ross Wightman
ef5485d609
Update README.md
Clear history to start of 2023
2024-01-27 10:30:47 -08:00
Ross Wightman
d4386219c6
Improve type handling for arange & rel pos embeds, keep calculations in float32 until application (may change to apply in float32 in future). Prevent arange type hijacking by DeepSpeed Zero
2024-01-26 16:35:51 -08:00
Ross Wightman
3234daf783
Add missing deprecation mapping for a densenet and xcit model. Fix #2086. Tweak xcit pos embed use of arange for better low prec safety.
2024-01-24 22:04:04 -08:00
Ross Wightman
809a9e14e2
Pass train-crop-mode to create_loader/transforms from train.py args
2024-01-24 16:19:02 -08:00
Ross Wightman
a48ab818f5
Improving device flexibility in train. Fix #2081
2024-01-20 15:10:20 -08:00
Li zhuoqun
53a4888328
Add droppath and type hint to Xception.
2024-01-19 11:15:47 -08:00
kalazus
7f19a4cce7
fix fast catavgmax selection
2024-01-16 10:30:08 -08:00
lorenzbaraldi
8c663c4b86
Fixed index out of range in case of resume
2024-01-12 23:33:32 -08:00
Ross Wightman
36449617ff
Update README.md
2024-01-09 15:04:34 -08:00