pytorch-image-models

Commit Graph

Author	SHA1	Message	Date
Ross Wightman	a5a2ad2e48	Fix consistency, testing for forward_head w/ pre_logits, reset_classifier, models with pre_logits size != unpooled feature size * add test that model supports forward_head(x, pre_logits=True) * add head_hidden_size attr to all models and set differently from num_features attr when head has hidden layers * test forward_features() feat dim == model.num_features and pre_logits feat dim == self.head_hidden_size * more consistency in reset_classifier signature, add typing * asserts in some heads where pooling cannot be disabled Fix #2194	2024-06-07 13:53:00 -07:00
Yassine	884ef88818	fix all SDPA dropouts	2023-10-05 08:58:41 -07:00
Ross Wightman	3eaf729f3f	F.sdpa for visformer fails w/o contiguous on qkv, make experimental	2023-05-11 11:37:37 -07:00
Ross Wightman	e4e43190ce	Add typing to all model entrypoint fns, add old cache check env var to builder	2023-05-08 08:52:38 -07:00
Ross Wightman	3386af8c86	Final push to get remaining models using multi-weight pretrained configs, almost all weights on HF hub	2023-04-26 15:52:13 -07:00
Ross Wightman	965d0a2d36	fast_attn -> fused_attn, implement global config for enable/disable fused_attn, add to more models. vit clip openai 336 weights.	2023-04-10 12:04:33 -07:00
Ross Wightman	4d135421a3	Implement patch dropout for eva / vision_transformer, refactor / improve consistency of dropout args across all vit based models	2023-04-07 20:27:23 -07:00
Ross Wightman	927f031293	Major module / path restructure, timm.models.layers -> timm.layers, add _ prefix to all non model modules in timm.models	2022-12-06 15:00:06 -08:00
Ross Wightman	0862e6ebae	Fix correctness of some group matching regex (no impact on result), some formatting, missed forward_head for resnet	2022-03-19 14:58:54 -07:00
Ross Wightman	372ad5fa0d	Significant model refactor and additions: * All models updated with revised foward_features / forward_head interface * Vision transformer and MLP based models consistently output sequence from forward_features (pooling or token selection considered part of 'head') * WIP param grouping interface to allow consistent grouping of parameters for layer-wise decay across all model types * Add gradient checkpointing support to a significant % of models, especially popular architectures * Formatting and interface consistency improvements across models * layer-wise LR decay impl part of optimizer factory w/ scale support in scheduler * Poolformer and Volo architectures added	2022-02-28 13:56:23 -08:00
Ross Wightman	95cfc9b3e8	Merge remote-tracking branch 'origin/master' into norm_norm_norm	2022-01-25 22:20:45 -08:00
Ross Wightman	abc9ba2544	Transitioning default_cfg -> pretrained_cfg. Improving handling of pretrained_cfg source (HF-Hub, files, timm config, etc). Checkpoint handling tweaks.	2022-01-25 21:54:13 -08:00
Ross Wightman	656757d26b	Fix MobileNetV2 head conv size for multiplier < 1.0. Add some missing modification copyrights, fix starting date of some old ones.	2022-01-14 16:28:27 -08:00
Ross Wightman	c21b21660d	visformer supports spatial feat map, update pool_size in pretrained cfg to match	2022-01-07 14:31:43 -08:00
KAI ZHAO	b4b8d1ec18	fix hard-coded strides	2021-12-14 17:22:54 +08:00
Ross Wightman	f658a72e72	Cleanup re-use of Dropout modules in Mlp modules after some twitter feedback :p	2021-10-25 00:40:59 -07:00
Ross Wightman	b41cffaa93	Fix a few issues loading pretrained vit/bit npz weights w/ num_classes=0 __init__ arg. Missed a few other small classifier handling detail on Mlp, GhostNet, Levit. Should fix #713	2021-06-22 23:16:05 -07:00
Ross Wightman	8880f696b6	Refactoring, cleanup, improved test coverage. * Add eca_nfnet_l2 weights, 84.7 @ 384x384 * All 'non-std' (ie transformer / mlp) models have classifier / default_cfg test added * Fix #694 reset_classifer / num_features / forward_features / num_classes=0 consistency for transformer / mlp models * Add direct loading of npz to vision transformer (pure transformer so far, hybrid to come) * Rename vit_deit* to deit_* * Remove some deprecated vit hybrid model defs * Clean up classifier flatten for conv classifiers and unusual cases (mobilenetv3/ghostnet) * Remove explicit model fns for levit conv, just pass in arg	2021-06-12 16:40:02 -07:00
Ross Wightman	742c2d5247	Add Gather-Excite and Global Context attn modules. Refactor existing SE-like attn for consistency and refactor byob/byoanet for less redundancy.	2021-05-27 18:03:29 -07:00
Ross Wightman	5db7452173	Fix visformer in_chans stem handling	2021-05-25 14:11:36 -07:00
Ross Wightman	c4572cc5aa	Add Visformer-small weighs, tweak torchscript jit test img size.	2021-05-24 22:50:12 -07:00
Ross Wightman	bfc72f75d3	Expand scope of testing for non-std vision transformer / mlp models. Some related cleanup and create fn cleanup for all vision transformer and mlp models. More CoaT weights.	2021-05-24 21:13:26 -07:00
Ross Wightman	94d4b53352	Add temporary default_cfgs to visformer models so they pass tests	2021-05-15 08:41:31 -07:00
Ross Wightman	ecc7552c5c	Add levit, levit_c, and visformer model defs. Largely untested and not finished cleanup.	2021-05-14 17:16:34 -07:00

24 Commits (5d535d7a2d4b435b1b5c1177fd8f04a12b942b9a)