pytorch-image-models

Commit Graph

Author	SHA1	Message	Date
Ross Wightman	e4e43190ce	Add typing to all model entrypoint fns, add old cache check env var to builder	2023-05-08 08:52:38 -07:00
Ross Wightman	4d135421a3	Implement patch dropout for eva / vision_transformer, refactor / improve consistency of dropout args across all vit based models	2023-04-07 20:27:23 -07:00
Ross Wightman	e9f427b953	Add hf hub entries for mlp_mixer	2023-03-27 22:50:43 -07:00
Ross Wightman	b12060996c	MLP-Mixer multi-weight support, hf hub push	2023-03-27 16:42:13 -07:00
Ross Wightman	927f031293	Major module / path restructure, timm.models.layers -> timm.layers, add _ prefix to all non model modules in timm.models	2022-12-06 15:00:06 -08:00
Ross Wightman	e11efa872d	Update a bunch of weights with external links to timm release assets. Fixes issue with *aliyuncs.com returning forbidden. Did pickle scan / verify and re-hash. Add TresNet-V2-L weights.	2022-09-13 16:35:26 -07:00
Ross Wightman	1b278136c3	Change models with mean 0,0,0 std 1,1,1 from int to float for consistency as mentioned in #1355	2022-07-21 17:36:15 -07:00
Ross Wightman	0862e6ebae	Fix correctness of some group matching regex (no impact on result), some formatting, missed forward_head for resnet	2022-03-19 14:58:54 -07:00
Ross Wightman	372ad5fa0d	Significant model refactor and additions: * All models updated with revised forward_features / forward_head interface * Vision transformer and MLP based models consistently output sequence from forward_features (pooling or token selection considered part of 'head') * WIP param grouping interface to allow consistent grouping of parameters for layer-wise decay across all model types * Add gradient checkpointing support to a significant % of models, especially popular architectures * Formatting and interface consistency improvements across models * layer-wise LR decay impl part of optimizer factory w/ scale support in scheduler * Poolformer and Volo architectures added	2022-02-28 13:56:23 -08:00
Ross Wightman	5f81d4de23	Move DeiT to own file, vit getting crowded. Working towards fixing #1029 , make pooling interface for transformers and mlp closer to convnets. Still working through some details...	2022-01-26 22:53:57 -08:00
Ross Wightman	abc9ba2544	Transitioning default_cfg -> pretrained_cfg. Improving handling of pretrained_cfg source (HF-Hub, files, timm config, etc). Checkpoint handling tweaks.	2022-01-25 21:54:13 -08:00
Martins Bruveris	85c5ff26d7	Added DINO pretrained ResMLP models.	2021-11-24 15:02:46 +02:00
Ross Wightman	20a2be14c3	Add gMLP-S weights, 79.6 top-1	2021-06-23 10:40:30 -07:00
Ross Wightman	b41cffaa93	Fix a few issues loading pretrained vit/bit npz weights w/ num_classes=0 __init__ arg. Missed a few other small classifier handling details on Mlp, GhostNet, Levit. Should fix #713	2021-06-22 23:16:05 -07:00
Ross Wightman	8f4a0222ed	Add GMixer-24 MLP model weights, trained w/ TPU + PyTorch XLA	2021-06-18 16:49:28 -07:00
Ross Wightman	511a8e8c96	Add official ResMLP weights.	2021-06-14 17:03:16 -07:00
Ross Wightman	4d96165989	Merge branch 'master' into cleanup_xla_model_fixes	2021-06-12 23:19:25 -07:00
Ross Wightman	8880f696b6	Refactoring, cleanup, improved test coverage. * Add eca_nfnet_l2 weights, 84.7 @ 384x384 * All 'non-std' (ie transformer / mlp) models have classifier / default_cfg test added * Fix #694 reset_classifier / num_features / forward_features / num_classes=0 consistency for transformer / mlp models * Add direct loading of npz to vision transformer (pure transformer so far, hybrid to come) * Rename vit_deit* to deit_* * Remove some deprecated vit hybrid model defs * Clean up classifier flatten for conv classifiers and unusual cases (mobilenetv3/ghostnet) * Remove explicit model fns for levit conv, just pass in arg	2021-06-12 16:40:02 -07:00
Ross Wightman	d413eef1bf	Add ResMLP-24 model weights that I trained in PyTorch XLA on TPU-VM. 79.2 top-1.	2021-06-08 14:22:05 -07:00
Ross Wightman	2f5ed2dec1	Update `init_values` const for 24 and 36 layer ResMLP models	2021-06-07 17:15:04 -07:00
Ross Wightman	bfc72f75d3	Expand scope of testing for non-std vision transformer / mlp models. Some related cleanup and create fn cleanup for all vision transformer and mlp models. More CoaT weights.	2021-05-24 21:13:26 -07:00
talrid	dc1a4efd28	mixer_b16_224_miil, mixer_b16_224_miil_in21k models	2021-05-20 10:35:50 +03:00
Ross Wightman	d5af752117	Add preliminary gMLP and ResMLP impl to Mlp-Mixer	2021-05-19 09:55:05 -07:00
Ross Wightman	e7f0db8664	Fix drop/drop_path arg on MLP-Mixer model. Fix #641	2021-05-16 08:31:52 -07:00
Ross Wightman	b2c305c2aa	Move Mlp and PatchEmbed modules into layers. Being used in lots of models now...	2021-05-06 14:03:23 -07:00
Ross Wightman	2d8b09fe8b	Add official pretrained weights to MLP-Mixer, complete model cfgs.	2021-05-05 15:59:40 -07:00
Ross Wightman	12efffa6b1	Initial MLP-Mixer attempt...	2021-05-05 00:59:45 -07:00

27 Commits (a2e4a4c14849bc6012467b073a31d2a48036fd0e)