Ross Wightman
94bcdebd73
Add latest weights trained on TPU-v3 VM instances
2022-03-18 21:35:41 -07:00
Ross Wightman
0557c8257d
Fix bug introduced in non layer_decay weight_decay application. Remove debug print, fix arg desc.
2022-02-28 17:06:32 -08:00
Ross Wightman
372ad5fa0d
Significant model refactor and additions:
* All models updated with revised forward_features / forward_head interface
* Vision transformer and MLP based models consistently output sequence from forward_features (pooling or token selection considered part of 'head')
* WIP param grouping interface to allow consistent grouping of parameters for layer-wise decay across all model types
* Add gradient checkpointing support to a significant % of models, especially popular architectures
* Formatting and interface consistency improvements across models
* layer-wise LR decay impl part of optimizer factory w/ scale support in scheduler
* Poolformer and Volo architectures added
2022-02-28 13:56:23 -08:00
Ross Wightman
2c3870e107
semobilevit_s for good measure
2022-01-31 22:36:09 -08:00
Ross Wightman
bcaeb91b03
Version to 0.6.0, possible interface incompatibilities vs 0.5.x
2022-01-31 15:42:14 -08:00
Ross Wightman
58ba49c8ef
Add MobileViT models (w/ ByobNet base). Close #1038.
2022-01-31 15:39:34 -08:00
Ross Wightman
5f81d4de23
Move DeiT to own file, vit getting crowded. Working towards fixing #1029, make pooling interface for transformers and mlp closer to convnets. Still working through some details...
2022-01-26 22:53:57 -08:00
Ross Wightman
95cfc9b3e8
Merge remote-tracking branch 'origin/master' into norm_norm_norm
2022-01-25 22:20:45 -08:00
Ross Wightman
abc9ba2544
Transitioning default_cfg -> pretrained_cfg. Improving handling of pretrained_cfg source (HF-Hub, files, timm config, etc). Checkpoint handling tweaks.
2022-01-25 21:54:13 -08:00
Ross Wightman
07379c6d5d
Add vit_base2_patch32_256 for a model between base_patch16 and patch32 with a slightly larger img size and width
2022-01-24 14:46:47 -08:00
Ross Wightman
cf4334391e
Update benchmark and validate scripts to output results in JSON with a fixed delimiter for use in multi-process launcher
2022-01-24 14:46:47 -08:00
Ross Wightman
1331c145a3
Add train benchmark results, adjust name scheme for inference and train benchmark files.
2022-01-23 14:08:30 -08:00
Ross Wightman
a517bf6a7a
Merge pull request #1105 from kozistr/refactor/remove-condition
Remove checking `smoothing` parameter
2022-01-21 13:40:22 -08:00
kozistr
56a6b38f76
refactor: remove if-condition
2022-01-21 14:19:11 +09:00
Ross Wightman
447677616f
version 0.5.5
2022-01-20 21:18:30 -08:00
Ross Wightman
499c4749d7
Add updated NCHW and NHWC inference benchmark numbers for current models. Flip name of 'sam' vit models in results files
2022-01-20 10:40:04 -08:00
Ross Wightman
83b40c5a58
Last batch of small model weights (for now). mobilenetv3_small 050/075/100 and updated mnasnet_small with lambc/lamb optimizer.
2022-01-19 10:02:02 -08:00
Ross Wightman
7f73252716
Merge pull request #1094 from Mi-Peng/lars
fix lars
2022-01-19 08:39:49 -08:00
Mi-Peng
cdcd0a92ca
fix lars
2022-01-19 17:49:43 +08:00
Ross Wightman
2d4b7e7080
Update results csvs for latest release
2022-01-18 22:56:45 -08:00
Ross Wightman
1aa617cb3b
Add AvgPool2d anti-aliasing support to ResNet arch (as per OpenAI CLIP models), add a few blur aa models as well
2022-01-18 21:57:24 -08:00
Ross Wightman
f0f9eccda8
Add --fuser arg to train/validate/benchmark scripts to select jit fuser type
2022-01-17 13:54:25 -08:00
Ross Wightman
010b486590
Add Dino pretrained weights (no head) for vit models. Add support to tests and helpers for models w/ no classifier (num_classes=0 in pretrained cfg)
2022-01-17 12:20:02 -08:00
Ross Wightman
738a9cd635
unbiased=False for torch.var_mean path of ConvNeXt LN. Fix #1090
2022-01-17 09:25:06 -08:00
Ross Wightman
e0c4eec4b6
Default conv_mlp to False across the board for ConvNeXt, causing issues on more setups than it's improving right now...
2022-01-16 14:20:08 -08:00
Ross Wightman
b669f4a588
Add ConvNeXt 22k->1k fine-tuned and 384 22k->1k fine-tuned weights after testing
2022-01-15 15:44:36 -08:00
Ross Wightman
6dcbaf211a
Update README.md
2022-01-14 20:11:45 -08:00
Ross Wightman
a8d103e18b
Giant/gigantic vits snuck through in a test and broke GitHub test runner, add filter
2022-01-14 17:23:35 -08:00
Ross Wightman
ef72ad4177
Extra vit_huge model likely to cause test issue (non in21k variant), adding to filters
2022-01-14 16:28:27 -08:00
Ross Wightman
e967c72875
Update README.md. Sneak in g/G (giant / gigantic?) ViT defs from scaling paper
2022-01-14 16:28:27 -08:00
Ross Wightman
9ca3437178
Add some more small model weights lcnet, mnas, mnv2
2022-01-14 16:28:27 -08:00
Ross Wightman
fa6463c936
Version 0.5.4
2022-01-14 16:28:27 -08:00
Ross Wightman
fa81164378
Fix stem width for really small mobilenetv3 arch defs
2022-01-14 16:28:27 -08:00
Ross Wightman
edd3d73695
Add missing dropout for head reset in ConvNeXt default head
2022-01-14 16:28:27 -08:00
Ross Wightman
b093dcb46d
Some convnext cleanup, remove in place mul_ for gamma, breaking symbolic trace, cleanup head a bit...
2022-01-14 16:28:27 -08:00
Ross Wightman
18934debc5
Add initial ConvNeXt impl (mods of official code)
2022-01-14 16:28:27 -08:00
Ross Wightman
656757d26b
Fix MobileNetV2 head conv size for multiplier < 1.0. Add some missing modification copyrights, fix starting date of some old ones.
2022-01-14 16:28:27 -08:00
Ross Wightman
ccfeb06936
Fix out_indices handling breakage, should have left as per vgg approach.
2022-01-07 19:30:51 -08:00
Ross Wightman
a9f91483a6
Fix #1078, DarkNet has 6 feature maps. Make vgg and darknet out_indices handling/comments equivalent
2022-01-07 15:08:32 -08:00
Ross Wightman
c21b21660d
visformer supports spatial feat map, update pool_size in pretrained cfg to match
2022-01-07 14:31:43 -08:00
Ross Wightman
9c11dfd9cb
Fix fbnetv3 pretrained cfg changes
2022-01-07 14:09:50 -08:00
Ross Wightman
1406cddc2e
FBNetV3 timm trained weights added for b/d/g variants. Update version to 0.5.2 for pypi release.
2022-01-07 12:05:08 -08:00
Ross Wightman
02ae11e526
Leaving repeat aug sampler indices as tensor thrashes worker shared process memory
2022-01-06 22:33:09 -08:00
Ross Wightman
4df51f3932
Add lcnet_100 and mnasnet_small weights
2022-01-06 22:21:05 -08:00
Ross Wightman
5ccf682a8f
Remove deprecated bn-tf train arg and create_model handler. Add evos/evob models back into fx test filter until norm_norm_norm branch merged.
2022-01-06 18:08:39 -08:00
Ross Wightman
b9a715c86a
Add more small model defs for MobileNetV3/V2/LCNet
2022-01-06 16:06:43 -08:00
Ross Wightman
b27c21b09a
Update drop_path and drop_block (fast impl) to be symbolically traceable, slightly faster
2022-01-06 16:04:58 -08:00
Ross Wightman
25d1526092
Update pytest for GitHub runner to use --forked with xdist, hopefully eliminate memory buildup
2022-01-06 16:04:23 -08:00
Ross Wightman
214c84a235
Disable use of timm nn.Linear wrapper since AMP autocast + torchscript use appears fixed
2022-01-06 16:01:51 -08:00
Ross Wightman
de5fa791c6
Merge branch 'master' into norm_norm_norm
2022-01-03 11:37:00 -08:00