Ross Wightman
f54612f648
Merge branch 'select_kernel' into attention
2020-02-09 12:59:24 -08:00
Ross Wightman
4defbbbaa8
Fix module name mistake, start layers sub-package
2020-02-09 12:44:26 -08:00
Ross Wightman
7011cd0902
A little bit of ECA cleanup
2020-02-09 12:41:18 -08:00
Ross Wightman
46471df7b2
Merge pull request #82 from VRandme/eca
...
ECA-Net Efficient Channel Attention
2020-02-09 12:31:05 -08:00
Ross Wightman
d0eb59ef46
Remove unused default_init for EfficientNets, experimenting with fanout calc for #84
2020-02-09 11:33:32 -08:00
Chris Ha
e6a762346a
Implement Adaptive Kernel selection
...
When channel size is given,
calculate adaptive kernel size according to original paper.
Otherwise use the given kernel size(k_size), which defaults to 3
2020-02-09 11:58:03 +09:00
Ross Wightman
13e8da2b46
SelectKernel split_input works best when input channels split like grouped conv, but output is full width. Disable zero_init for SK nets, seems a bad combo.
2020-02-07 22:42:04 -08:00
Chris Ha
6db087a1ff
Merge remote-tracking branch 'upstream/master' into eca
2020-02-07 19:36:35 +09:00
Chris Ha
904c618040
Update EcaModule.py
...
Make pylint happy
(commas, unused imports, missed imports)
2020-02-07 19:36:18 +09:00
Chris Ha
db91ba053b
EcaModule(CamelCase)
...
CamelCased EcaModule.
Renamed all instances of ecalayer to EcaModule.
eca_module.py->EcaModule.py
2020-02-07 19:28:07 +09:00
Ross Wightman
5c4991a088
Add PyTorch trained EfficientNet-ES weights from Andrew Lavin
2020-02-06 12:53:55 -08:00
Chris Ha
d04ff95eda
Merge branch 'master' into eca
2020-02-06 22:44:52 +09:00
Chris Ha
d63ae121d5
Clean up eca_module code
...
functionally similar
adjusted rwightman's version of reshaping and viewing.
Use F.pad for circular eca version for cleaner code
2020-02-06 22:44:33 +09:00
Ross Wightman
d66819d1f3
Indentation mistake. Fixes #81
2020-02-04 22:56:00 -08:00
Chris Ha
f87fcd7e88
Implement Eca modules
...
implement ECA module by
1. adopting original eca_module.py into models folder
2. adding use_eca layer besides every instance of SE layer
2020-02-04 23:15:29 +09:00
Ross Wightman
4808b3c32f
Bump version for PyPi update, fix few out of date README items/mistakes, add README updates for TF EfficientNet-B8 (RandAugment)
2020-02-03 11:44:17 -08:00
Ross Wightman
7d07ebb660
Adding some configs to sknet, incl ResNet50 variants from 'Compounding ... Assembled Techniques' paper and original SKNet50
2020-02-01 23:28:48 -08:00
Ross Wightman
a9d2424fd1
Add separate zero_init_last_bn function to support more block variety without a mess
2020-02-01 22:11:00 -08:00
Ross Wightman
355aa152d5
Just leave it float for now, will look at fp16 later. Remove unused reference code.
2020-02-01 22:11:00 -08:00
Ross Wightman
ef457555d3
BlockDrop working on GPU
2020-02-01 22:11:00 -08:00
Ross Wightman
3ff19079f9
Missed nn_ops.py from last commit
2020-02-01 22:11:00 -08:00
Ross Wightman
9f11b4e8a2
Add ConvBnAct layer to parallel integrated SelectKernelConv, add support for DropPath and DropBlock to ResNet base and SK blocks
2020-02-01 22:11:00 -08:00
Ross Wightman
cefc9b7761
Move SelectKernelConv to conv2d_layers and more
...
* always apply attention in SelectKernelConv, leave MixedConv for no attention alternative
* make MixedConv torchscript compatible
* refactor first/previous dilation name to make more sense in ResNet* networks
2020-02-01 22:11:00 -08:00
Ross Wightman
9abe610931
Used wrong channel var for split
2020-02-01 22:11:00 -08:00
Ross Wightman
58e28dc7e7
Move Selective Kernel blocks/convs to their own sknet.py file
2020-02-01 22:11:00 -08:00
Ross Wightman
a93bae6dc5
A SelectiveKernelBasicBlock for more experiments
2020-02-01 22:11:00 -08:00
Ross Wightman
ad087b4b17
Missed bias=False in selection conv
2020-02-01 22:11:00 -08:00
Ross Wightman
c8b3d6b81a
Initial impl of Selective Kernel Networks. Very much a WIP.
2020-02-01 22:11:00 -08:00
Ross Wightman
1daa303744
Add support to Dataset for class id mapping file, clean up a bit of old logic. Add results file arg for validation and update script.
2020-02-01 18:07:32 -08:00
Ross Wightman
91534522f9
Add newly added TF ported EfficientNet-B8 weights (RandAugment)
2020-02-01 18:01:14 -08:00
Ross Wightman
12dbc74742
New ResNet50 JSD + RandAugment weights
2020-01-31 10:55:54 -08:00
Ross Wightman
2f41905ba5
Update ResNet50 weights to AuxMix trained 78.994 top-1. A few commentes re 'tiered_narrow' tn variant.
2020-01-12 17:55:58 -08:00
Ross Wightman
d9a6a9d0af
Merge pull request #74 from rwightman/augmix-jsd
...
AugMix, JSD loss, SplitBatchNorm (Auxiliary BN), and more
2020-01-11 12:04:29 -08:00
Ross Wightman
3eb4a96eda
Update AugMix, JSD, etc comments and references
2020-01-11 12:02:05 -08:00
Ross Wightman
a28117ea46
Add tiered narrow ResNet (tn) and weights for seresnext26tn_32x4d
2020-01-11 11:29:01 -08:00
Ross Wightman
833066b540
A few minor things in SplitBN
2020-01-05 20:07:03 -08:00
Ross Wightman
7547119891
Add SplitBatchNorm. AugMix, Rand/AutoAugment, Split (Aux) BatchNorm, Jensen-Shannon Divergence, RandomErasing all working together
2020-01-05 19:58:59 -08:00
Ross Wightman
2e955cfd0c
Update RandomErasing with some improved arg names, tweak to aspect range
2020-01-05 14:31:48 -08:00
Ross Wightman
3cc0f91e23
Fix augmix variable name scope overlap, default non-blended mode
2020-01-05 14:27:27 -08:00
Ross Wightman
ec0dd4053a
Add updated RandAugment trained EfficientNet-B0 trained weights from @michaelklachko
2020-01-03 17:18:46 -08:00
Ross Wightman
40fea63ebe
Add checkpoint averaging script. Add headers, shebangs, exec perms to all scripts
2020-01-03 14:57:46 -08:00
Ross Wightman
4666cc9aed
Add --pin-mem arg to enable dataloader pin_memory (showing more benefit in some scenarios now), also add --torchscript arg to validate.py for testing models with jit.script
2020-01-02 16:22:06 -08:00
Ross Wightman
53001dd292
ResNet / Res2Net additions:
...
* ResNet torchscript compat
* output_stride arg supported to limit network stride via dilations (support for dilation added to Res2Net)
* allow activation layer to be changed via act_layer arg
2020-01-01 17:15:56 -08:00
Ross Wightman
f96b3e5e92
InceptionResNetV2 torchscript compatible
2020-01-01 17:13:37 -08:00
Ross Wightman
19d93fe454
Add selecsls60 weights
2019-12-31 16:49:04 -08:00
Ross Wightman
0062c15fb0
Update checkpoint url with modelzoo compatible ones.
2019-12-30 15:59:19 -08:00
Ross Wightman
b5315e66b5
Streamline SelecSLS model without breaking checkpoint compat. Move cfg handling out of model class. Update feature/pooling behaviour to match current.
2019-12-30 15:44:47 -08:00
Ross Wightman
d59a756c16
Run PyCharm autoformat on selecsls and change mix cap variables and model names to all lower
2019-12-30 14:30:46 -08:00
Ross Wightman
fb3a0f4bb8
Merge pull request #65 from mehtadushy/selecsls
...
Incorporate SelecSLS Models
2019-12-30 14:23:53 -08:00
Ross Wightman
19fc205a4d
Update comments on the new SE-ResNeXt26 models
2019-12-28 17:33:10 -08:00