Ross Wightman
|
cf5fec5047
|
Cleanup experimental vit weight init a bit
|
2021-03-20 09:44:24 -07:00 |
|
Ross Wightman
|
f42f1df26c
|
Improve evenness of per-worker split for validation set with TFDS
|
2021-03-18 23:16:14 -07:00 |
|
Ross Wightman
|
cbcb76d72c
|
Should have included Conv2d layers in original weight init. Lets see what the impact is...
|
2021-03-18 23:15:48 -07:00 |
|
Ross Wightman
|
4de57ccf01
|
Add weight init scheme that's closer to JAX impl
|
2021-03-18 15:35:22 -07:00 |
|
Ross Wightman
|
17cdee7354
|
Fix C&P patch_size error, and order of op patch_size arg resolution bug. Remove a test vit model.
|
2021-03-01 16:53:32 -08:00 |
|
Ross Wightman
|
0706d05d52
|
Benchmark models listed in txt file. Add more hybrid vit variants for testing
|
2021-02-28 16:00:33 -08:00 |
|
Ross Wightman
|
2db2d87ff7
|
Add epoch-repeats arg to multiply the number of dataset passes per epoch. Currently for iterable datasets (read TFDS wrapper) only.
|
2021-02-23 17:31:42 -08:00 |
|
Ross Wightman
|
de97be9146
|
Spell out diff between my small and deit small vit models.
|
2021-02-23 16:22:55 -08:00 |
|
Ross Wightman
|
f0ffdf89b3
|
Add numerous experimental ViT Hybrid models w/ ResNetV2 base. Update the ViT naming for hybrids. Fix #426 for pretrained vit resizing.
|
2021-02-23 15:54:55 -08:00 |
|
Ross Wightman
|
0e16d4e9fb
|
Add benchmark.py script, and update optimizer factory to be more friendly to use outside of argparse interface.
|
2021-02-23 15:38:12 -08:00 |
|
Ross Wightman
|
4bc103f504
|
Fix CUDA crash w/ channels-last + CSP models. Remove use of chunk()
|
2021-02-23 13:15:52 -08:00 |
|
Ross Wightman
|
8563609b28
|
Update notes in ScaledStdConv impl
|
2021-02-18 12:44:08 -08:00 |
|
Ross Wightman
|
678ba4e0a2
|
Add NFNet-F model weights ported from DeepMind Haiku impl and new set of models w/ compatible config.
|
2021-02-18 12:28:46 -08:00 |
|
Ross Wightman
|
9de2ec5e44
|
Update README for AGC and bump version to 0.4.4
|
2021-02-16 09:13:03 -08:00 |
|
Ross Wightman
|
4f49b94311
|
Initial AGC impl. Still testing.
|
2021-02-15 23:22:44 -08:00 |
|
Ross Wightman
|
5f9aff395c
|
Fix stem width in NFNet-F models, add some more comments, add some 'light' NFNet models for testing.
|
2021-02-13 16:58:51 -08:00 |
|
Ross Wightman
|
d86dbe45c2
|
Update README.md and few more comments
|
2021-02-12 22:07:18 -08:00 |
|
Ross Wightman
|
0d253e2c5e
|
Fix issue with nfnet tests, bit more cleanup.
|
2021-02-12 21:05:41 -08:00 |
|
Ross Wightman
|
cb06c7a910
|
Add NFNet-F models and tweak existing NF models.
|
2021-02-12 18:28:56 -08:00 |
|
Ross Wightman
|
e4de077021
|
Add first 'Normalizer Free' models. nf_regnet_b1 79.3 @ 288x288 test, and nf_resnet50 80.3 @ 256x256 test (80.68 @ 288x288).
|
2021-02-11 13:20:11 -08:00 |
|
Ross Wightman
|
d8e69206be
|
Merge pull request #419 from rwightman/byob_vgg_models
More models, GPU-Efficient Nets, RepVGG, classic VGG, and flexible Byob backbone.
|
2021-02-10 15:44:09 -08:00 |
|
Ross Wightman
|
ca9b078ac7
|
Update README.md and docs. Version bumped to 0.4.3
|
2021-02-10 14:46:07 -08:00 |
|
Ross Wightman
|
6853b07bbd
|
Improve RegVGG block identity/vs non for clariy and fix attn usage. Add comments.
|
2021-02-10 14:40:29 -08:00 |
|
Ross Wightman
|
0356e773f5
|
Default to native PyTorch AMP instead of APEX amp. Too many APEX issues cropping up lately.
|
2021-02-10 14:31:18 -08:00 |
|
Reuben
|
94ca140b67
|
update collections.abc import
|
2021-02-10 23:54:35 +11:00 |
|
Ross Wightman
|
b4e216e377
|
Fix a few small things.
|
2021-02-09 17:33:43 -08:00 |
|
Ross Wightman
|
dc85e5a237
|
Add ByobNet w/ GPU-EfficientNets and RepVGG. Also add classic vgg models.
|
2021-02-09 16:22:52 -08:00 |
|
Ross Wightman
|
1bcc69e0ad
|
Use in_channels for depthwise groups, allows using out_channels=N * in_channels (does not impact existing models). Fix #354.
|
2021-02-09 16:22:52 -08:00 |
|
Ross Wightman
|
9811e229f7
|
Fix regression in models with 1001 class pretrained weights. Improve batchnorm arg and BatchNormAct layer handling in several models.
|
2021-02-09 16:22:52 -08:00 |
|
Ross Wightman
|
a39c3ee216
|
Merge branch 'master' into eca-weights
|
2021-02-08 11:52:31 -08:00 |
|
Ross Wightman
|
e9d6fe293c
|
Update README for new weights. Version 0.4.2
|
2021-02-08 11:51:16 -08:00 |
|
Ross Wightman
|
666de85cf1
|
Move stride in EdgeResidual block to 3x3 expansion conv. Fix #414
|
2021-02-07 22:10:18 -08:00 |
|
Ross Wightman
|
3b57490a63
|
Fix some half removed resnet model defs, pooling for ecaresnet269d
|
2021-02-07 22:09:25 -08:00 |
|
Ross Wightman
|
68a4144882
|
Add new weights for ecaresnet26t/50t/269d models. Remove distinction between 't' and 'tn' (tiered models), tn is now t. Add test time img size spec to default cfg.
|
2021-02-06 16:30:02 -08:00 |
|
Ross Wightman
|
b9843f954b
|
Merge pull request #282 from tigert1998/patch-1
Add symbolic for SwishJitAutoFn to support onnx
|
2021-02-04 12:18:40 -08:00 |
|
hwangdeyu
|
7a4be5c035
|
add operator HardSwishJitAutoFn export to onnx
|
2021-02-03 09:06:53 +08:00 |
|
Ross Wightman
|
4203efa36d
|
Fix #387 so that checkpoint saver works with max history of 1. Add checkpoint-hist arg to train.py.
|
2021-01-31 20:14:51 -08:00 |
|
Ross Wightman
|
f0e65e37b7
|
Fix NF-ResNet101 model defs
|
2021-01-30 23:26:19 -08:00 |
|
Ross Wightman
|
2c988c3b6e
|
Update README.md for NF-nets, bump version to 0.4.1 for merge
|
2021-01-30 23:19:45 -08:00 |
|
Ross Wightman
|
2de54d174a
|
Fix pool size defs for NFNet models, add a comment.
|
2021-01-30 18:02:33 -08:00 |
|
Ross Wightman
|
90980de4a9
|
Fix up a few details in NFResNet models, managed stable training. Add support for gamma gain to be applied in activation or ScaleStdConv. Some tweaks to ScaledStdConv.
|
2021-01-30 16:32:07 -08:00 |
|
Ross Wightman
|
5a8e1e643e
|
Initial Normalizer-Free Reg/ResNet impl. A bit of related layer refactoring.
|
2021-01-27 22:06:57 -08:00 |
|
Ross Wightman
|
38d8f67570
|
Fix potential issue with change to num_classes arg in train/validate.py defaulting to None (rely on model def / default_cfg)
|
2021-01-25 11:53:34 -08:00 |
|
Ross Wightman
|
587780e56b
|
Update README.md and bump version to 0.4.0
|
2021-01-25 11:22:11 -08:00 |
|
Ross Wightman
|
bb50ac4708
|
Add DeiT distilled weights and distilled model def. Remove some redudant ViT model args.
|
2021-01-25 11:05:23 -08:00 |
|
Ross Wightman
|
c16e965037
|
Add some ViT comments and fix a few minor issues.
|
2021-01-24 23:18:35 -08:00 |
|
Ross Wightman
|
22748f1a2d
|
Convert samples/targets in ParserImageInTar to numpy arrays, slightly less mem usage for massive datasets. Add a few more se/eca model defs to resnet.py
|
2021-01-22 16:54:33 -08:00 |
|
Ross Wightman
|
5d4c3d0af3
|
Add enhanced ParserImageInTar that can read images from tars within tars, folders with multiple tars, etc. Additional comment cleanup.
|
2021-01-22 10:52:04 -08:00 |
|
Ross Wightman
|
55f7dfa9ea
|
Refactor vision_transformer entrpy fns, add pos embedding resize support for fine tuning, add some deit models for testing
|
2021-01-18 16:11:02 -08:00 |
|
Ross Wightman
|
d55bcc0fee
|
Finishing adding stochastic depth support to BiT ResNetV2 models
|
2021-01-16 16:32:03 -08:00 |
|