Commit Graph

1405 Commits (6064d16a2dfe89b1d3706df338cecfdcee395d1f)
 

Author SHA1 Message Date
Ross Wightman ed41d32637 Add repr to auto_augment and random_erasing impl 2021-10-28 17:33:36 -07:00
Ross Wightman 135a48d024 Fix sam result again for imagenetv2 2021-10-28 14:46:45 -07:00
Ross Wightman aaff2d82d0 Add new 50ts attn models to benchmark/meta csv files 2021-10-28 14:32:47 -07:00
Ross Wightman 1e17863b7b Fixed botne*t26 model results, add some 50ts self-attn variants 2021-10-28 13:55:24 -07:00
Ross Wightman ae72d009fa Add weights for lambda_resnet50ts, halo2botnet50ts, lamhalobotnet50ts, updated halonet50ts 2021-10-27 22:08:54 -07:00
Ross Wightman 13178ba73a Add benchmark and metadata csv files 2021-10-26 10:50:46 -07:00
Ross Wightman b745d30a3e Fix formatting of last commit 2021-10-25 15:15:14 -07:00
Ross Wightman 3478f1d7f1 Traceability fix for vit models for some experiments 2021-10-25 15:13:08 -07:00
Ross Wightman f658a72e72 Cleanup re-use of Dropout modules in Mlp modules after some twitter feedback :p 2021-10-25 00:40:59 -07:00
Ross Wightman 71f00bfe9e Don't run profile if model is torchscripted 2021-10-24 22:41:20 -07:00
Ross Wightman 7da1b0b61c
Merge pull request #933 from t-vi/unbind
use .unbind instead of explicitly listing the indices
2021-10-24 22:33:18 -07:00
Ross Wightman 5882e62ada Add activation count to fvcore based profiling in benchmark.py 2021-10-24 15:30:38 -07:00
Ross Wightman 51f488b7f5 Update results.csv files with latest weights 2021-10-24 14:48:33 -07:00
Thomas Viehmann f805ba86d9 use .unbind instead of explicitly listing the indices 2021-10-24 21:08:47 +02:00
Ross Wightman 57992509f9 Fix some formatting in utils/model.py 2021-10-23 20:35:36 -07:00
Ross Wightman 0fe4fd3f1f add d8 and e8 regnetz models with group size 8 2021-10-23 20:34:21 -07:00
Ross Wightman 25e7c8c5e5 Update broken resnetv2_50 weight url, add resnetv1_101 a1h recipe weights for 224x224 train 2021-10-20 22:14:12 -07:00
Ross Wightman f7325c7b71 Support either deepspeed or fvcore for flop profiling 2021-10-20 15:17:30 -07:00
Ross Wightman 66253790d4 Add `--bench profile` mode for benchmark.py to just run deepspeed detailed profile on model 2021-10-19 16:06:38 -07:00
Ross Wightman 13a8bf7972 Add train size override and deepspeed GMACs counter (if deepspeed installed) to benchmark.py 2021-10-19 15:15:01 -07:00
Ross Wightman 0ba73e6bcb
Update README.md 2021-10-19 14:38:56 -07:00
Ross Wightman b6caa356d2 Fixed eca_botnext26ts_256 weights added, 79.27 2021-10-19 12:44:28 -07:00
Ross Wightman c02334d9fa Add weights for regnetz_d and haloregnetz_c, update regnetz_c weights. Add commented PyTorch XLA code for halo attention 2021-10-19 12:32:09 -07:00
Ross Wightman 02daf2ab94 Add option to include relative pos embedding in the attention scaling as per references. See discussion #912 2021-10-12 15:37:01 -07:00
Ross Wightman 2c33ca6d8c
Merge pull request #913 from ground0state/master
Fix bugs that Mixup does not work when device is cpu
2021-10-12 14:09:56 -07:00
masafumi 047a5ec05f Fix bugs that Mixup does not work device=cpu 2021-10-12 23:51:46 +09:00
Ross Wightman cd34913278 Remove some outdated comments, botnet networks working great now. 2021-10-11 22:43:41 -07:00
Ross Wightman 6ed4cdccca Update lambda_resnet26t weights with better set 2021-10-10 16:32:54 -07:00
Ross Wightman 288ece0e9f
Merge pull request #910 from tmp-iclr/master
Add ConvMixer
2021-10-10 16:00:58 -07:00
ICLR Author 44d6d51668 Add ConvMixer 2021-10-09 21:09:51 -04:00
Ross Wightman a85df34993 Update lambda_resnet26rpt weights to 78.9, add better halonet26t weights at 79.1 with tweak to attention dim 2021-10-08 17:44:13 -07:00
Ross Wightman 38804c721b Checkpoint clean fn useable stand alone 2021-10-08 17:43:53 -07:00
Ross Wightman b544ad4d3f regnetz model default cfg tweaks 2021-10-06 21:14:59 -07:00
Ross Wightman d80653cb99 Merge branch 'alexander-soare-freeze-functionality' 2021-10-06 17:01:41 -07:00
Ross Wightman e5da481073 Small post-merge tweak for freeze/unfreeze, add to __init__ for utils 2021-10-06 17:00:27 -07:00
Ross Wightman 5ca72dcc75 Merge branch 'freeze-functionality' of https://github.com/alexander-soare/pytorch-image-models into alexander-soare-freeze-functionality 2021-10-06 16:51:03 -07:00
Ross Wightman e2b8d44ff0 Halo, bottleneck attn, lambda layer additions and cleanup along w/ experimental model defs
* align interfaces of halo, bottleneck attn and lambda layer
* add qk_ratio to all of above, control q/k dim relative to output dim
* add experimental haloregnetz, and trionet (lambda + halo + bottle) models
2021-10-06 16:32:48 -07:00
Ross Wightman e0b3a3fab3 Make test-pooling flag for validate.py opt in 2021-10-06 16:12:20 -07:00
Alexander Soare 431e60c83f Add acknowledgements for freeze_batch_norm inspiration 2021-10-06 14:28:49 +01:00
Ross Wightman fbf59c04ee Change crop ratio on correct resnet50 variant. 2021-10-04 22:31:08 -07:00
Ross Wightman ae1ff5792f Clean a1/a2/3 rsb _0 checkpoints properly, fix v2 loading. 2021-10-04 16:46:00 -07:00
Ross Wightman d123042605
Update README.md 2021-10-03 21:38:47 -07:00
Ross Wightman cd638d50a5
Merge pull request #880 from rwightman/fixes_bce_regnet
A collection of fixes, model experiments, etc
2021-10-03 19:37:01 -07:00
Ross Wightman 93901e992f Version bump to 0.5.0 for pending release post RSB and ATTN updates 2021-10-03 17:34:57 -07:00
Ross Wightman da0d39bedd Update default crop_pct for byoanet 2021-10-03 17:33:16 -07:00
Ross Wightman cc9bedf373 Add initial ResNet Strikes Back weights for ResNet50 and ResNetV2-50 models 2021-10-03 17:32:02 -07:00
Ross Wightman 64495505b7 Add updated lambda resnet26 and botnet26 checkpoints with fixes applied 2021-10-03 17:31:39 -07:00
Ross Wightman b2094f4ee8 support bits checkpoints in avg/load 2021-10-03 17:31:22 -07:00
Ross Wightman 007bc39323 Some halo and bottleneck attn code cleanup, add halonet50ts weights, use optimal crop ratios 2021-10-02 15:51:42 -07:00
Alexander Soare 6d2acec1bb Fix ordering of tests 2021-10-02 16:10:11 +01:00