Ross Wightman
b049a5c5c6
Merge remote-tracking branch 'origin/master' into norm_norm_norm
2022-03-21 13:41:43 -07:00
Ross Wightman
04db5833eb
Merge pull request #986 from hankyul2/master
...
fix: typo of argment parser desc in train.py
2022-03-21 12:13:51 -07:00
Ross Wightman
0557c8257d
Fix bug introduced in non layer_decay weight_decay application. Remove debug print, fix arg desc.
2022-02-28 17:06:32 -08:00
Ross Wightman
372ad5fa0d
Significant model refactor and additions:
...
* All models updated with revised foward_features / forward_head interface
* Vision transformer and MLP based models consistently output sequence from forward_features (pooling or token selection considered part of 'head')
* WIP param grouping interface to allow consistent grouping of parameters for layer-wise decay across all model types
* Add gradient checkpointing support to a significant % of models, especially popular architectures
* Formatting and interface consistency improvements across models
* layer-wise LR decay impl part of optimizer factory w/ scale support in scheduler
* Poolformer and Volo architectures added
2022-02-28 13:56:23 -08:00
Ross Wightman
95cfc9b3e8
Merge remote-tracking branch 'origin/master' into norm_norm_norm
2022-01-25 22:20:45 -08:00
Ross Wightman
abc9ba2544
Transitioning default_cfg -> pretrained_cfg. Improving handling of pretrained_cfg source (HF-Hub, files, timm config, etc). Checkpoint handling tweaks.
2022-01-25 21:54:13 -08:00
Ross Wightman
f0f9eccda8
Add --fuser arg to train/validate/benchmark scripts to select jit fuser type
2022-01-17 13:54:25 -08:00
Ross Wightman
5ccf682a8f
Remove deprecated bn-tf train arg and create_model handler. Add evos/evob models back into fx test filter until norm_norm_norm branch merged.
2022-01-06 18:08:39 -08:00
han
ab5ae32f75
fix: typo of argment parser desc in train.py
...
- Remove duplicated `of`
2021-11-24 09:32:05 +09:00
Ross Wightman
ba65dfe2c6
Dataset work
...
* support some torchvision datasets
* improvements to TFDS wrapper for subsplit handling (fix #942 ), shuffle seed
* add class-map support to train (fix #957 )
2021-11-09 22:34:15 -08:00
Ross Wightman
cd638d50a5
Merge pull request #880 from rwightman/fixes_bce_regnet
...
A collection of fixes, model experiments, etc
2021-10-03 19:37:01 -07:00
Ross Wightman
d9abfa48df
Make broadcast_buffers disable its own flag for now (needs more testing on interaction with dist_bn)
2021-10-01 13:43:55 -07:00
Ross Wightman
80075b0b8a
Add worker_seeding arg to allow selecting old vs updated data loader worker seed for (old) experiment repeatability
2021-09-28 16:37:45 -07:00
Shoufa Chen
908563d060
fix `use_amp`
...
Fix https://github.com/rwightman/pytorch-image-models/issues/881
2021-09-26 12:32:22 +08:00
Ross Wightman
0387e6057e
Update binary cross ent impl to use thresholding as an option (convert soft targets from mixup/cutmix to 0, 1)
2021-09-23 15:45:39 -07:00
Ross Wightman
0639d9a591
Fix updated validation_batch_size fallback
2021-09-02 14:44:53 -07:00
Ross Wightman
5db057dca0
Fix misnamed arg, tweak other train script args for better defaults.
2021-09-02 14:15:49 -07:00
Ross Wightman
fb94350896
Update training script and loader factory to allow use of scheduler updates, repeat augment, and bce loss
2021-09-01 17:46:40 -07:00
SamuelGabriel
7c19c35d9f
Global instead of local rank.
2021-06-09 19:11:58 +02:00
Ross Wightman
e15e68d881
Fix #566 , summary.csv writing to pwd on local_rank != 0. Tweak benchmark mem handling to see if it reduces likelihood of 'bad' exceptions on OOM.
2021-04-15 23:03:56 -07:00
Ross Wightman
e685618f45
Merge pull request #550 from amaarora/wandb
...
Wandb Support
2021-04-15 09:26:35 -07:00
Ross Wightman
7c97e66f7c
Remove commented code, add more consistent seed fn
2021-04-12 09:51:36 -07:00
Aman Arora
5772c55c57
Make wandb optional
2021-04-10 01:34:20 -04:00
Aman Arora
f54897cc0b
make wandb not required but rather optional as huggingface_hub
2021-04-10 01:27:23 -04:00
Aman Arora
f13f7508a9
Keep changes to minimal and use args.experiment as wandb project name if it exists
2021-04-10 00:50:52 -04:00
Aman Arora
f8bb13f640
Default project name to None
2021-04-10 00:44:05 -04:00
Aman Arora
3f028ebc0f
import wandb in summary.py
2021-04-08 03:48:51 -04:00
Aman Arora
a9e5d9e5ad
log loss as before
2021-04-08 03:41:40 -04:00
Aman Arora
624c9b6949
log to wandb only if using using wandb
2021-04-08 03:40:22 -04:00
Aman Arora
00c8e0b8bd
Make use of wandb configurable
2021-04-08 03:35:59 -04:00
Aman Arora
8e6fb861e4
Add wandb support
2021-04-08 03:22:29 -04:00
Ross Wightman
37c71a5609
Some further create_optimizer_v2 tweaks, remove some redudnant code, add back safe model str. Benchmark step times per batch.
2021-04-01 22:34:55 -07:00
Ross Wightman
288682796f
Update benchmark script to add precision arg. Fix some downstream (DeiT) compat issues with latest changes. Bump version to 0.4.7
2021-04-01 16:40:12 -07:00
Ross Wightman
a5310a3451
Merge remote-tracking branch 'origin/benchmark-fixes-vit_hybrids' into pit_and_vit_update
2021-04-01 12:15:34 -07:00
Ross Wightman
e2e3290fbf
Add '--experiment' to train args for fixed exp name if desired, 'train' not added to output folder if specified.
2021-03-20 12:02:17 -07:00
Ross Wightman
d584e7f617
Support for huggingface hub via create_model and default_cfgs.
...
* improve consistency of model creation helper fns
* add comments to some of the model helpers
* support passing external default_cfgs so they can be sourced from hub
2021-03-16 22:48:26 -07:00
Ross Wightman
2db2d87ff7
Add epoch-repeats arg to multiply the number of dataset passes per epoch. Currently for iterable datasets (read TFDS wrapper) only.
2021-02-23 17:31:42 -08:00
Ross Wightman
0e16d4e9fb
Add benchmark.py script, and update optimizer factory to be more friendly to use outside of argparse interface.
2021-02-23 15:38:12 -08:00
Ross Wightman
01653db104
Missed clip-mode arg for repo train script
2021-02-15 23:27:16 -08:00
Ross Wightman
4f49b94311
Initial AGC impl. Still testing.
2021-02-15 23:22:44 -08:00
Ross Wightman
d8e69206be
Merge pull request #419 from rwightman/byob_vgg_models
...
More models, GPU-Efficient Nets, RepVGG, classic VGG, and flexible Byob backbone.
2021-02-10 15:44:09 -08:00
Ross Wightman
0356e773f5
Default to native PyTorch AMP instead of APEX amp. Too many APEX issues cropping up lately.
2021-02-10 14:31:18 -08:00
Csaba Kertesz
5114c214fc
Change the Python interpreter to Python 3.x in the scripts
2021-02-09 21:20:28 +02:00
Ross Wightman
4203efa36d
Fix #387 so that checkpoint saver works with max history of 1. Add checkpoint-hist arg to train.py.
2021-01-31 20:14:51 -08:00
Ross Wightman
38d8f67570
Fix potential issue with change to num_classes arg in train/validate.py defaulting to None (rely on model def / default_cfg)
2021-01-25 11:53:34 -08:00
Ross Wightman
5d4c3d0af3
Add enhanced ParserImageInTar that can read images from tars within tars, folders with multiple tars, etc. Additional comment cleanup.
2021-01-22 10:52:04 -08:00
Ross Wightman
9d5d4b8df6
Fix silly train.py typo during dataset work
2021-01-16 16:32:21 -08:00
Ross Wightman
855d6cc217
More dataset work including factories and a tensorflow datasets (TFDS) wrapper
...
* Add parser/dataset factory methods for more flexible dataset & parser creation
* Add dataset parser that wraps TFDS image classification datasets
* Tweak num_classes handling bug for 21k models
* Add initial deit models so they can be benchmarked in next csv results runs
2021-01-15 17:26:20 -08:00
Ross Wightman
fd9061dbf7
Remove debug print from train.py
2021-01-04 12:16:06 -08:00
Ross Wightman
59ec7e6a53
Merge branch 'master' into imagenet21k_datasets_more
2021-01-04 12:11:05 -08:00