Ross Wightman
|
e861b74cf8
|
Pass through --model-kwargs (and --opt-kwargs for train) from command line through to model __init__. Update some models to improve arg overlay. Cleanup along the way.
|
2023-01-06 16:12:33 -08:00 |
Ross Wightman
|
87939e6fab
|
Refactor device handling in scripts, distributed init to be less 'cuda' centric. More device args passed through where needed.
|
2022-09-23 16:08:59 -07:00 |
Ross Wightman
|
0dbd9352ce
|
Add bulk_runner script and updates to benchmark.py and validate.py for better error handling in bulk runs (used for benchmark and validation result runs). Improved batch size decay stepping on retry...
|
2022-07-18 18:04:54 -07:00 |
Ross Wightman
|
f0f9eccda8
|
Add --fuser arg to train/validate/benchmark scripts to select jit fuser type
|
2022-01-17 13:54:25 -08:00 |
Ross Wightman
|
e5da481073
|
Small post-merge tweak for freeze/unfreeze, add to __init__ for utils
|
2021-10-06 17:00:27 -07:00 |
Ross Wightman
|
7c97e66f7c
|
Remove commented code, add more consistent seed fn
|
2021-04-12 09:51:36 -07:00 |
Ross Wightman
|
4f49b94311
|
Initial AGC impl. Still testing.
|
2021-02-15 23:22:44 -08:00 |
Ross Wightman
|
27bbc70d71
|
Add back old ModelEma and rename new one to ModelEmaV2 to avoid compat breaks in dependant code. Shuffle train script, add a few comments, remove DataParallel support, support experimental torchscript training.
|
2020-11-29 16:22:19 -08:00 |
Ross Wightman
|
532e3b417d
|
Reorg of utils into separate modules
|
2020-09-07 13:58:09 -07:00 |