Ross Wightman
|
5242ba6edc
|
MobileOne and FastViT weights on HF hub, more code cleanup and tweaks, features_only working. Add reparam flag to validate and benchmark, support reparm of all models with fuse(), reparameterize() or switch_to_deploy() methods on modules
|
2023-08-23 22:50:37 -07:00 |
Fredo Guan
|
81ca323751
|
Davit update formatting and fix grad checkpointing (#7)
fixed head to gap->norm->fc as per convnext, along with option for norm->gap->fc
failed tests due to clip convnext models, davit tests passed
|
2023-01-15 14:34:56 -08:00 |
Ross Wightman
|
87939e6fab
|
Refactor device handling in scripts, distributed init to be less 'cuda' centric. More device args passed through where needed.
|
2022-09-23 16:08:59 -07:00 |
Ross Wightman
|
0dbd9352ce
|
Add bulk_runner script and updates to benchmark.py and validate.py for better error handling in bulk runs (used for benchmark and validation result runs). Improved batch size decay stepping on retry...
|
2022-07-18 18:04:54 -07:00 |
Ross Wightman
|
f0f9eccda8
|
Add --fuser arg to train/validate/benchmark scripts to select jit fuser type
|
2022-01-17 13:54:25 -08:00 |
Ross Wightman
|
e5da481073
|
Small post-merge tweak for freeze/unfreeze, add to __init__ for utils
|
2021-10-06 17:00:27 -07:00 |
Ross Wightman
|
7c97e66f7c
|
Remove commented code, add more consistent seed fn
|
2021-04-12 09:51:36 -07:00 |
Ross Wightman
|
4f49b94311
|
Initial AGC impl. Still testing.
|
2021-02-15 23:22:44 -08:00 |
Ross Wightman
|
27bbc70d71
|
Add back old ModelEma and rename new one to ModelEmaV2 to avoid compat breaks in dependant code. Shuffle train script, add a few comments, remove DataParallel support, support experimental torchscript training.
|
2020-11-29 16:22:19 -08:00 |
Ross Wightman
|
532e3b417d
|
Reorg of utils into separate modules
|
2020-09-07 13:58:09 -07:00 |