Ross Wightman
13cf68850b
Remove poorly named metrics from torch imagenet example origins. Use top1/top5 in csv output for consistency with existing validation results files, acc elsewhere. Fixes #111
2020-04-10 14:41:08 -07:00
Andrew Lavin
d92cc4da54
Modified save_checkpoint to always save last checkpoint. Fixes #98.
2020-03-09 22:41:33 -07:00
Ross Wightman
f098fda2ca
Add map_location='cpu' to ModelEma resume, should improve #72
2020-02-12 13:23:56 -08:00
Ross Wightman
a435ea1327
Change reduce_bn to distribute_bn, add ability to choose between broadcast and reduce (mean). Add crop_pct arg to allow selecting validation crop while training.
2019-12-19 22:56:54 -08:00
Ross Wightman
3bff2b21dc
Add support for keeping running bn stats the same across distributed training nodes before eval/save
2019-12-05 22:35:40 -08:00
Ross Wightman
3d9c8a6489
Add support for new AMP checkpointing w/ amp.state_dict
2019-08-29 15:19:18 -07:00
Ross Wightman
6cdf35e670
Add explicit half/fp16 support to loader and validation script
2019-07-05 13:52:25 -07:00
Ross Wightman
c6b32cbe73
A number of tweaks to arguments, epoch handling, config
* reorganize train args
* allow resolve_data_config to be used with dict args, not just argparse
* stop incrementing epoch before save, more consistent naming vs csv, etc
* update resume and start epoch handling to match above
* stop auto-incrementing epoch in scheduler
2019-06-28 13:49:20 -07:00
Ross Wightman
827a3d6010
Add current checkpoints output back to CheckpointSaver (via logger)
2019-06-21 11:57:43 -07:00
Ross Wightman
6fc886acaf
Remove all prints, change most to logging calls, tweak alignment of batch logs, improve setup.py
2019-06-20 17:29:25 -07:00
Ross Wightman
aa4354f466
Big re-org, working towards making pip/module as 'timm'
2019-06-19 17:20:51 -07:00