Commit Graph

244 Commits (fbf41e09134b113f8e79ae01b4eee40d00797b2d)

Author SHA1 Message Date
Glenn Jocher 65857adf52 update train.py ckpt loading 2020-07-18 14:34:12 -07:00
Glenn Jocher 03489aaafb comment tb_writer.add_hparams(hyp, {}) 2020-07-15 10:38:32 -07:00
Glenn Jocher 1e94bcf3d2
Merge pull request #384 from jancio/master
Log hyperparameters in tensorboard
2020-07-14 12:32:27 -07:00
Glenn Jocher 120d40c06a
Update train.py
This updates the PR to a one-liner to minimize additions. Perhaps we can include opt in the future but let's start with this for now.
2020-07-14 12:32:08 -07:00
Janko Ondras 38acc5f3c5 Fix img_size naming in hyperparameters logging 2020-07-14 08:48:46 +02:00
Glenn Jocher a1c8406af3 EMA and non_blocking=True 2020-07-13 20:19:10 -07:00
Glenn Jocher 140d84cca1 comment updates 2020-07-13 12:17:52 -07:00
Janko Ondras e558963244 Log hyperparameters in tensorboard
Log both hyperparameters and command line options in tensorboard.
2020-07-13 12:55:43 +02:00
Glenn Jocher 01a73ec08e multi-gpu ckpt filesize bug fix #253 2020-07-11 12:39:27 -07:00
Glenn Jocher a586751904 multi-gpu ckpt filesize bug fix #253 2020-07-11 12:35:21 -07:00
Glenn Jocher 5de4e25d68 update tensorboard metric 2020-07-11 11:39:02 -07:00
Glenn Jocher 98fc483abc train.py results.txt to bucket bug fix 2020-07-11 09:31:53 -07:00
Glenn Jocher 1c13e67b33 evolution bug fix #346 2020-07-09 22:08:42 -07:00
Glenn Jocher e16e9e43e1 new nc=len(names) check 2020-07-09 17:10:43 -07:00
Glenn Jocher cb527d3af9 new nc=len(names) check 2020-07-09 17:03:12 -07:00
Glenn Jocher 603ea0bfdc update log_dir to runs/exp #107 2020-07-09 15:58:07 -07:00
Glenn Jocher 72d5b58b9a disable LR plot to suppress warning message 2020-07-09 15:16:57 -07:00
Glenn Jocher 24c5a941f0 --resume EMA fix #292 2020-07-09 15:09:06 -07:00
Alex Stoken 9d631408a2
Move hyp and opt yaml save to top of train()
Fixes bug where scaled values were saved in hyp.yaml, which would cause continuity issues with --resume
2020-07-09 16:18:55 -05:00
Glenn Jocher bf6f41567a hyperparameter printout update 2020-07-08 17:21:00 -07:00
Glenn Jocher dc5e18390a
Merge branch 'master' into advanced_logging 2020-07-08 17:01:19 -07:00
Glenn Jocher 6b134d93c5
Update train.py 2020-07-08 16:58:13 -07:00
Glenn Jocher 16f6834486 update train.py and experimental.py 2020-07-08 14:23:34 -07:00
Alex Stoken 52bac22f09 Add in --resume functionality with option to specify path or to get most recent run 2020-07-07 10:42:28 -05:00
Alex Stoken f517ba81c7
Merge branch 'master' into advanced_logging 2020-07-06 16:52:11 -05:00
Laughing 956511dafd
fix LR bug 2020-07-05 15:08:24 +08:00
Glenn Jocher bb3c346916 model.yaml nc inherited from dataset.yaml 2020-07-04 17:51:54 -07:00
Glenn Jocher df224a0d8f EMA bug fix #279 2020-07-03 11:56:14 -07:00
Glenn Jocher 3bdea3f697 strip_optimizer() bug fix #253 2020-07-02 21:24:26 -07:00
Glenn Jocher e02a189a3a
Merge pull request #245 from yxNONG/patch-2
Unify the check point of single and multi GPU
2020-07-02 12:05:46 -07:00
Glenn Jocher 597ed4ce63
Update train.py 2020-07-02 12:00:55 -07:00
Glenn Jocher 13f69777a6 typo fix 2020-07-02 09:26:03 -07:00
yxNONG 1aa2b67933
Update train.py 2020-07-02 13:51:52 +08:00
Glenn Jocher 86784cfdbf --resume bug fix #252 2020-06-30 21:43:53 -07:00
Glenn Jocher ad4c22cbfe --resume bug fix #187 2020-06-30 16:16:29 -07:00
Glenn Jocher 3b16c865f0 assert --epochs 2020-06-30 14:08:08 -07:00
yxNONG cdb9bde181
Unify the check point of single and multi GPU
save the model.hyp etc to checkpoint when use multi GPU training
2020-06-30 19:06:28 +08:00
Glenn Jocher b203c9b7ff update train.py incompatible model message fix #222 2020-06-29 12:45:25 -07:00
Glenn Jocher 37e13f8846 update mosaic border 2020-06-27 13:50:15 -07:00
Alex Stoken e18e6811dc
Merge branch 'master' into advanced_logging 2020-06-27 10:13:03 -05:00
Glenn Jocher 22fb2b0c25 refactor dataloader 2020-06-26 18:56:13 -07:00
Glenn Jocher 256a3e89d2 small dataset bug fix #140 2020-06-25 17:52:56 -07:00
Glenn Jocher b50fdf16af model.names multi-GPU bug fix #94 2020-06-24 22:22:13 -07:00
Alex Stoken de191655e4 Fix yaml saving (don't sort keys), reorder --opt keys, bug fix hyp dict accessor 2020-06-24 17:21:54 -05:00
Alex Stoken 2d396bea00 Fix bug in --help from percent sign in help string 2020-06-24 16:57:12 -05:00
Glenn Jocher b8557f87e3 add stride to datasets.py 2020-06-24 13:02:27 -07:00
Alex Stoken 611aacf1bf Turn opt into dictionary before sending it to yaml 2020-06-24 10:49:08 -05:00
Alex Stoken bc4ef4861b Default optimizer SGD 2020-06-24 10:07:43 -05:00
Alex Stoken 7abf202cad Mode all optimizer settings to 'hyp.yaml', integrate proper momentum with Adam optimizer 2020-06-24 10:03:21 -05:00
Alex Stoken 7edbf6570e Fix help message for cfg files 2020-06-24 09:45:57 -05:00
Alex Stoken d64ad0fbf3 Remove --resume functionality and related checks/logic. 2020-06-24 09:17:27 -05:00
Glenn Jocher 6c1b87a42e update google_utils import 2020-06-22 23:00:23 -07:00
Glenn Jocher 1f1917ef56 remove fast, add merge 2020-06-21 13:37:11 -07:00
Alex Stoken e572bb0803 Add plot_results save location to log_dir 2020-06-21 09:36:28 -05:00
Lornatang 899f1d4bde Fix DDP bug in single process multiple device use cases 2020-06-20 13:00:03 +08:00
Glenn Jocher cdf1eac9f7
Merge pull request #107 from Lornatang/fix-reference-bugs
fix refrence bug
2020-06-19 14:51:41 -07:00
Glenn Jocher cce95e744d backbone as FP16, save default to FP32 2020-06-18 00:13:18 -07:00
Glenn Jocher d9b64c27c2 save ckpt in FP16 #119 2020-06-17 22:34:13 -07:00
Glenn Jocher 9fdb0fbacf AutoAnchor bug fix # 117 2020-06-17 19:51:15 -07:00
Alex Stoken c8152c81a6 Syntax fixes 2020-06-17 16:32:13 -05:00
Alex Stoken 9b7386f603 Add save_dir arg to test.test, use arg as location for saving batch jpgs 2020-06-17 16:08:46 -05:00
Alex Stoken 945307beba Add save_dir to plot_lr_scheduler and plot_labels
Set save_dir = log_dir in train.py
2020-06-17 16:03:18 -05:00
Alex Stoken 3b2b330872 Move results.txt from weights/ to log_dir 2020-06-17 15:55:45 -05:00
Alex Stoken ade023cff2 Fix hyp file read in and dict update.
Add example of hyp yaml
2020-06-17 10:59:20 -05:00
Lornatang 2368603484 fix refrence bug
In torch==1.5, the import of the API has changed. Although it does not interrupt the operation of the program, it seems to me to be an implicit error and may throw an exception in later versions.
2020-06-17 09:56:26 +08:00
Glenn Jocher 8db51c7002 tb_writer bug fix 2020-06-16 16:05:28 -07:00
Alex Stoken 5f2eeba233 remove old print statements 2020-06-16 17:09:39 -05:00
Glenn Jocher afe1df385b dist.destroy_process_group() bug fix 2020-06-16 15:08:14 -07:00
Alex Stoken 333f678b37 add update default hyp dict with provided yaml 2020-06-16 16:36:20 -05:00
Alex Stoken a448c3bcd7 add logic for resuming and getting hyp for resume run 2020-06-16 16:30:12 -05:00
Alex Stoken 25e51bcec7 add util function to get most recent last.pt file
added logic in train.py __main__ to handle resuming from a run
2020-06-16 15:50:27 -05:00
Alex Stoken 490f1e7b9c add save_dir arg to plot_lr_scheduler, default to current dir.
Uncomment plot_lr_scheduler in train() and pass log_dir as save location
2020-06-16 15:13:03 -05:00
Alex Stoken 4418809cf5 change weights dir (wdir) to be unique to each run, under log_dir 2020-06-16 15:09:51 -05:00
Alex Stoken d9f446cd81 add save yaml of opt and hyp to tensorboard log_dir in train() 2020-06-16 15:06:13 -05:00
Alex Stoken a85e6d0fc0 add parser arg for hyp yaml file 2020-06-16 14:53:32 -05:00
Glenn Jocher 5a50491fa1 check_anchors bug fix 2020-06-16 10:36:35 -07:00
Glenn Jocher 05b8ee5ca4 check_anchors() bug fix #102 2020-06-16 10:34:16 -07:00
Glenn Jocher ec81c7b5f2 check_anchors() bug fix #90 2020-06-16 10:14:04 -07:00
Glenn Jocher 8b26e89006 AutoAnchor bug fix #72 2020-06-16 00:53:34 -07:00
Glenn Jocher bdd9fee841 update fast mode 2020-06-16 00:29:54 -07:00
Glenn Jocher 1c0b6236e3 update fast mode 2020-06-16 00:11:29 -07:00
Glenn Jocher 915b1481fc default check_git_status() to True 2020-06-15 16:18:46 -07:00
Glenn Jocher 14523bb030 FP16 to FP32 ckpt load 2020-06-15 13:18:39 -07:00
Glenn Jocher c5966abba8 glob search bug fix #77 2020-06-15 12:08:57 -07:00
Glenn Jocher 31f3310029 assert best possible recall > 0.9 before training 2020-06-13 15:05:41 -07:00
Lauritzen Kasper Primdal c3d4d321d3 Ensures weights/ dir exists
Allows train.py to be run outside of yolov5/ directory.
2020-06-13 14:44:23 +02:00
Glenn Jocher 099e6f5ebd --img-size stride-multiple verification 2020-06-12 22:10:46 -07:00
Glenn Jocher 22d6088205 speed-reproducibility fix #17 2020-06-05 13:07:09 -07:00
Glenn Jocher 55ca5c74d2 multi-scale fix #16 2020-06-05 12:57:16 -07:00
Glenn Jocher 7c2832cd49 assert equal model and dataset classes 2020-06-04 17:21:22 -07:00
Glenn Jocher 11121e39ed updates 2020-06-04 16:52:07 -07:00
Glenn Jocher eb97b2e413 NMS fast mode 2020-06-03 13:02:59 -07:00
Glenn Jocher ce36905358 updates 2020-05-30 00:12:45 -07:00
Glenn Jocher 1e84a23f38 initial commit 2020-05-29 17:04:54 -07:00