Commit Graph

783 Commits (e56d6edf19522c941f3cb724da8bcf331b61d85a)
 

Author SHA1 Message Date
Zaida Zhou 19aa1eb780
[Fix] Save checkpoint again to update best_ckpt of ckpt (#1168) 2023-06-02 14:42:56 +08:00
Mashiro 9d9f2b761e
[Fix] Fix unit test failed for limited CI resources (#1163) 2023-06-02 11:33:05 +08:00
Zaida Zhou cf477d15a2
[Docs] Add the usage of clearml (#1180) 2023-06-01 21:54:30 +08:00
i-aki-y 6df9621a06
[Feature] Add support for full wandb's define_metric arguments (#1099) 2023-06-01 21:50:29 +08:00
vugia truong 68414516aa
[Feature] Add vis backend for clearml (#878) (#1091) 2023-06-01 17:41:34 +08:00
Zaida Zhou 4a9e379c1a
[Feature] Support Sophia optimizers (#1170) 2023-05-30 15:44:06 +08:00
Xiang Xu a92f87099f
[Fix] Fix the AttributeError raised by _get_device_id (#1172) 2023-05-29 13:03:20 +08:00
Adrian Joshua Strutt 0ff89f7c36
[Enhancement] Handle the case for Multi-Instance GPUs when using cuda_visible_devices (#1164) 2023-05-28 21:30:36 +08:00
Zaida Zhou 691500dce6
[Docs] Move the usage of distributed training to a single document (#1171) 2023-05-28 20:10:16 +08:00
Zaida Zhou 193b7fdfcc
[Refactor] Let unit tests not affect each other (#1169) 2023-05-27 22:36:04 +08:00
Mashiro 5d4e72144a
[Fix] Fix `ProfileHook` can not profile performance in ddp-training (#1140) 2023-05-26 10:55:15 +08:00
zimonitrome 277b530880
[Fix] Close MLflowVisBackend only if active (#1144) (#1151) 2023-05-25 23:54:00 +08:00
Mashiro 426a565908
Fix unit test error due to program abort caused by limited resources (#1162)
* Fix unit test error due to program abort cause by limited resources

* clean the code
2023-05-25 16:22:17 +08:00
CokeDong 49613414b2
[Feature] Support dipu device (#1127) 2023-05-25 14:10:45 +08:00
Zaida Zhou 7451216259
fix format (#1160) 2023-05-23 17:20:42 +08:00
Eric Chen 55019b89c5
[Feature] MLflowVisBackend adds a new paramter to select config keys to be saved (#1159) 2023-05-23 17:07:04 +08:00
Xin Li d59acfbd9f
[Docs] Translate data_element.md (#1067) 2023-05-23 12:59:29 +08:00
sYonu9 4bc2fe1aae
[Fix] Correctly record random seed in log (#1152) 2023-05-19 14:14:56 +08:00
YQisme 15298f3621
[Docs] Fix a typo (#1154) 2023-05-19 14:13:59 +08:00
gy77 ec2e00ae90
[Docs] Fix a missing comma in tutorials/runner.md (#1146) 2023-05-16 14:20:44 +08:00
WZN 8a0fae01f5
[Enhancement] Add an option to control whether to use progress bar in BaseInference (#1135)
* show_track

* Update mmengine/infer/infer.py

* Update mmengine/infer/infer.py

---------

Co-authored-by: Zaida Zhou <58739961+zhouzaida@users.noreply.github.com>
2023-05-09 15:55:08 +08:00
TankNee d4bb561ff8
Add a text classification in examples (#1122) 2023-05-09 15:48:47 +08:00
Zaida Zhou 70c28415db
[Docs] Move translation of infer.md to en (#1138) 2023-05-09 11:45:18 +08:00
XHr 3b7c70fa97
[Docs] Translate infer.md (#1121) 2023-05-09 10:53:51 +08:00
Yinlei Sun fed0e3821a
[Enhance] Enable full precision training on Ascend NPU (#1109) 2023-05-06 17:17:32 +08:00
Mashiro 6cd7a43a7f
[Docs] Limit urllib3 version for docs building (#1132) 2023-05-06 14:04:16 +08:00
Peiwei Li 49c691fb5e
[Fix] Correctly saves metadata of state_dict when save checkpoints (#1131) 2023-05-06 12:51:06 +08:00
CescMessi 83d76abc7f
[Fix] Fix the incorrect device of inputs in get_model_complexity_info (#1130) 2023-05-06 10:55:17 +08:00
Mashiro 2085046d22
[Fix] The ann_file and data_root of BaseDataset can be None (#850) 2023-05-04 22:22:52 +08:00
Mashiro 3715fea15b
[Refactor] Refactor the unit tests of SyncBuffersHook (#813) 2023-04-28 17:32:30 +08:00
Mashiro 69b563dc3b
Bump version to v0.7.3 (#1118) 2023-04-28 14:27:57 +08:00
Mashiro 298a4b1e49
[Fix] Fix build unnecessary loop during train/test/val (#1107)
* [Fix] Fix build unnecessary loop during train/test/val

* move unit test to runner

* Update unit test

* Fix unit test

* check train_loop is None

* update comment

* replace(type(None)) with is not None
2023-04-27 19:20:35 +08:00
Ma Zerun 49b27dd83f
[Imporve] Support `_load_state_dict_post_hooks` in `load_state_dict`. (#1103)
* [Imporve] Support `_load_state_dict_post_hooks` in `load_state_dict`.

* Update

* Add unit test
2023-04-26 16:48:57 +08:00
Mashiro 6ba667c8cf
[Fix] Save optimizer.state_dict() in cpu by default (#966) 2023-04-26 16:47:47 +08:00
Mashiro 9868131c98
[Enhance] Enhance error message during custom import (#1102) 2023-04-26 11:08:58 +08:00
Mashiro 1c01594c5c
[Docs] Update links (#1108) 2023-04-25 18:51:11 +08:00
Zaida Zhou cdec4cbd4a
[Fix] collate_fn does not support passing a function object (#1093) 2023-04-24 20:42:54 +08:00
shufan wu 2aef53d3fa
[Fix] No training log when the num of iterations is smaller than the interval (#1046) 2023-04-24 12:29:20 +08:00
Qian Zhao 580c9d463c
[Fix] Distributed use correct local rank (#973)
* fix distributed local rank

* fix as comments

* improve readability
2023-04-24 10:28:03 +08:00
Mashiro 4afed1332b
[Enhance] Visualizer.show supports calling opencv to show images (#1015)
* [Enhance] Enhance the efficiency of Visualizer.show

* Update unit test

* Simplify the logic of creating opencv window

* Update docstring

* Update unit test

* Update mmengine/visualization/visualizer.py

---------

Co-authored-by: Zaida Zhou <58739961+zhouzaida@users.noreply.github.com>
2023-04-23 20:30:29 +08:00
cyberslack_lee 0687b377b2
[Enhancement] MessageHub.get_info() supports returning a default value (#991) 2023-04-23 17:35:35 +08:00
Mashiro 1db55358fc
[Enhance] Enhance docstring and error cathing in MessageHub (#1098) 2023-04-23 17:16:52 +08:00
sjiang95 fafb476e58
[Feature] get_model_complexity_info() supports multiple inputs (#1065) 2023-04-23 16:11:31 +08:00
Zaida Zhou 43165160e6
[Docs] Replace MMCls with MMPretrain in docs (#1096)
* [Docs] Replace MMCls with MMPretrain in docs

* fix format
2023-04-23 15:29:43 +08:00
zccjjj c9ee331e43
[Docs] Translate the design/evaluation.md (#1062) 2023-04-21 20:25:40 +08:00
Mashiro 17c5414d16
[Fix] Fix the resuming error caused by HistoryBuffer (#1078) 2023-04-21 17:23:38 +08:00
luomaoling 5b9a1544b0
[Feature] Add torch_npu optimizer (#1079) 2023-04-21 15:15:10 +08:00
Mashiro f1aca8e307
[Fix] Failed to remove the previous best checkpoints (#1086)
* [Fix] Only reserve one best checkpoint

* [Fix] Only reserve one best checkpoint

* Fix unit test

* shutdown logging

* clean the save_checkpoint logic
2023-04-20 21:28:56 +08:00
Luo Yihang 6ebb6f838a
[Fix] Call SyncBufferHook before validation in IterBasedTrainLoop (#982)
* [Fix] Call SyncBufferHook before validation in IterBasedTrainLoop

* Add before_val_epoch in SyncBuffersHook

* Fix white space format

* Add comments for SyncBuffersHook

* Add comments for SyncBuffersHook

Co-authored-by: Zaida Zhou <58739961+zhouzaida@users.noreply.github.com>

* Add comments for SyncBuffersHook

Co-authored-by: Zaida Zhou <58739961+zhouzaida@users.noreply.github.com>

* Fix white space format

* Add before_test_epoch

* Remove before_test_epoch

---------

Co-authored-by: Zaida Zhou <58739961+zhouzaida@users.noreply.github.com>
2023-04-20 16:00:42 +08:00
Mashiro 0e5f9da68b
[Fix] Fix merge ci (#1089)
* fix main ci

* Fix

* Fix

* Fix
2023-04-20 15:48:08 +08:00