Mashiro
5b648c119f
[Fix] Fix LogProcessor._get_iter ( #370 )
2022-07-19 18:30:59 +08:00
RangiLyu
1241c21296
[Fix] Fix weight initializing in test and refine registry logging. ( #367 )
...
* [Fix] Fix weight initializing and registry logging.
* sync params
* resolve comments
2022-07-19 18:28:57 +08:00
Ma Zerun
3da66d1f87
[Enhance] Auto set the `end` of param schedulers. ( #361 )
...
* [Enhance] Auto set the `end` of param schedulers.
* Add log output and unit test
* Update docstring
* Update unit tests of `CosineAnnealingParamScheduler`.
2022-07-15 19:53:28 +08:00
Mashiro
78fad67d0d
[Fix] fix resume message_hub ( #353 )
...
* fix resume message_hub
* add unit test
* support resume from messagehub
* minor refine
* add comment
* fix typo
* update docstring
2022-07-14 20:13:22 +08:00
Mashiro
c9c6d454f1
init OptimWrapper after resume ( #362 )
2022-07-14 20:10:48 +08:00
Mashiro
45001a1f6f
[Enhance] Support using variables in base config directly as normal variables. ( #329 )
...
* first commit
* Support modify base config and add unit test
* remove import mmengine in config
* add unit test
* fix lint
* add unit test
* move RemoveAssignFromAST to config utils
* git add utils
* fix format issue in test file
* refine unit test
* refine unit test
2022-07-14 13:05:55 +08:00
Mashiro
6b608b4ef1
[Enhance] Add `build_model_from_cfg` ( #328 )
...
* clean code
* fix as comment
* fix as comment
* add get_registry_by_scope method
* add unit test and docstring example
* rename get_registry_by_scope to switch_scope_and_registry
* move build function to registry/builder
* fix docstring
* rename builder->registry_builder, move build_from_cfg to registry_builder
rename builder->registry_builder, move build_from_cfg to registry_builder
* rename registry_builder to build_function
rename registry_builder to build_function
* fix docstring and type hint
* rename build_function to build_functions
2022-07-13 19:01:59 +08:00
ChaimZhu
acd9cab25a
[Fix] fix `is_abs` criterion error ( #358 )
...
* fix abs criterion error
* update
2022-07-12 11:16:08 +08:00
Mashiro
936c4ebc58
[Fix] Fix missing device ids in wrap_model ( #351 )
...
* fix missing device ids in wrap_model
* clean the code
* use default broadcast_buffers
* refine MMSeparateDistributedDataParallel
* rename tmp variable
* refine docstring
* add type hints
* refactor docstring of ddp model
* add arg in docstring
* minor refine
* better ddp link
2022-07-08 15:01:47 +08:00
VVsssssk
792f481efe
[Fix]fix ClassBalancedDataset ( #354 )
...
* fix cbds
* fix
2022-07-08 14:51:51 +08:00
Mashiro
b2ee9f8b11
[Fix] Fix loss could be nan in optimizer wrapper ( #345 )
...
* fix optimizer wrapper counts
* fix ut
2022-07-06 16:42:49 +08:00
Mashiro
96378fa748
[Fix] make `autocast` compatible with GTX1660 and make it more robust. ( #344 )
...
* fix amp
* fix amp
* make logic consistent with torch.autocast
* support multiple device
* fix as comment
* fix as comment
* avoid circle import
2022-07-05 20:37:56 +08:00
RangiLyu
a3d2916790
[Enhance] Support scheduling betas with MomentumScheduler. ( #346 )
...
* [Enhance] Support scheduling betas with MomentumScheduler.
* enhance ut
* test adam betas
* enhance ut
* enhance ut
2022-07-05 20:37:23 +08:00
Mashiro
2853045e96
[Fix] Fix build multiple runners error ( #348 )
...
* fix build multiple runner error
* fix comments
* fix cpu ci
2022-07-05 20:35:06 +08:00
Mashiro
38e78d5549
[Fix] Fix ema hook and add unit test ( #327 )
...
* Fix ema hook and add unit test
* save state_dict of ema.module
save state_dict of ema.module
* replace warning.warn with MMLogger.warn
* fix as comment
* fix bug
* fix bug
2022-07-04 14:23:23 +08:00
Cedric Luo
9c55b4300c
[Enhance] Support dynamic interval ( #342 )
...
* support dynamic interval in iterbasedtrainloop
* update typehint
* update typehint
* add dynamic interval in epochbasedtrainloop
* update
* fix
Co-authored-by: luochunhua.vendor <luochunhua@pjlab.org.cn>
2022-06-30 15:08:56 +08:00
LeoXing1996
d65350a9da
[Fix] Fix bug of not save-best in iteration-based training ( #341 )
...
* fix bug of not save-best in iteration-based training
* revise the unit test
2022-06-30 14:51:31 +08:00
Mashiro
59b0ccfe6f
[Fix] Fix pytorch version compatibility of autocast ( #339 )
...
* fix unit test of autocast
* fix compatiblity of unit test of optimizerwrapper
* clean code
* fix as comment
* fix docstring
2022-06-29 20:30:53 +08:00
Mashiro
5ac3c23338
[Fix]: fix MMSeparateDistributedDataParallel ( #338 )
2022-06-28 22:20:20 +08:00
Mashiro
d624fa9191
[Enhance] assert image shape before forward ( #300 )
...
* assert image shape before forward
* add unit test
* enhance error message
* allow gray image input
* fix as comment
* fix unit test
* fix unit test
2022-06-28 11:46:12 +08:00
Zaida Zhou
6015fd35e5
Fix docstring format ( #337 )
2022-06-28 11:04:55 +08:00
Mashiro
2fd6beb972
[Fix] Fix UT of optimizer wrapper failed in pytorch1.6 ( #340 )
2022-06-28 10:31:14 +08:00
Jiazhen Wang
bbe00274c8
[Enhance] LR and Momentum Visualizer ( #267 )
...
* impl lr and momentum visualizer
* provide fakerun
2022-06-27 15:00:11 +08:00
Jiazhen Wang
3af3d40541
[Enhance] Refine BaseDataset ( #303 )
...
* refine data_root and data_prefix params
* modify unittest
2022-06-27 14:59:56 +08:00
Yuan Liu
03d5c17ba6
[Feature]: Set different seed to different rank ( #298 )
...
* [Feature]: Set different seed for diff rank
* [Feature]: Add log
* [Fix]: Fix lint
* [Fix]: Fix docstring
* [Fix]: Fix sampler seed
* [Fix]: Fix log bug
* [Fix]: Change diff_seed to diff_rank_seed
* [Fix]: Fix lint
2022-06-24 14:28:16 +08:00
Jiazhen Wang
12f7d3a0d3
[Fix]: fix load_checkpoint ( #332 )
2022-06-23 16:53:53 +08:00
Alex Yang
2994195be2
[Feat] Support training on MPS ( #331 )
...
* [Feat] Support mps
* fix docstring
2022-06-23 16:53:19 +08:00
Zaida Zhou
e877862d5b
[Docs] Improve docstring ( #324 )
...
* Fix docstring format of BaseDataElement
* fix docstring
2022-06-23 16:08:56 +08:00
Mashiro
a4f5533db6
fix torch 1.10 amp error ( #330 )
2022-06-22 23:12:20 +08:00
Haian Huang(深度眸)
2b8a32eca0
[Fix]: fix RuntimeError of SyncBuffersHook ( #309 )
...
* fix RuntimeError of SyncBuffersHook
* add UT
2022-06-22 20:00:46 +08:00
Alex Yang
e18832f046
[Feat] Support revert syncbn ( #326 )
...
* [Feat] Support revert syncbn
* use logger.info but not warning
* fix info string
2022-06-22 19:50:54 +08:00
Mashiro
312f264ecd
[Feature] Add autocast wrapper ( #307 )
...
* add autocast wrapper
* fix docstring
* fix docstring
* fix compare version
* fix unit test
* fix incompatible arguments
* fix as comment
* fix unit test
* rename auto_cast to autocast
2022-06-22 19:49:20 +08:00
Alex Yang
216521a936
[Feat] Support save best ckpt ( #310 )
...
* [Feat] Support save best ckpt
* reformat code
* rename function and reformat code
* fix logging info
2022-06-22 19:48:46 +08:00
Zaida Zhou
c451e71998
Add storage backends in init file ( #325 )
2022-06-22 19:41:31 +08:00
Mashiro
7154df2618
[Enhance] LogProcessor support custom significant digit ( #311 )
...
* LogProcessor support custom significant digit
* rename to num_digits
2022-06-22 19:35:52 +08:00
Jiazhen Wang
2086bc4554
[Feature] Fully support to use MLU for training ( #313 )
...
* modify cuda() to to()
* rollback load_checkpoint
* refine runner
* add TODO
2022-06-22 19:33:35 +08:00
Mashiro
afeac1c098
[Feature]: support to dump result in LoggerHook.after_test_epoch ( #321 )
2022-06-22 19:10:58 +08:00
Zaida Zhou
6501d21eab
[Fix]: fix mdformat version to support python3.6 ( #315 )
2022-06-21 16:32:58 +08:00
Alex Yang
dceef1f66f
[Refactor] Refactor `after_val_epoch` to make it output metric by epoch ( #278 )
...
* [Refactor]:Refactor `after_val_epoch` to make it output metric by epoch
* add an option for user to choose the way of outputing metric
* rename variable
* reformat docstring
* add type alias
* reformat code
* add test function
* add comment and test code
* add comment and test code
2022-06-21 15:39:59 +08:00
Alex Yang
ef946404e6
[Feat] Support FSDP Training ( #304 )
...
* [Feat] Support FSDP Training
* fix version comparison
* change param format and move `FSDP_WRAP_POLICY` to wrapper file
* add docstring and type hint,reformat code
* fix type hint
* fix typo, reformat code
2022-06-21 15:32:56 +08:00
Zaida Zhou
e76517c63a
[Doc]: Update hooks docs ( #317 )
2022-06-21 15:13:53 +08:00
Zaida Zhou
d09af9ead4
[Doc]: update root registries in docs ( #316 )
2022-06-21 15:12:49 +08:00
Tao Gong
45f5859b50
[Doc]: refactor docs for basedataset ( #318 )
2022-06-21 14:58:10 +08:00
Mashiro
44538e56c5
[Doc]: refine logging doc ( #320 )
2022-06-21 14:55:21 +08:00
Jiazhen Wang
e1422a34a3
[Fix]: Fix missing schedulers in __init__.py of schedulers ( #319 )
2022-06-21 14:40:00 +08:00
RangiLyu
e470c3aa1b
[Fix]: fix SWA in pytorch 1.6 ( #312 )
2022-06-21 14:35:22 +08:00
Mashiro
bc763758d8
Fix resource package in windows ( #308 )
...
* move import resource
* move import resource
2022-06-17 14:43:27 +08:00
Mashiro
4a4d6b1ab2
[Enhance] dump messagehub in runner.resume ( #237 )
...
* [Enhance] dump messagehub in runner.resume
* delete unnecessary code
* delete debugging code
Co-authored-by: imabackstabber <312276423@qq.com>
2022-06-17 11:10:37 +08:00
Mashiro
7129a98e36
[Fix]: fix log processor to log average time and grad norm ( #292 )
2022-06-17 10:54:20 +08:00
Jiazhen Wang
7b55c5bdbf
[Feature] Support resume from Ceph ( #294 )
...
* support resume from ceph
* move func and refine
* delete symlink
* fix unittest
* perserve _allow_symlink and symlink
2022-06-17 10:37:19 +08:00