Commit Graph

886 Commits (v0.10.4)
 

Author SHA1 Message Date
fanqiNO1 66fb81f7b3
Bump version to 0.10.4 (#1534) 2024-04-23 11:23:12 +08:00
Zhihao Lin acbc5e46dc
[Fix] Delete frozen parameters when using `paramwise_cfg` (#1441) 2024-04-22 19:54:48 +08:00
Hiram Foster 9ecced821b
Fix a typo (#1532) 2024-04-22 19:51:59 +08:00
Zhihao Lin 39ed23fae8
[Enhance] Enable `exclude_frozen_parameters` for `DeepSpeedEngine._zero3_consolidated_16bit_state_dict` (#1517) 2024-04-12 14:25:54 +08:00
Zhihao Lin e258c84824
Perform evaluation upon training completion (#1529) 2024-04-08 13:05:36 +08:00
Zaida Zhou 2c4516c622
Add the supported pytorch versions in README (#1512) 2024-03-06 11:00:42 +08:00
Zaida Zhou 447d3bba2c
Fix config of readthedocs (#1511) 2024-03-06 10:30:00 +08:00
David de la Iglesia Castro 2fe0ecec3d
[Feature] Support custom `artifact_location` in MLflowVisBackend (#1505) 2024-02-26 18:37:25 +08:00
jason_w c423d0c1da
Fix docstring of Config (#1506) 2024-02-24 09:46:01 +08:00
Zaida Zhou 9b98405672
Remove codeowners file (#1496) 2024-02-18 17:19:28 +08:00
Evan 4df682ba2d
Fix typos and remove fullwidth unicode chars (#1488) 2024-02-18 15:33:52 +08:00
fanqiNO1 ba5eed8409
[Fix] Fix warning capture (#1494) 2024-02-18 14:17:35 +08:00
Zaida Zhou f79111ecc0
fix typo (#1481) 2024-01-24 19:31:08 +08:00
Zaida Zhou b5f2d5860d
Refine mmengine introduction (#1479) 2024-01-24 19:27:02 +08:00
Zaida Zhou 02f80e8bdd
Bump version to 0.10.3 (#1478) 2024-01-24 12:45:00 +08:00
Zhihao Lin cd298e3086
[Feature] Support save_optimizer=False for DeepSpeed (#1474) 2024-01-24 11:12:54 +08:00
Anm半夏 396cac19cd
Fix a typo in visualizer.py (#1476) 2024-01-23 11:09:05 +08:00
hanhaowen-mt 3d8a611eec
[Feature] Add the support for musa device support (#1453) 2024-01-11 16:25:01 +08:00
Zhihao Lin 109cd44c7e
[Fix] Fix dist.collect_results to keep all ranks' elements (#1469) 2024-01-11 10:50:36 +08:00
Zhihao Lin b51bf60964
[Fix] Fix the resume of iteration (#1471) 2024-01-11 10:47:05 +08:00
Mashiro 4a50213c69
[Fix] Fix Config.to_dict (#1465) 2024-01-02 16:07:54 +08:00
Zaida Zhou e4600a6993
[Docs] Add the usage of ProfilerHook (#1466) 2024-01-02 15:59:37 +08:00
XiwuChen 369f15e27a
[Docs] Fix nnodes in the doc of ddp training (#1462) 2024-01-02 10:42:58 +08:00
fanqiNO1 1398e4200e
bump version to v0.10.2 (#1460) 2023-12-26 16:30:01 +08:00
lanzeshun 8e6fb12b1f
[Fix] Support multi-node distributed training with NPU backend (#1459) 2023-12-26 16:14:45 +08:00
fanqiNO1 671f3bcdf4
[Fix] Fix placement policy in ColossalAIStrategy (#1440) 2023-12-23 16:24:39 +08:00
SCZwangxiao efcd364124
[Fix] Fix load_model_state_dict in BaseStrategy (#1447) 2023-12-23 11:17:46 +08:00
del-zhenwu 504fa4f5cb
[Fix] Use ImportError to cover ModuleNotFoundError raised by opencv-python (#1438) 2023-12-23 11:15:20 +08:00
fanqiNO1 85c0976bc2
bump version to v0.10.1 (#1436) 2023-11-22 11:12:04 +08:00
fanqiNO1 e461581e55
[Docs] Add build mmengine-lite from source (#1435) 2023-11-22 11:02:12 +08:00
fanqiNO1 246ec1ff35
[Fix] Fix collect_env without opencv (#1434) 2023-11-22 10:50:23 +08:00
fanqiNO1 bdd653a8c3
[Fix] Fix deploy.yml (#1431) 2023-11-21 11:21:36 +08:00
fanqiNO1 be48e8b2f4
bump version to v0.10.0 (#1430) 2023-11-21 09:33:51 +08:00
fanqiNO1 6be0aeb777
[Feature] Support for installing mmengine without opencv (#1429) 2023-11-20 22:00:46 +08:00
fanqiNO1 a5db5bedb9
[Fix] Fix CI for torch2.1.0 (#1418) 2023-11-20 19:31:14 +08:00
fanqiNO1 fd5d06243f
[Fix] Fix scale_lr in SingleDeviceStrategy (#1428) 2023-11-20 16:36:43 +08:00
whcao 5a90805b1e
[Bugs] Fix bugs in colo optimwrapper (#1426) 2023-11-14 17:09:26 +08:00
Zhihao Lin 26f22ed283
[Fix] Support exclude_frozen_parameters for DeepSpeedStrategy's resume (#1424) 2023-11-08 23:35:12 +08:00
fanqiNO1 46784185cf
bump version to v0.9.1 (#1421) 2023-11-03 16:03:56 +08:00
fanqiNO1 eb4fa73b56
[Enhancement] Enhance inputs_to_half in DeepSpeedStrategy (#1400) 2023-11-02 17:19:42 +08:00
Zhihao Lin 27ab6a69f5
[Feature] Add `exclude_frozen_parameters` for `DeepSpeedStrategy` (#1415) 2023-11-02 14:32:55 +08:00
Jon 2a563f4dd5
[Fix] ConcatDataset raises error when metainfo is np.array (#1407) 2023-10-31 17:19:34 +08:00
Peng Lu e0cf958074
[Fix] Fix a bug when module is missing in low version of bitsandbytes (#1388) 2023-10-31 16:59:39 +08:00
whlook b0c701a4c9
[Fix] Fix func params using without init in OneCycleLR (#1403) 2023-10-31 14:12:50 +08:00
Mashiro e43bbb5e03
[Fix] Fix new config in visualizer (#1390) 2023-10-26 15:31:03 +08:00
Zaida Zhou c65187c6b8
[Docs] Rename master to main (#1397) 2023-10-18 19:13:26 +08:00
Yiyao Yang 7495b33f34
Add torch 2.1.0 checking in CI (#1389) 2023-10-18 18:42:13 +08:00
POI-WX d198b53426
[Feature] Support slurm distributed training for mlu devices (#1396) 2023-10-18 16:22:31 +08:00
fanqiNO1 6c5eebb823
bump version to v0.9.0 (#1384) 2023-10-10 18:55:41 +08:00
fanqiNO1 3b639da1ef
[Docs] Fix typo (#1385) 2023-10-10 11:32:46 +08:00