816 Commits

Author SHA1 Message Date
Zaida Zhou
45ee96d0c4
[Docs] Add activation checkpointing usage (#1341) 2023-09-05 11:23:44 +08:00
Zeyuan
ccd17571ce
[Feature] Implement gradient checkpointing (#1319) 2023-09-04 23:29:24 +08:00
Zaida Zhou
9aa883a24c
Fix the type check of tasks in progress bar (#1340) 2023-09-04 19:27:58 +08:00
Zaida Zhou
8c934d2681
Refine error message (#1338) 2023-09-04 16:20:41 +08:00
LRJKD
5671b53bc5
[Fix] Adapt to PyTorch v2.1 on Ascend (#1332) 2023-09-01 16:55:45 +08:00
lizuoxin-nreal
762c9a25b6
[Fix] Fix ndarray metainfo check in ConcatDataset (#1333) 2023-09-01 16:40:26 +08:00
Kevin Wang
8a7e80e9e0
[Feature] Support using other file handlers (#1188) 2023-08-30 20:37:39 +08:00
王永韬
0939d95c93
[Feature ] Add progressbar rich (#1157) 2023-08-30 20:10:07 +08:00
zhengjie.xu
f24144d317
Update QRCode (#1328) 2023-08-30 16:08:45 +08:00
Mashiro
170758aefe
[Fix] Fix get optimizer_cls (#1324) 2023-08-28 16:15:00 +08:00
LZHgrla
714c8eedc3
[Enhance] Unify the parameter style of DeepSpeedStrategy (#1320) 2023-08-25 19:38:58 +08:00
LRJKD
a53c2802a6
[Fix] Fix multi card issue in PyTorch v2.1 on Ascend (#1321) 2023-08-25 10:35:58 +08:00
xuuyangg
e1c6079d73
[Feature] Add collect_results support for Ascend NPU (#1309) 2023-08-23 10:15:23 +08:00
Mashiro
19ab172b2d
Fix typos and documents of colossalai (#1315) 2023-08-22 16:13:55 +08:00
Mashiro
db32234241
[Feature] Add colossalai strategy (#1299) 2023-08-18 15:09:35 +08:00
Zaida Zhou
03ad86cfd2
[Docs] Add a image for neptune (#1312) 2023-08-18 10:48:55 +08:00
Theodore
43e308caaf
[Feature] Add NeptuneVisBackend (#1311) 2023-08-17 23:29:58 +08:00
Zaida Zhou
a483dba9d1
fix typo (#1298) 2023-08-08 10:49:13 +08:00
Zaida Zhou
bbd416a55d
Ignore examples in CI (#1297) 2023-08-07 23:12:51 +08:00
Zaida Zhou
488fddc950
[Docs] Add a new ecosystem in README (#1296) 2023-08-07 22:58:14 +08:00
Zaida Zhou
6b0d5a5f1d
[Docs] Add README for examples (#1295) 2023-08-07 22:53:57 +08:00
Desjajja
398d229910
Add a text translation example (#1283) 2023-08-07 15:33:48 +08:00
Mashiro
d9fee4fbb1
bump version to v0.8.4 (#1291) v0.8.4 2023-08-03 22:13:49 +08:00
Mashiro
b24f3d9a45
[Fix] Fix config in colab (#1290) 2023-08-03 21:19:40 +08:00
Mashiro
45a3d310be
[Fix] Skip adding vis_backend when save_dir is not set (#1289) 2023-08-03 20:27:45 +08:00
Zaida Zhou
a54e814bf8
[Docs] Fix unused parameters (#1288) 2023-08-03 15:45:30 +08:00
Mashiro
5c5ec8b168
Add a segmentation example (#1282) 2023-08-03 15:27:58 +08:00
LZH
d772ad0962
[Enhance] Support callable collate_fn for FlexibleRunner (#1284) 2023-08-01 19:45:54 +08:00
Mashiro
d480df7112
bump version to v0.8.3 (#1278) v0.8.3 2023-07-31 16:57:06 +08:00
Zaida Zhou
5ef75fd7a7
[Docs] Introduce how to customize distributed training settings (#1279) 2023-07-31 15:40:45 +08:00
Xinyu Yang
4fdab5e9cb
[Fix] Fix Visualizer that built vis_backends will not be used when save_dir is None (#1275) 2023-07-31 15:29:44 +08:00
Zaida Zhou
2df93eb51f
Add the loop stage in message_hub (#1277) 2023-07-31 14:22:49 +08:00
Mashiro
237aee3866
[Enhance] Enhance error information in build function (#1088) 2023-07-28 11:21:09 +08:00
Mashiro
e56d6edf19
[Enhance] Ehance error message thrown by Config (#1270) 2023-07-28 10:09:47 +08:00
Mashiro
42fdbc2ddb
[Fix] Fix ConfigDict.items will be called during dump (#1272) 2023-07-28 10:08:10 +08:00
youkaichao
ee742da254
[Enhance] Use graph transform to deal with more general cases for efficient_conv_bn_eval (#1259) 2023-07-26 17:52:57 +08:00
vugia truong
c8a1264568
[Feature] Compare the difference of two configs (#1260) 2023-07-26 15:48:59 +08:00
黄启元
78205c3254
Support multi-node distributed training with MLU backend (#1266) 2023-07-26 10:32:53 +08:00
KerwinKai
68360e7ce8
[Feature] Add parameter save_begin for CheckpointHook (#1271) 2023-07-25 19:21:21 +08:00
Mashiro
3871881ef6
[Enhance] Support skipping initialization in BaseModule (#1263) 2023-07-25 12:56:42 +08:00
Mashiro
6187595677
Move data_preprocessor to target device in FSDPStrategy (#1261) 2023-07-24 10:42:53 +08:00
Mashiro
f4f2555324
Llama2 example (#1264) 2023-07-24 10:20:21 +08:00
Qingyun
86387da4a5
[Enhancement] Register DeepSpeedStrategy even if deepspeed is not installed (#1240) 2023-07-18 11:02:55 +08:00
Zaida Zhou
5bc841c09c
[Docs] Add the data flow of Runner in README (#1257) 2023-07-18 10:09:42 +08:00
Miguel Méndez
9f21d38907
[Docs] Add short explanation about registry scope (#1114) 2023-07-17 10:21:44 +08:00
youkaichao
66d828d8d3
[Enhancement] Rename fast_conv_bn_eval to efficient_conv_bn_eval (#1251) 2023-07-15 22:13:17 +08:00
i-aki-y
276c614bf5
[Fix] Fix scalar check in RuntimeInfoHook (#1250) 2023-07-15 22:03:43 +08:00
youkaichao
40e49ff747
[Feature] Enable fast conv bn eval (#1202) 2023-07-14 18:21:55 +08:00
Zaida Zhou
278f7f5666
[Docs] Add ecosystem in README (#1247) 2023-07-12 14:06:28 +08:00
Mashiro
0549e1ce01
Bump version to v0.8.2 (#1246) v0.8.2 2023-07-12 10:57:12 +08:00