Evan
4df682ba2d
Fix typos and remove fullwidth unicode chars ( #1488 )
2024-02-18 15:33:52 +08:00
XiwuChen
369f15e27a
[Docs] Fix nnodes in the doc of ddp training ( #1462 )
2024-01-02 10:42:58 +08:00
fanqiNO1
3b639da1ef
[Docs] Fix typo ( #1385 )
2023-10-10 11:32:46 +08:00
fanqiNO1
bf30c444de
Update the version info ( #1383 )
2023-10-09 16:18:40 +08:00
Shu Liqiang
b8a31671a4
[Feature] Runner supports setting the number of iterations for per epoch ( #1292 )
2023-10-08 16:43:44 +08:00
takuoko
d617bcafdd
[Feature] Support Adafactor Optimizer ( #1361 )
2023-09-21 16:30:24 +08:00
Zaida Zhou
9b94af42b8
[Docs] Update the usage of bitsandbytes in Chinese document ( #1359 )
2023-09-15 23:14:33 +08:00
takuoko
e91bfa4593
[Feature] Support bitsandbytes ( #1357 )
2023-09-15 22:56:11 +08:00
Range King
9a19f8bc89
[Feature] Add AimVisBackend ( #1347 )
2023-09-09 11:39:48 +08:00
Range King
273fb2b333
[Feature] Add DVCLiveVisBackend ( #1336 )
2023-09-08 17:22:23 +08:00
Zaida Zhou
45ee96d0c4
[Docs] Add activation checkpointing usage ( #1341 )
2023-09-05 11:23:44 +08:00
Mashiro
19ab172b2d
Fix typos and documents of colossalai ( #1315 )
2023-08-22 16:13:55 +08:00
Mashiro
db32234241
[Feature] Add colossalai strategy ( #1299 )
2023-08-18 15:09:35 +08:00
Zaida Zhou
03ad86cfd2
[Docs] Add a image for neptune ( #1312 )
2023-08-18 10:48:55 +08:00
Theodore
43e308caaf
[Feature] Add NeptuneVisBackend ( #1311 )
2023-08-17 23:29:58 +08:00
Zaida Zhou
a54e814bf8
[Docs] Fix unused parameters ( #1288 )
2023-08-03 15:45:30 +08:00
Zaida Zhou
5ef75fd7a7
[Docs] Introduce how to customize distributed training settings ( #1279 )
2023-07-31 15:40:45 +08:00
youkaichao
66d828d8d3
[Enhancement] Rename fast_conv_bn_eval to efficient_conv_bn_eval ( #1251 )
2023-07-15 22:13:17 +08:00
youkaichao
40e49ff747
[Feature] Enable fast conv bn eval ( #1202 )
2023-07-14 18:21:55 +08:00
Zaida Zhou
33e30b7cb6
[Docs] how to train a large model ( #1228 )
2023-07-05 18:20:07 +08:00
Zaida Zhou
cf477d15a2
[Docs] Add the usage of clearml ( #1180 )
2023-06-01 21:54:30 +08:00
Zaida Zhou
4a9e379c1a
[Feature] Support Sophia optimizers ( #1170 )
2023-05-30 15:44:06 +08:00
Zaida Zhou
691500dce6
[Docs] Move the usage of distributed training to a single document ( #1171 )
2023-05-28 20:10:16 +08:00
Zaida Zhou
43165160e6
[Docs] Replace MMCls with MMPretrain in docs ( #1096 )
...
* [Docs] Replace MMCls with MMPretrain in docs
* fix format
2023-04-23 15:29:43 +08:00
luomaoling
5b9a1544b0
[Feature] Add torch_npu optimizer ( #1079 )
2023-04-21 15:15:10 +08:00
Zaida Zhou
9207e84aa0
[Docs] Introduce the use of wandb and tensorboard ( #912 )
...
* [Docs] Introduce the use of wandb and tensorboard
* fix link
* Update docs/en/common_usage/visualize_training_log.md
2023-04-11 12:31:05 +08:00
Qian Zhao
0d25625ba2
[Feature] Support torch.compile since PyTorch2.0 ( #976 )
...
* enable compile configurations to support torch.compile in Runner
* enable compilation in train, val and test
* fix as comments
* add docstring to illustrate usage
* minor refine error message
* add unittests
* fix ut skip
* add logging message to inform users
* compile `train_step`, `val_step`, `test_step` instead
* fix as comments
* revert to compile `train_step` only due to pt2 issue
* add documentation about torch.compile
2023-03-12 18:26:43 +08:00
Xin Li
6e58c0d24c
[Docs] Translate "how to set random seed" ( #930 )
...
* translate random_seed init
* try fix lint err
* update en/index.rst
* translate zhihu answer
* Update docs/en/common_usage/set_random_seed.md
* Update docs/en/common_usage/set_random_seed.md
* Update docs/en/common_usage/set_random_seed.md
Co-authored-by: Mashiro <57566630+HAOCHENYE@users.noreply.github.com>
* Update docs/en/common_usage/set_random_seed.md
Co-authored-by: Mashiro <57566630+HAOCHENYE@users.noreply.github.com>
* Update docs/en/common_usage/set_random_seed.md
Co-authored-by: Mashiro <57566630+HAOCHENYE@users.noreply.github.com>
* Update docs/en/common_usage/set_random_seed.md
Co-authored-by: Qian Zhao <112053249+C1rN09@users.noreply.github.com>
* Update docs/en/common_usage/set_random_seed.md
Co-authored-by: Qian Zhao <112053249+C1rN09@users.noreply.github.com>
* Update docs/en/common_usage/set_random_seed.md
Co-authored-by: Qian Zhao <112053249+C1rN09@users.noreply.github.com>
* Update docs/en/common_usage/set_random_seed.md
Co-authored-by: Qian Zhao <112053249+C1rN09@users.noreply.github.com>
* Update docs/en/common_usage/set_random_seed.md
Co-authored-by: Qian Zhao <112053249+C1rN09@users.noreply.github.com>
* reorg last part
* minor refine
* Update set_random_seed.md
* fix lint
---------
Co-authored-by: Zaida Zhou <58739961+zhouzaida@users.noreply.github.com>
Co-authored-by: Mashiro <57566630+HAOCHENYE@users.noreply.github.com>
Co-authored-by: Qian Zhao <112053249+C1rN09@users.noreply.github.com>
Co-authored-by: zhouzaida <zhouzaida@163.com>
2023-03-09 17:23:12 +08:00
Infinity_lee
54b6bbbd65
[Docs] Translate model analysis document to English ( #956 )
...
* add model analysis eng doc
* fix doc style
* add link in index.rst
* fix
* fix
* fix
* rename file
* fix
2023-03-05 21:03:26 +08:00
Qian Zhao
2ed8e343a0
[Feature] Enable bf16 in AmpOptimWrapper ( #960 )
...
* support bf16 in AmpOptimWrapper
* add docstring
* modify docs
* add unittests for bf16 in AmpOptimWrapper
* fix type
* fix to pass ci
* fix ut skip logic to pass ci
* fix as comment
* add type hints
* fix docstring and add warning information
* remove check for pytorch>=1.6 in unittest
* modify unittest
* modify unittest
* remove torch.float32 && torch.float64 from valid dtypes
* fix as comments
* minor refine docstring
* fix unittest parameterized to pass CI
* fix unittest && add back torch.float32, torch.float64
2023-03-01 21:35:18 +08:00
Infinity_lee
8a407ca214
[Docs] Translate "Debug Tricks" to English ( #953 )
...
* add english docs
* fix superlink
* fix
* fix
2023-03-01 19:29:04 +08:00
Yuetian Wang
53ae4fb2ec
[Docs] Add a document about setting interval ( #964 )
...
* Add a document about setting interval.
Add a document about Setting the Frequency of Logging, Weight Saving, and Validation.
* Update set_interval.md
Fixed a small bug in the hyperlink.
* Update set_interval.md
Fixed the wrong hyperlink
* Update with pre-commit.
* Update docs/en/common_usage/set_interval.md
Co-authored-by: Zaida Zhou <58739961+zhouzaida@users.noreply.github.com>
* Update docs/en/common_usage/set_interval.md
Co-authored-by: Zaida Zhou <58739961+zhouzaida@users.noreply.github.com>
* Update docs/en/common_usage/set_interval.md
Co-authored-by: Zaida Zhou <58739961+zhouzaida@users.noreply.github.com>
* Update docs/en/common_usage/set_interval.md
Co-authored-by: Zaida Zhou <58739961+zhouzaida@users.noreply.github.com>
* Update docs/en/common_usage/set_interval.md
Co-authored-by: Zaida Zhou <58739961+zhouzaida@users.noreply.github.com>
* Update docs/en/common_usage/set_interval.md
Co-authored-by: Zaida Zhou <58739961+zhouzaida@users.noreply.github.com>
* Update docs/en/common_usage/set_interval.md
Co-authored-by: Zaida Zhou <58739961+zhouzaida@users.noreply.github.com>
* Update docs/en/common_usage/set_interval.md
Co-authored-by: Zaida Zhou <58739961+zhouzaida@users.noreply.github.com>
* Update docs/en/common_usage/set_interval.md
Co-authored-by: Zaida Zhou <58739961+zhouzaida@users.noreply.github.com>
* Update docs/en/common_usage/set_interval.md
Co-authored-by: Zaida Zhou <58739961+zhouzaida@users.noreply.github.com>
* Update docs/en/common_usage/set_interval.md
Co-authored-by: Zaida Zhou <58739961+zhouzaida@users.noreply.github.com>
* Update docs/en/common_usage/set_interval.md
Co-authored-by: Zaida Zhou <58739961+zhouzaida@users.noreply.github.com>
* Update index.rst
Added: common_usage/set_interval.md
---------
Co-authored-by: YuetianW <wangyuetian@supermicro.dscvlab.com>
Co-authored-by: Zaida Zhou <58739961+zhouzaida@users.noreply.github.com>
2023-03-01 19:28:14 +08:00
Zaida Zhou
67acdbe245
[Docs] Add a document about debug tricks ( #938 )
...
* fix typo
* [Docs] Add debug skills
* minor fix
* refine
* rename debug_skills to debug_tricks
* refine
* Update docs/en/common_usage/debug_tricks.md
2023-02-21 21:40:35 +08:00
Mashiro
346989464c
[Docs] Add the document for the transition between IterBasedTraining and EpochBasedTraining ( #926 )
...
* Add epoch 2 iter
* Add epoch 2 iter
* Refine chinese docs
* Add example for training CIFAR10 by iter
* minor refine
* Fix as comment
* Fix as comment
* Refine description
* Fix as comment
* minor refine
* Refine description
Co-authored-by: Zaida Zhou <58739961+zhouzaida@users.noreply.github.com>
* Translate to en
* Adjust indent
---------
Co-authored-by: Zaida Zhou <58739961+zhouzaida@users.noreply.github.com>
2023-02-21 21:12:38 +08:00
Zaida Zhou
4749682885
[Docs] Move examples to a new directory ( #911 )
...
* [Docs] Move examples to a new directory
* fix typo
2023-02-07 17:03:35 +08:00