Commit Graph

157 Commits (ab087065e9c1fade7a1dd1f13760f622d0a0ed5c)

Author SHA1 Message Date
Tingquan Gao ab087065e9
support to specify rank to log when using Fleet API (#3039)
* support to specify rank to log when using Fleet API

* log max mem reserved

* log_ranks support str type

example: -o Global.log_ranks="0,1"

* log max mem allocated

* support to specify rank to log in static mode

* log max mem reserved and max mem allocated in static mode
2023-11-16 11:32:29 +08:00
zhangyubo0722 aae1e9543f
del load pretrained from url for resnet (#2997)
* del load pretrained from url for resnet

* del load_dygraph_pretrain_from_url

* del load_dygraph_pretrain_from_url

* modify save_load
2023-10-30 13:44:16 +08:00
zhangyubo0722 74a33b7f50
fix gbk (#2941) 2023-09-01 17:49:33 +08:00
zhangyubo0722 f3b2b2f4ad
[uapi]Save predict result (#2926)
* sava predict result
2023-08-29 14:32:07 +08:00
baocheny 75a5bb17ba add 2 more custom devices intel_gpu and apple mps 2023-06-29 19:42:38 +08:00
Bobholamovic de5c4e1b1c Change vdl dir 2023-06-26 14:20:38 +08:00
Bobholamovic d6137854e2 Accommodate UAPI 2023-06-26 14:20:38 +08:00
gaotingquan 8405882f11 debug 2023-05-29 19:52:09 +08:00
gaotingquan 2d8346cd3b fix _init_amp when export 2023-05-29 19:52:09 +08:00
gaotingquan 14d06fb6bd support AMP.use_amp arg 2023-05-25 16:16:02 +08:00
gaotingquan 8b218b01ac refactor amp auto_cast context manager & loss scaler 2023-05-25 11:58:05 +08:00
gaotingquan f884f28853 refactor amp 2023-05-25 11:58:05 +08:00
gaotingquan b3678234fe fix bug when update_freq > iter_per_epoch 2023-05-17 15:19:13 +08:00
gaotingquan 9f621279b8 fix infer output 2023-04-17 20:28:40 +08:00
parap1uie-s 52f16cc85d Update engine.py 2023-04-11 19:23:57 +08:00
parap1uie-s 6e6586f59b Fixed the incorrect infer outputs 2023-04-11 19:23:57 +08:00
Yang Nie 5f2eaa7cb1 bugfix: set_epoch after reume 2023-04-06 15:33:30 +08:00
gaotingquan f37cb543b1 rm op black list in amp
the op flatten_contiguous_range and greater_than has supported amp mode since paddle 2.4
2023-03-29 14:57:02 +08:00
gaotingquan a7ba6eabd2 optimizer must be decorated when training with AMPO2 2023-03-28 18:42:26 +08:00
Tingquan Gao 5d06a88a36 Revert "refactor: simplify engine"
This reverts commit 376d83d46e.
2023-03-14 16:47:13 +08:00
Tingquan Gao 6aabb94d8c Revert "refactor: add ClassModel to unify model forward interface"
This reverts commit 75a20ba557.
2023-03-14 16:47:13 +08:00
Tingquan Gao e7e4f68b5c Revert "refactor: build_train_func & build_eval_func"
This reverts commit 6bed0f5707.
2023-03-14 16:47:13 +08:00
Tingquan Gao f2fc43baeb Revert "refactor: mv all dataloaders to engine.dataloader_dict"
This reverts commit 284e2a6756.
2023-03-14 16:47:13 +08:00
Tingquan Gao a1e840e0da Revert "refactor: iter_per_epoch -> max_iter"
This reverts commit a38e42f644.
2023-03-14 16:47:13 +08:00
Tingquan Gao 0efda2c75e Revert "refactor: simpfy engine.train()"
This reverts commit fad5c8e348.
2023-03-14 16:47:13 +08:00
Tingquan Gao 5a4ee1aec4 Revert "refactor"
This reverts commit 0e28a39da3.
2023-03-14 16:47:13 +08:00
Tingquan Gao f42719afbb Revert "replace the arg engine with config"
This reverts commit f525cea006.
2023-03-14 16:47:13 +08:00
Tingquan Gao 7243f1429b Revert "rm codes for compatibility with old version"
This reverts commit 6e77bd6cd5.
2023-03-14 16:47:13 +08:00
Tingquan Gao 56e8c5a992 Revert "mv model_saver to __init__()"
This reverts commit 0d7e595fc7.
2023-03-14 16:47:13 +08:00
Tingquan Gao f1a7a22a34 Revert "mv some attrs to __init__()"
This reverts commit 73e2cde617.
2023-03-14 16:47:13 +08:00
Tingquan Gao 915dde176a Revert "refactor: rm train and eval from engine"
This reverts commit 5a6fe171a7.
2023-03-14 16:47:13 +08:00
Tingquan Gao aa52682c55 Revert "rm amp code from train and eval & use decorator for amp training"
This reverts commit d3941dc1e9.
2023-03-14 16:47:13 +08:00
Tingquan Gao 85e200edb6 Revert "refactor"
This reverts commit 32593b6375.
2023-03-14 16:47:13 +08:00
Tingquan Gao 8002ccf4b6 Revert "support ShiTu"
This reverts commit 9beb154bc3.
2023-03-14 16:47:13 +08:00
Tingquan Gao b47fa5f50e Revert "debug"
This reverts commit 58daf805a9.
2023-03-14 16:47:13 +08:00
Tingquan Gao 578054dddd Revert "debug for infer"
This reverts commit 428edb6ff8.
2023-03-14 16:47:13 +08:00
Tingquan Gao 0055ca2ffe Revert "debug"
This reverts commit 9e683d0d69.
2023-03-14 16:47:13 +08:00
gaotingquan 9e683d0d69 debug 2023-03-10 16:56:55 +08:00
gaotingquan 428edb6ff8 debug for infer 2023-03-10 16:56:55 +08:00
gaotingquan 58daf805a9 debug 2023-03-10 16:56:55 +08:00
gaotingquan 9beb154bc3 support ShiTu 2023-03-10 16:56:55 +08:00
gaotingquan 32593b6375 refactor 2023-03-10 16:56:55 +08:00
gaotingquan d3941dc1e9 rm amp code from train and eval & use decorator for amp training 2023-03-10 16:56:55 +08:00
gaotingquan 5a6fe171a7 refactor: rm train and eval from engine 2023-03-10 16:56:55 +08:00
gaotingquan 73e2cde617 mv some attrs to __init__() 2023-03-10 16:56:55 +08:00
gaotingquan 0d7e595fc7 mv model_saver to __init__() 2023-03-10 16:56:55 +08:00
gaotingquan 6e77bd6cd5 rm codes for compatibility with old version 2023-03-10 16:56:55 +08:00
gaotingquan f525cea006 replace the arg engine with config 2023-03-10 16:56:55 +08:00
gaotingquan 0e28a39da3 refactor 2023-03-10 16:56:55 +08:00
gaotingquan fad5c8e348 refactor: simpfy engine.train()
1. ModelSaver();
2. _build_ema_model();
3. _init_checkpoints();
4. others.
2023-03-10 16:56:55 +08:00