Tingquan Gao
ab087065e9
support to specify rank to log when using Fleet API ( #3039 )
...
* support to specify rank to log when using Fleet API
* log max mem reserved
* log_ranks support str type
example: -o Global.log_ranks="0,1"
* log max mem allocated
* support to specify rank to log in static mode
* log max mem reserved and max mem allocated in static mode
2023-11-16 11:32:29 +08:00
zhangyubo0722
aae1e9543f
del load pretrained from url for resnet ( #2997 )
...
* del load pretrained from url for resnet
* del load_dygraph_pretrain_from_url
* del load_dygraph_pretrain_from_url
* modify save_load
2023-10-30 13:44:16 +08:00
zhangyubo0722
74a33b7f50
fix gbk ( #2941 )
2023-09-01 17:49:33 +08:00
zhangyubo0722
f3b2b2f4ad
[uapi]Save predict result ( #2926 )
...
* sava predict result
2023-08-29 14:32:07 +08:00
baocheny
75a5bb17ba
add 2 more custom devices intel_gpu and apple mps
2023-06-29 19:42:38 +08:00
Bobholamovic
de5c4e1b1c
Change vdl dir
2023-06-26 14:20:38 +08:00
Bobholamovic
d6137854e2
Accommodate UAPI
2023-06-26 14:20:38 +08:00
gaotingquan
8405882f11
debug
2023-05-29 19:52:09 +08:00
gaotingquan
2d8346cd3b
fix _init_amp when export
2023-05-29 19:52:09 +08:00
gaotingquan
14d06fb6bd
support AMP.use_amp arg
2023-05-25 16:16:02 +08:00
gaotingquan
8b218b01ac
refactor amp auto_cast context manager & loss scaler
2023-05-25 11:58:05 +08:00
gaotingquan
f884f28853
refactor amp
2023-05-25 11:58:05 +08:00
gaotingquan
b3678234fe
fix bug when update_freq > iter_per_epoch
2023-05-17 15:19:13 +08:00
gaotingquan
9f621279b8
fix infer output
2023-04-17 20:28:40 +08:00
parap1uie-s
52f16cc85d
Update engine.py
2023-04-11 19:23:57 +08:00
parap1uie-s
6e6586f59b
Fixed the incorrect infer outputs
2023-04-11 19:23:57 +08:00
Yang Nie
5f2eaa7cb1
bugfix: set_epoch after reume
2023-04-06 15:33:30 +08:00
gaotingquan
f37cb543b1
rm op black list in amp
...
the op flatten_contiguous_range and greater_than has supported amp mode since paddle 2.4
2023-03-29 14:57:02 +08:00
gaotingquan
a7ba6eabd2
optimizer must be decorated when training with AMPO2
2023-03-28 18:42:26 +08:00
Tingquan Gao
5d06a88a36
Revert "refactor: simplify engine"
...
This reverts commit 376d83d46e
.
2023-03-14 16:47:13 +08:00
Tingquan Gao
6aabb94d8c
Revert "refactor: add ClassModel to unify model forward interface"
...
This reverts commit 75a20ba557
.
2023-03-14 16:47:13 +08:00
Tingquan Gao
e7e4f68b5c
Revert "refactor: build_train_func & build_eval_func"
...
This reverts commit 6bed0f5707
.
2023-03-14 16:47:13 +08:00
Tingquan Gao
f2fc43baeb
Revert "refactor: mv all dataloaders to engine.dataloader_dict"
...
This reverts commit 284e2a6756
.
2023-03-14 16:47:13 +08:00
Tingquan Gao
a1e840e0da
Revert "refactor: iter_per_epoch -> max_iter"
...
This reverts commit a38e42f644
.
2023-03-14 16:47:13 +08:00
Tingquan Gao
0efda2c75e
Revert "refactor: simpfy engine.train()"
...
This reverts commit fad5c8e348
.
2023-03-14 16:47:13 +08:00
Tingquan Gao
5a4ee1aec4
Revert "refactor"
...
This reverts commit 0e28a39da3
.
2023-03-14 16:47:13 +08:00
Tingquan Gao
f42719afbb
Revert "replace the arg engine with config"
...
This reverts commit f525cea006
.
2023-03-14 16:47:13 +08:00
Tingquan Gao
7243f1429b
Revert "rm codes for compatibility with old version"
...
This reverts commit 6e77bd6cd5
.
2023-03-14 16:47:13 +08:00
Tingquan Gao
56e8c5a992
Revert "mv model_saver to __init__()"
...
This reverts commit 0d7e595fc7
.
2023-03-14 16:47:13 +08:00
Tingquan Gao
f1a7a22a34
Revert "mv some attrs to __init__()"
...
This reverts commit 73e2cde617
.
2023-03-14 16:47:13 +08:00
Tingquan Gao
915dde176a
Revert "refactor: rm train and eval from engine"
...
This reverts commit 5a6fe171a7
.
2023-03-14 16:47:13 +08:00
Tingquan Gao
aa52682c55
Revert "rm amp code from train and eval & use decorator for amp training"
...
This reverts commit d3941dc1e9
.
2023-03-14 16:47:13 +08:00
Tingquan Gao
85e200edb6
Revert "refactor"
...
This reverts commit 32593b6375
.
2023-03-14 16:47:13 +08:00
Tingquan Gao
8002ccf4b6
Revert "support ShiTu"
...
This reverts commit 9beb154bc3
.
2023-03-14 16:47:13 +08:00
Tingquan Gao
b47fa5f50e
Revert "debug"
...
This reverts commit 58daf805a9
.
2023-03-14 16:47:13 +08:00
Tingquan Gao
578054dddd
Revert "debug for infer"
...
This reverts commit 428edb6ff8
.
2023-03-14 16:47:13 +08:00
Tingquan Gao
0055ca2ffe
Revert "debug"
...
This reverts commit 9e683d0d69
.
2023-03-14 16:47:13 +08:00
gaotingquan
9e683d0d69
debug
2023-03-10 16:56:55 +08:00
gaotingquan
428edb6ff8
debug for infer
2023-03-10 16:56:55 +08:00
gaotingquan
58daf805a9
debug
2023-03-10 16:56:55 +08:00
gaotingquan
9beb154bc3
support ShiTu
2023-03-10 16:56:55 +08:00
gaotingquan
32593b6375
refactor
2023-03-10 16:56:55 +08:00
gaotingquan
d3941dc1e9
rm amp code from train and eval & use decorator for amp training
2023-03-10 16:56:55 +08:00
gaotingquan
5a6fe171a7
refactor: rm train and eval from engine
2023-03-10 16:56:55 +08:00
gaotingquan
73e2cde617
mv some attrs to __init__()
2023-03-10 16:56:55 +08:00
gaotingquan
0d7e595fc7
mv model_saver to __init__()
2023-03-10 16:56:55 +08:00
gaotingquan
6e77bd6cd5
rm codes for compatibility with old version
2023-03-10 16:56:55 +08:00
gaotingquan
f525cea006
replace the arg engine with config
2023-03-10 16:56:55 +08:00
gaotingquan
0e28a39da3
refactor
2023-03-10 16:56:55 +08:00
gaotingquan
fad5c8e348
refactor: simpfy engine.train()
...
1. ModelSaver();
2. _build_ema_model();
3. _init_checkpoints();
4. others.
2023-03-10 16:56:55 +08:00