gaotingquan 73f4d8e4ce to avoid cause issues for unset no_weight_decay models.
there seems be a diff for optimizer about using [] and [{"params":}, {"params":}] params
2023-04-12 20:55:38 +08:00
..
2023-03-14 16:47:13 +08:00
2023-02-28 15:01:21 +08:00