* [Improve] Use PyTorch official `scaled_dot_product_attention` to accelerate `MultiheadAttention`. * Support `--local-rank` and `--amp` option for new version PyTorch. * Fix imports and UT. |
||
---|---|---|
.. | ||
docker | ||
config.yml | ||
test.yml |
* [Improve] Use PyTorch official `scaled_dot_product_attention` to accelerate `MultiheadAttention`. * Support `--local-rank` and `--amp` option for new version PyTorch. * Fix imports and UT. |
||
---|---|---|
.. | ||
docker | ||
config.yml | ||
test.yml |