diff --git a/doc/doc_en/knowledge_distillation_en.md b/doc/doc_en/knowledge_distillation_en.md index 52725e5c05..4e1f4e78cb 100755 --- a/doc/doc_en/knowledge_distillation_en.md +++ b/doc/doc_en/knowledge_distillation_en.md @@ -228,7 +228,6 @@ Architecture: enc_dim: 512 max_text_length: *max_text_length ``` -``` When the model is finally trained, it contains 3 sub-networks: `Teacher`, `Student`, `Student2`.