* add weighted soft labels loss function add weighted soft labels loss function * fix typo in docs/zh_CN/advanced_tutorials/knowledge_distillation.md