* Add Syriac Language support dictionary The Syriac Script is a Unicode block containing characters for all forms of the Syriac alphabet, including the Estrangela, Serto, Eastern Syriac, and the Christian Palestinian Aramaic variants. It is used in Literary Syriac, Neo-Aramaic, and Arabic among Syriac-speaking Christians. It was used historically to write Armenian, Persian, Ottoman Turkish, and Malayalam. The script, like Arabic and Hebrew is RTL. https://en.wikipedia.org/wiki/Syriac_(Unicode_block) https://en.wikipedia.org/wiki/Syriac_language * Add Syriac script support for training The Syriac Script is a Unicode block containing characters for all forms of the Syriac alphabet, including the Estrangela, Serto, Eastern Syriac, and the Christian Palestinian Aramaic variants. It is used in Literary Syriac, Neo-Aramaic, and Arabic among Syriac-speaking Christians. It was used historically to write Armenian, Persian, Ottoman Turkish, and Malayalam. The script, like Arabic and Hebrew is RTL. https://en.wikipedia.org/wiki/Syriac_(Unicode_block) https://en.wikipedia.org/wiki/Syriac_language |
||
---|---|---|
.. | ||
kie_dict | ||
layout_dict | ||
README.md | ||
ar_dict.txt | ||
arabic_dict.txt | ||
be_dict.txt | ||
bengali_dict.txt | ||
bg_dict.txt | ||
bm_dict.txt | ||
bm_dict_add.txt | ||
bn_dict.txt | ||
chinese_cht_dict.txt | ||
confuse.pkl | ||
cyrillic_dict.txt | ||
devanagari_dict.txt | ||
en_dict.txt | ||
fa_dict.txt | ||
french_dict.txt | ||
german_dict.txt | ||
gujarati_dict.txt | ||
hebrew_dict.txt | ||
hi_dict.txt | ||
it_dict.txt | ||
japan_dict.txt | ||
ka_dict.txt | ||
kazakh_dict.txt | ||
korean_dict.txt | ||
latex_ocr_tokenizer.json | ||
latex_symbol_dict.txt | ||
latin_dict.txt | ||
mr_dict.txt | ||
ne_dict.txt | ||
oc_dict.txt | ||
parseq_dict.txt | ||
pu_dict.txt | ||
rs_dict.txt | ||
rsc_dict.txt | ||
ru_dict.txt | ||
samaritan_dict.txt | ||
spin_dict.txt | ||
syriac_dict.txt | ||
ta_dict.txt | ||
table_dict.txt | ||
table_master_structure_dict.txt | ||
table_structure_dict.txt | ||
table_structure_dict_ch.txt | ||
te_dict.txt | ||
ug_dict.txt | ||
uk_dict.txt | ||
ur_dict.txt | ||
vi_dict.txt | ||
xi_dict.txt |
README.md
Dictionary and Corpus
Dictionary files (usually character level vocabulary) are included here for easier configuration. Corpus contributed by OSS contirbutors are listed here, please respect copyrights when using them at your own risk.
- Burmese corpus: https://github.com/1chimaruGin/BurmeseCorpus