22 Commits (ada310811a8c8769559827ed2ceee9710b5d7423)

Author SHA1 Message Date
johnlockejrr ada310811a
Add Syriac script support (#13800)
* Add Syriac Language support dictionary

Syriac is a Unicode block containing characters for all forms of the Syriac alphabet, including the Estrangela, Serto, Eastern Syriac, and Christian Palestinian Aramaic variants. The script is used to write Literary Syriac, Neo-Aramaic, and Arabic among Syriac-speaking Christians. It was used historically to write Armenian, Persian, Ottoman Turkish, and Malayalam. Like Arabic and Hebrew, the script is written right-to-left (RTL).

https://en.wikipedia.org/wiki/Syriac_(Unicode_block)
https://en.wikipedia.org/wiki/Syriac_language

* Add Syriac script support for training

Syriac is a Unicode block containing characters for all forms of the Syriac alphabet, including the Estrangela, Serto, Eastern Syriac, and Christian Palestinian Aramaic variants. The script is used to write Literary Syriac, Neo-Aramaic, and Arabic among Syriac-speaking Christians. It was used historically to write Armenian, Persian, Ottoman Turkish, and Malayalam. Like Arabic and Hebrew, the script is written right-to-left (RTL).

https://en.wikipedia.org/wiki/Syriac_(Unicode_block)
https://en.wikipedia.org/wiki/Syriac_language
2024-09-01 20:10:42 +08:00
johnlockejrr 6225a90ef0
Add support for Hebrew Language and Alphabet (#13797)
* Add Hebrew language support for training

https://en.wikipedia.org/wiki/Unicode_and_HTML_for_the_Hebrew_alphabet

* Add Hebrew language dictionary

https://en.wikipedia.org/wiki/Unicode_and_HTML_for_the_Hebrew_alphabet

* Add Samaritan Script dictionary

Samaritan script is written right-to-left (RTL), like Arabic and Hebrew. It is used for Samaritan Hebrew and Samaritan Aramaic, and some texts also contain Arabic letters.

https://en.wikipedia.org/wiki/Samaritan_(Unicode_block)
https://en.wikipedia.org/wiki/Samaritan_Hebrew
https://en.wikipedia.org/wiki/Samaritan_Aramaic_language

* Add Samaritan Script training

Samaritan script is written right-to-left (RTL), like Arabic and Hebrew. It is used for Samaritan Hebrew and Samaritan Aramaic, and some texts also contain Arabic letters.

https://en.wikipedia.org/wiki/Samaritan_(Unicode_block)
https://en.wikipedia.org/wiki/Samaritan_Hebrew
https://en.wikipedia.org/wiki/Samaritan_Aramaic_language

* Update hebrew_dict.txt
2024-09-01 09:18:37 +08:00
jzhang533 24f06d1a1b
update common pre-commit configs and commit the results of running pre-commit run -a (#12516)
2024-05-29 15:26:09 +08:00
Wang Xin 045e5f6ac7
add pre-commit workflow (#11973)
* add pre-commit workflow

* run 'pre-commit run --all-files'

* setup python version
2024-04-21 21:46:20 +08:00
tink2123 380dc6c27d rm rec_char_type 2021-10-12 14:29:00 +08:00
tink2123 a0e6e83397 fix some typos 2021-04-14 16:00:45 +08:00
tink2123 94474e40fc polish multi-lang doc and whl 2021-04-13 19:30:08 +08:00
xmy0916 d31ba7cc1b fix type error 2021-03-03 11:09:20 +08:00
tink2123 06a434bcf9 fix yml 2021-01-26 15:57:37 +08:00
tink2123 edeb12b1e0 rename en_sensitive EN_symbol 2021-01-26 15:53:49 +08:00
tink2123 8f52a73718 polish code 2021-01-26 15:24:13 +08:00
xmy0916 cafea5dcbe fix char dict 2021-01-20 20:06:07 +08:00
xmy0916 73edec1620 add copyright 2021-01-20 19:06:39 +08:00
xmy0916 fe694ae902 fix bugs 2021-01-20 15:16:37 +08:00
xmy0916 a301e05e1d fix bugs 2021-01-20 13:07:35 +08:00
xmy0916 d3c50fda3c fix bugs 2021-01-20 12:08:57 +08:00
xmy0916 46ac85ad8a fix some problems 2021-01-19 23:46:35 +08:00
xmy0916 141d50d670 add multi language config file imgs and dict 2021-01-19 15:52:04 +08:00
tink2123 7eeef5933c update multi dic and export 2020-12-09 11:56:37 +00:00
tink2123 311569b2bc update for multi-language 2020-12-08 19:09:03 +08:00
tink2123 dd0f8c1d89 update for multi-language 2020-12-08 19:07:39 +08:00
xiaoting 8a5566c974
add multi lang yml and dict (#1312)
* add multi lang yml and dict

* update yml
2020-12-08 13:32:17 +08:00