21 Commits

Author SHA1 Message Date
Hongbin Sun
4882c8a317
dataset preparation docs (#255) 2021-06-01 21:59:40 +08:00
Hongbin Sun
11768d02d4
update dataset doc (#204) 2021-05-18 11:43:29 +08:00
hq_wei
24c590bb04
Ner task (#148)
* update ner standard code format

* add pytest

* fix pre-commit

* Annotate the dataset section

* fix pre-commit for dataset

* rm big files and add comments in dataset

* rename configs for ner task

* minor changes if metric

* Note modification

* fix pre-commit

* detail modification

* rm transform

* rm magic number

* fix warnings in pylint

* fix pre-commit

* correct help info

* rename model files

* rename err fixed

* 428_tag

* Adjust to more general pipline

* update unit test rate

* update

* Unit test coverage over 90% and add Readme

* modify details

* fix precommit

* update

* fix pre-commit

* update

* update

* update

* update result

* update readme

* update baseline config

* update config and small minor changes

* minor changes in readme and etc.

* back to original

* update toy config

* upload model and log

* fix pytest

* Modify the notes.

* fix readme

* Delete Chinese punctuation

* add demo and fix some logic and naming problems

* add To_tensor transformer for ner and load pretrained model in config

* delete extra lines

* split ner loss to MaskedCrossEntropyLoss and MaskedFocalLoss

* update config

* fix err

* updata

* modify noqa

* update new model report

* fix err in ner demo

* Update ner_dataset.py

* Update test_ner_dataset.py

* Update ner_dataset.py

* Update ner_transforms.py

* rm toy config and data

* add comment

* add empty

* fix conflict

* fix precommit

* fix pytest

* fix pytest err

* Update ner_dataset.py

* change dataset name to cluener2020

* move the postprocess in metric to convertor

* rm __init__ etc.

* precommit

* add discription in loss

* add auto download

* add http

* update

* remove some 'issert'

* replace unsqueeze

* update config

* update doc and bert.py

* update

* update demo code

Co-authored-by: weihuaqiang <weihuaqiang@sensetime.com>
Co-authored-by: Hongbin Sun <hongbin306@gmail.com>
2021-05-18 11:33:51 +08:00
Hongbin Sun
360082635b
fix #93: update ic17 download url (#94) 2021-04-19 13:29:36 +08:00
Hongbin Sun
1a129a1e98
add svt converter (#65)
* add svt converter

* fix str fmt

* fix str fmt

* update convert script
2021-04-14 18:33:14 +08:00
Hongbin Sun
344cc9a38d
Add unittest for kie (#56)
* add unittest for sdmgr

* fix #36: update datasets.md

* unittest pass
2021-04-13 13:58:31 +08:00
Hongbin Sun
1176b485fa
fix name mistake 2021-04-10 17:10:50 +08:00
lizz
87ccebcd91
Update datasets.md 2021-04-10 00:28:12 +08:00
lizz
8097113e17
Update datasets.md 2021-04-10 00:27:17 +08:00
Hongbin Sun
bb51a41bcd
rename to datasets 2021-04-09 23:50:33 +08:00
Hongbin Sun
b3aaa27c6f
rename datasets.md 2021-04-09 21:08:19 +08:00
Hongbin Sun
cb6a0d5c99
update docs 2021-04-08 18:05:46 +08:00
lizz
0206781bb6
Fix readthedocs generation (#14)
* Fix readthedocs

* Fix doc

* md format
2021-04-08 15:49:46 +08:00
Hongbin Sun
47f5906f0a
remove markdown toc (#3)
* remove toc from md

* remove toc
2021-04-08 01:12:01 +08:00
Hongbin Sun
0f00378f9a
Feature/iss 33 (#34)
* fix #33: update dataset.md

* fix #33: pytest for transformer related
2021-04-05 23:54:57 +08:00
lizz
9d62bdf84c
Format readme (#23)
* Format readme

Signed-off-by: lizz <lizz@sensetime.com>

* try

Signed-off-by: lizz <lizz@sensetime.com>

* Remove redudant config link

Signed-off-by: lizz <lizz@sensetime.com>
2021-04-05 16:06:06 +08:00
Hongbin Sun
a347a97c23
fix #13: update docs with toc 2021-04-04 11:56:14 +08:00
Hongbin Sun
dd120271ba
fix #11: update docs (#12)
* fix #11: update docs

* fix #11: update datasets.md for kie
2021-04-04 00:19:55 +08:00
quincylin1
8476c239b2
[feature]: dbnet and docs 2021-04-03 01:21:33 +08:00
yuexy
72135ed552
Revert "documentation and dbnet related code" 2021-04-03 01:00:26 +08:00
quincylin1
b031934129
documentation and dbnet related code 2021-04-03 00:41:23 +08:00