# File: mmocr/configs/_base_/recog_datasets/MJ_train.py

# Text Recognition Training set, including:
# Synthetic Datasets: Syn90k

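# Root directory of the Syn90k (MJSynth) training data.
train_root = 'data/mixture/Syn90k'

# Image filenames in the annotations are resolved relative to this prefix.
train_img_prefix = f'{train_root}/mnt/ramdisk/max/90kDICT32px'
# Annotations are stored as an LMDB database.
train_ann_file = f'{train_root}/label.lmdb'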

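train = dict(
    type='OCRDataset',
    img_prefix=train_img_prefix,
    ann_file=train_ann_file,
    loader=dict(
        type='AnnFileLoader',
        # Use the dataset once per pass (no repetition).
        repeat=1,
        file_format='lmdb',
        # Each annotation line is split on `separator`; the fields at
        # positions `keys_idx` are stored under the names in `keys`.
        parser=dict(
            type='LineStrParser',
            keys=['filename', 'text'],
            keys_idx=[0, 1],
            separator=' ')),
    # Left unset here; the config that consumes `train_list` is expected
    # to supply the data pipeline.
    pipeline=None,
    test_mode=False)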

train_list = [train]
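
# The parser settings above (keys, keys_idx, separator) describe how one
# annotation line becomes a record. The sketch below illustrates that mapping
# in plain Python. It is not MMOCR's LineStrParser implementation: the helper
# name `parse_line` and the sample annotation line are hypothetical and serve
# only to show what the three settings mean.

```python
def parse_line(line, keys=('filename', 'text'), keys_idx=(0, 1), separator=' '):
    """Hypothetical helper: split one annotation line into a dict.

    Mirrors the meaning of the `keys`, `keys_idx` and `separator` settings
    in the config; it is an illustration, not the library's parser.
    """
    parts = line.strip().split(separator)
    if len(parts) <= max(keys_idx):
        raise ValueError(f'expected at least {max(keys_idx) + 1} fields, '
                         f'got {len(parts)}: {line!r}')
    # Pick the field at each index and store it under the matching key.
    return {key: parts[idx] for key, idx in zip(keys, keys_idx)}


# Made-up annotation line in "<filename> <text>" form.
sample = '1/2/373_coley_14845.jpg coley'
record = parse_line(sample)
print(record)
```

# With img_prefix set as in the config, the image path for such a record would
# be the prefix joined with record['filename'].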