Object-Contextual Representations for Semantic Segmentation
Introduction
@article{YuanW18,
  title={OCNet: Object Context Network for Scene Parsing},
  author={Yuhui Yuan and Jingdong Wang},
  journal={arXiv preprint arXiv:1809.00916},
  year={2018}
}

@inproceedings{YuanCW20,
  title={Object-Contextual Representations for Semantic Segmentation},
  author={Yuhui Yuan and Xilin Chen and Jingdong Wang},
  booktitle={ECCV},
  year={2020}
}
Results and models
Cityscapes
HRNet backbone
Method | Backbone | Crop Size | Lr schd | Mem (GB) | Inf time (fps) | mIoU | mIoU(ms+flip) | download |
---|---|---|---|---|---|---|---|---|
OCRNet | HRNetV2p-W18-Small | 512x1024 | 40000 | 3.5 | 10.45 | 74.30 | 75.95 | model \| log |
OCRNet | HRNetV2p-W18 | 512x1024 | 40000 | 4.7 | 7.50 | 77.72 | 79.49 | model \| log |
OCRNet | HRNetV2p-W48 | 512x1024 | 40000 | 8 | 4.22 | 80.58 | 81.79 | model \| log |
OCRNet | HRNetV2p-W18-Small | 512x1024 | 80000 | - | - | 77.16 | 78.66 | model \| log |
OCRNet | HRNetV2p-W18 | 512x1024 | 80000 | - | - | 78.57 | 80.46 | model \| log |
OCRNet | HRNetV2p-W48 | 512x1024 | 80000 | - | - | 80.70 | 81.87 | model \| log |
OCRNet | HRNetV2p-W18-Small | 512x1024 | 160000 | - | - | 78.45 | 79.97 | model \| log |
OCRNet | HRNetV2p-W18 | 512x1024 | 160000 | - | - | 79.47 | 80.91 | model \| log |
OCRNet | HRNetV2p-W48 | 512x1024 | 160000 | - | - | 81.35 | 82.70 | model \| log |
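
For readers unfamiliar with the mIoU column, the following is a minimal pure-Python sketch of the usual mean intersection-over-union computation on flattened label maps. It is an illustration only, not the evaluation code that produced the numbers in these tables. The function name `mean_iou`, the `ignore_index` default of 255, and the choice to skip classes that appear in neither prediction nor ground truth are assumptions made for this example.

```python
from typing import Sequence


def mean_iou(pred: Sequence[int], target: Sequence[int],
             num_classes: int, ignore_index: int = 255) -> float:
    """Return mIoU in percent for two flattened label sequences.

    Hypothetical helper for illustration; `ignore_index=255` is an assumption.
    """
    intersect = [0] * num_classes    # pixels where prediction and label agree
    pred_area = [0] * num_classes    # pixels predicted as each class
    target_area = [0] * num_classes  # pixels labelled as each class

    for p, t in zip(pred, target):
        if t == ignore_index:
            continue  # unlabelled pixels do not count toward any class
        pred_area[p] += 1
        target_area[t] += 1
        if p == t:
            intersect[t] += 1

    ious = []
    for c in range(num_classes):
        union = pred_area[c] + target_area[c] - intersect[c]
        if union > 0:  # skip classes absent from both prediction and label
            ious.append(intersect[c] / union)

    return 100.0 * sum(ious) / len(ious) if ious else float("nan")


if __name__ == "__main__":
    # Two classes, one ignored pixel: each class has IoU 1/2.
    print(mean_iou([0, 0, 1, 1], [0, 1, 1, 255], num_classes=2))  # 50.0
```
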
ResNet backbone
Method | Backbone | Crop Size | Batch Size | Lr schd | Mem (GB) | Inf time (fps) | mIoU | mIoU(ms+flip) | download |
---|---|---|---|---|---|---|---|---|---|
OCRNet | R-101-D8 | 512x1024 | 8 | 40000 | - | - | 80.09 | - | model \| log |
OCRNet | R-101-D8 | 512x1024 | 16 | 40000 | 8.8 | 3.02 | 80.30 | - | model \| log |
OCRNet | R-101-D8 | 512x1024 | 16 | 80000 | 8.8 | 3.02 | 80.81 | - | model \| log |
ADE20K
Method | Backbone | Crop Size | Lr schd | Mem (GB) | Inf time (fps) | mIoU | mIoU(ms+flip) | download |
---|---|---|---|---|---|---|---|---|
OCRNet | HRNetV2p-W18-Small | 512x512 | 80000 | 6.7 | 28.98 | 35.06 | 35.80 | model \| log |
OCRNet | HRNetV2p-W18 | 512x512 | 80000 | 7.9 | 18.93 | 37.79 | 39.16 | model \| log |
OCRNet | HRNetV2p-W48 | 512x512 | 80000 | 11.2 | 16.99 | 43.00 | 44.30 | model \| log |
OCRNet | HRNetV2p-W18-Small | 512x512 | 160000 | - | - | 37.19 | 38.40 | model \| log |
OCRNet | HRNetV2p-W18 | 512x512 | 160000 | - | - | 39.32 | 40.80 | model \| log |
OCRNet | HRNetV2p-W48 | 512x512 | 160000 | - | - | 43.25 | 44.88 | model \| log |
Pascal VOC 2012 + Aug
Method | Backbone | Crop Size | Lr schd | Mem (GB) | Inf time (fps) | mIoU | mIoU(ms+flip) | download |
---|---|---|---|---|---|---|---|---|
OCRNet | HRNetV2p-W18-Small | 512x512 | 20000 | 3.5 | 31.55 | 71.70 | 73.84 | model \| log |
OCRNet | HRNetV2p-W18 | 512x512 | 20000 | 4.7 | 19.91 | 74.75 | 77.11 | model \| log |
OCRNet | HRNetV2p-W48 | 512x512 | 20000 | 8.1 | 17.83 | 77.72 | 79.87 | model \| log |
OCRNet | HRNetV2p-W18-Small | 512x512 | 40000 | - | - | 72.76 | 74.60 | model \| log |
OCRNet | HRNetV2p-W18 | 512x512 | 40000 | - | - | 74.98 | 77.40 | model \| log |
OCRNet | HRNetV2p-W48 | 512x512 | 40000 | - | - | 77.14 | 79.71 | model \| log |
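
The config files in this directory use names such as `ocrnet_hr18s_512x512_20k_voc12aug.py` and `ocrnet_r101-d8_512x1024_40k_b16_cityscapes.py`, which appear to encode the method, backbone, crop size, schedule, optional batch size, and dataset shown in the tables above. The sketch below splits such a name into those fields. The pattern is inferred from the file names only, and `parse_config_name` is a hypothetical helper for this example, not part of any library.

```python
import re
from typing import Optional

# Pattern inferred from the config names in this directory (an assumption,
# not an official naming specification).
CONFIG_NAME = re.compile(
    r"^(?P<method>[a-z0-9]+)_"
    r"(?P<backbone>[a-z0-9-]+)_"
    r"(?P<crop_0>\d+)x(?P<crop_1>\d+)_"
    r"(?P<iters_k>\d+)k_"
    r"(?:b(?P<batch_size>\d+)_)?"  # optional tag such as "b16"
    r"(?P<dataset>[a-z0-9]+)\.py$"
)


def parse_config_name(name: str) -> Optional[dict]:
    """Split a config file name into fields; return None if it does not match."""
    match = CONFIG_NAME.match(name)
    if match is None:
        return None
    fields = match.groupdict()
    return {
        "method": fields["method"],
        "backbone": fields["backbone"],
        # Crop size in the order it is written in the file name.
        "crop_size": (int(fields["crop_0"]), int(fields["crop_1"])),
        # "40k" in the name corresponds to 40000 in the "Lr schd" column.
        "iterations": int(fields["iters_k"]) * 1000,
        # None when the name carries no explicit batch-size tag.
        "batch_size": int(fields["batch_size"]) if fields["batch_size"] else None,
        "dataset": fields["dataset"],
    }


if __name__ == "__main__":
    print(parse_config_name("ocrnet_r101-d8_512x1024_40k_b16_cityscapes.py"))
    print(parse_config_name("ocrnet_hr18s_512x512_20k_voc12aug.py"))
```

A name that does not follow the pattern returns `None`, so the helper can be used to filter a directory listing before grouping configs by dataset or schedule.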