Object-Contextual Representations for Semantic Segmentation
Introduction
@article{yuan2019ocr,
title={Object-Contextual Representations for Semantic Segmentation},
author={Yuan, Yuhui and Chen, Xilin and Wang, Jingdong},
journal={arXiv preprint arXiv:1909.11065},
year={2019}
}
Results and models
Cityscapes
| Method | Backbone | Crop Size | Lr schd | Mem (GB) | Inf time (fps) | mIoU | mIoU(ms+flip) | download |
|--------|----------|-----------|---------|----------|----------------|------|---------------|----------|
| OCRNet | HRNetV2p-W18-Small | 512x1024 | 40000 | 3.5 | 10.45 | 74.30 | 75.95 | model \| log |
| OCRNet | HRNetV2p-W18 | 512x1024 | 40000 | 4.7 | 7.50 | 77.72 | 79.49 | model \| log |
| OCRNet | HRNetV2p-W48 | 512x1024 | 40000 | 8 | 4.22 | 80.58 | 81.79 | model \| log |
| OCRNet | HRNetV2p-W18-Small | 512x1024 | 80000 | - | - | 77.16 | 78.66 | model \| log |
| OCRNet | HRNetV2p-W18 | 512x1024 | 80000 | - | - | 78.57 | 80.46 | model \| log |
| OCRNet | HRNetV2p-W48 | 512x1024 | 80000 | - | - | 80.70 | 81.87 | model \| log |
| OCRNet | HRNetV2p-W18-Small | 512x1024 | 160000 | - | - | 78.45 | 79.97 | model \| log |
| OCRNet | HRNetV2p-W18 | 512x1024 | 160000 | - | - | 79.47 | 80.91 | model \| log |
| OCRNet | HRNetV2p-W48 | 512x1024 | 160000 | - | - | 81.35 | 82.70 | model \| log |
ADE20K
| Method | Backbone | Crop Size | Lr schd | Mem (GB) | Inf time (fps) | mIoU | mIoU(ms+flip) | download |
|--------|----------|-----------|---------|----------|----------------|------|---------------|----------|
| OCRNet | HRNetV2p-W18-Small | 512x512 | 80000 | 6.7 | 28.98 | 35.06 | 35.80 | model \| log |
| OCRNet | HRNetV2p-W18 | 512x512 | 80000 | 7.9 | 18.93 | 37.79 | 39.16 | model \| log |
| OCRNet | HRNetV2p-W48 | 512x512 | 80000 | 11.2 | 16.99 | 43.00 | 44.30 | model \| log |
| OCRNet | HRNetV2p-W18-Small | 512x512 | 160000 | - | - | 37.19 | 38.40 | model \| log |
| OCRNet | HRNetV2p-W18 | 512x512 | 160000 | - | - | 39.32 | 40.80 | model \| log |
| OCRNet | HRNetV2p-W48 | 512x512 | 160000 | - | - | 43.25 | 44.88 | model \| log |
Pascal VOC 2012 + Aug
| Method | Backbone | Crop Size | Lr schd | Mem (GB) | Inf time (fps) | mIoU | mIoU(ms+flip) | download |
|--------|----------|-----------|---------|----------|----------------|------|---------------|----------|
| OCRNet | HRNetV2p-W18-Small | 512x512 | 20000 | 3.5 | 31.55 | 71.70 | 73.84 | model \| log |
| OCRNet | HRNetV2p-W18 | 512x512 | 20000 | 4.7 | 19.91 | 74.75 | 77.11 | model \| log |
| OCRNet | HRNetV2p-W48 | 512x512 | 20000 | 8.1 | 17.83 | 77.72 | 79.87 | model \| log |
| OCRNet | HRNetV2p-W18-Small | 512x512 | 40000 | - | - | 72.76 | 74.60 | model \| log |
| OCRNet | HRNetV2p-W18 | 512x512 | 40000 | - | - | 74.98 | 77.40 | model \| log |
| OCRNet | HRNetV2p-W48 | 512x512 | 40000 | - | - | 77.14 | 79.71 | model \| log |