History

Yixiao Fang 08dc8c75d3 [Refactor] Add selfsup algorithms. (#1389 ) * remove basehead * add moco series * add byol simclr simsiam * add ut * update configs * add simsiam hook * add and refactor beit * update ut * add cae * update extract_feat * refactor cae * add mae * refactor data preprocessor * update heads * add maskfeat * add milan * add simmim * add mixmim * fix lint * fix ut * fix lint * add eva * add densecl * add barlowtwins * add swav * fix lint * update readtherdocs rst * update docs * update * Decrease UT memory usage * Fix docstring * update DALLEEncoder * Update model docs * refactor dalle encoder * update docstring * fix ut * fix config error * add val_cfg and test_cfg * refactor clip generator * fix lint * pass check * fix ut * add lars * update type of BEiT in configs * Use MMEngine style momentum in EMA. * apply mmpretrain solarize --------- Co-authored-by: mzr1996 <mzr1996@163.com>		2023-03-06 16:53:15 +08:00
..
benchmarks	[Refactor] Move transforms in mmselfsup to mmpretrain. (#1396 )	2023-03-03 15:01:11 +08:00
README.md	[Docs] Update generate_readme.py and readme files. (#1388 )	2023-03-02 13:29:07 +08:00
densecl_resnet50_8xb32-coslr-200e_in1k.py	[Refactor] Add selfsup algorithms. (#1389 )	2023-03-06 16:53:15 +08:00
metafile.yml	[Refactor] Refactor configs and metafile (#1369 )	2023-02-23 11:17:16 +08:00

README.md

DenseCL

Dense contrastive learning for self-supervised visual pre-training

Abstract

To date, most existing self-supervised learning methods are designed and optimized for image classification. These pre-trained models can be sub-optimal for dense prediction tasks due to the discrepancy between image-level prediction and pixel-level prediction. To fill this gap, we aim to design an effective, dense self-supervised learning method that directly works at the level of pixels (or local features) by taking into account the correspondence between local features. We present dense contrastive learning (DenseCL), which implements self-supervised learning by optimizing a pairwise contrastive (dis)similarity loss at the pixel level between two views of input images.

How to use it?

Predict image

from mmpretrain import inference_model

predict = inference_model('resnet50_densecl-pre_8xb32-linear-steplr-100e_in1k', 'demo/bird.JPEG')
print(predict['pred_class'])
print(predict['pred_score'])

Use the model

import torch
from mmpretrain import get_model

model = get_model('densecl_resnet50_8xb32-coslr-200e_in1k', pretrained=True)
inputs = torch.rand(1, 3, 224, 224)
out = model(inputs)
print(type(out))
# To extract features.
feats = model.extract_feat(inputs)
print(type(feats))

Train/Test Command

Prepare your dataset according to the docs.

Train:

python tools/train.py configs/densecl/densecl_resnet50_8xb32-coslr-200e_in1k.py

Test:

python tools/test.py configs/densecl/benchmarks/resnet50_8xb32-linear-steplr-100e_in1k.py https://download.openmmlab.com/mmselfsup/1.x/densecl/densecl_resnet50_8xb32-coslr-200e_in1k/resnet50_linear-8xb32-steplr-100e_in1k/resnet50_linear-8xb32-steplr-100e_in1k_20220825-f0f0a579.pth

Models and results

Pretrained models

Model	Params (M)	Flops (G)	Config	Download
`densecl_resnet50_8xb32-coslr-200e_in1k`	N/A	N/A	config	model \| log

Image Classification on ImageNet-1k

Model	Pretrain	Params (M)	Flops (G)	Top-1 (%)	Config	Download
`resnet50_densecl-pre_8xb32-linear-steplr-100e_in1k`	DENSECL	N/A	N/A	63.50	config	model \| log

Citation

@inproceedings{wang2021dense,
  title={Dense contrastive learning for self-supervised visual pre-training},
  author={Wang, Xinlong and Zhang, Rufeng and Shen, Chunhua and Kong, Tao and Li, Lei},
  booktitle={CVPR},
  year={2021}
}