mirror of
https://github.com/huggingface/pytorch-image-models.git
synced 2025-06-03 15:01:08 +08:00
261 lines
7.4 KiB
Markdown
261 lines
7.4 KiB
Markdown
# Res2Net
|
||
|
||
**Res2Net** is an image model that employs a variation on bottleneck residual blocks, [Res2Net Blocks](https://paperswithcode.com/method/res2net-block). The motivation is to be able to represent features at multiple scales. This is achieved through a novel building block for CNNs that constructs hierarchical residual-like connections within one single residual block. This represents multi-scale features at a granular level and increases the range of receptive fields for each network layer.
|
||
|
||
{% include 'code_snippets.md' %}
|
||
|
||
## How do I train this model?
|
||
|
||
You can follow the [timm recipe scripts](https://rwightman.github.io/pytorch-image-models/scripts/) for training a new model afresh.
|
||
|
||
## Citation
|
||
|
||
```BibTeX
|
||
@article{Gao_2021,
|
||
title={Res2Net: A New Multi-Scale Backbone Architecture},
|
||
volume={43},
|
||
ISSN={1939-3539},
|
||
url={http://dx.doi.org/10.1109/TPAMI.2019.2938758},
|
||
DOI={10.1109/tpami.2019.2938758},
|
||
number={2},
|
||
journal={IEEE Transactions on Pattern Analysis and Machine Intelligence},
|
||
publisher={Institute of Electrical and Electronics Engineers (IEEE)},
|
||
author={Gao, Shang-Hua and Cheng, Ming-Ming and Zhao, Kai and Zhang, Xin-Yu and Yang, Ming-Hsuan and Torr, Philip},
|
||
year={2021},
|
||
month={Feb},
|
||
pages={652–662}
|
||
}
|
||
```
|
||
|
||
<!--
|
||
Type: model-index
|
||
Collections:
|
||
- Name: Res2Net
|
||
Paper:
|
||
Title: 'Res2Net: A New Multi-scale Backbone Architecture'
|
||
URL: https://paperswithcode.com/paper/res2net-a-new-multi-scale-backbone
|
||
Models:
|
||
- Name: res2net101_26w_4s
|
||
In Collection: Res2Net
|
||
Metadata:
|
||
FLOPs: 10415881200
|
||
Parameters: 45210000
|
||
File Size: 181456059
|
||
Architecture:
|
||
- Batch Normalization
|
||
- Convolution
|
||
- Global Average Pooling
|
||
- ReLU
|
||
- Res2Net Block
|
||
Tasks:
|
||
- Image Classification
|
||
Training Techniques:
|
||
- SGD with Momentum
|
||
- Weight Decay
|
||
Training Data:
|
||
- ImageNet
|
||
Training Resources: 4x Titan Xp GPUs
|
||
ID: res2net101_26w_4s
|
||
LR: 0.1
|
||
Epochs: 100
|
||
Crop Pct: '0.875'
|
||
Momentum: 0.9
|
||
Batch Size: 256
|
||
Image Size: '224'
|
||
Weight Decay: 0.0001
|
||
Interpolation: bilinear
|
||
Code: https://github.com/rwightman/pytorch-image-models/blob/d8e69206be253892b2956341fea09fdebfaae4e3/timm/models/res2net.py#L152
|
||
Weights: https://github.com/rwightman/pytorch-image-models/releases/download/v0.1-res2net/res2net101_26w_4s-02a759a1.pth
|
||
Results:
|
||
- Task: Image Classification
|
||
Dataset: ImageNet
|
||
Metrics:
|
||
Top 1 Accuracy: 79.19%
|
||
Top 5 Accuracy: 94.43%
|
||
- Name: res2net50_14w_8s
|
||
In Collection: Res2Net
|
||
Metadata:
|
||
FLOPs: 5403546768
|
||
Parameters: 25060000
|
||
File Size: 100638543
|
||
Architecture:
|
||
- Batch Normalization
|
||
- Convolution
|
||
- Global Average Pooling
|
||
- ReLU
|
||
- Res2Net Block
|
||
Tasks:
|
||
- Image Classification
|
||
Training Techniques:
|
||
- SGD with Momentum
|
||
- Weight Decay
|
||
Training Data:
|
||
- ImageNet
|
||
Training Resources: 4x Titan Xp GPUs
|
||
ID: res2net50_14w_8s
|
||
LR: 0.1
|
||
Epochs: 100
|
||
Crop Pct: '0.875'
|
||
Momentum: 0.9
|
||
Batch Size: 256
|
||
Image Size: '224'
|
||
Weight Decay: 0.0001
|
||
Interpolation: bilinear
|
||
Code: https://github.com/rwightman/pytorch-image-models/blob/d8e69206be253892b2956341fea09fdebfaae4e3/timm/models/res2net.py#L196
|
||
Weights: https://github.com/rwightman/pytorch-image-models/releases/download/v0.1-res2net/res2net50_14w_8s-6527dddc.pth
|
||
Results:
|
||
- Task: Image Classification
|
||
Dataset: ImageNet
|
||
Metrics:
|
||
Top 1 Accuracy: 78.14%
|
||
Top 5 Accuracy: 93.86%
|
||
- Name: res2net50_26w_4s
|
||
In Collection: Res2Net
|
||
Metadata:
|
||
FLOPs: 5499974064
|
||
Parameters: 25700000
|
||
File Size: 103110087
|
||
Architecture:
|
||
- Batch Normalization
|
||
- Convolution
|
||
- Global Average Pooling
|
||
- ReLU
|
||
- Res2Net Block
|
||
Tasks:
|
||
- Image Classification
|
||
Training Techniques:
|
||
- SGD with Momentum
|
||
- Weight Decay
|
||
Training Data:
|
||
- ImageNet
|
||
Training Resources: 4x Titan Xp GPUs
|
||
ID: res2net50_26w_4s
|
||
LR: 0.1
|
||
Epochs: 100
|
||
Crop Pct: '0.875'
|
||
Momentum: 0.9
|
||
Batch Size: 256
|
||
Image Size: '224'
|
||
Weight Decay: 0.0001
|
||
Interpolation: bilinear
|
||
Code: https://github.com/rwightman/pytorch-image-models/blob/d8e69206be253892b2956341fea09fdebfaae4e3/timm/models/res2net.py#L141
|
||
Weights: https://github.com/rwightman/pytorch-image-models/releases/download/v0.1-res2net/res2net50_26w_4s-06e79181.pth
|
||
Results:
|
||
- Task: Image Classification
|
||
Dataset: ImageNet
|
||
Metrics:
|
||
Top 1 Accuracy: 77.99%
|
||
Top 5 Accuracy: 93.85%
|
||
- Name: res2net50_26w_6s
|
||
In Collection: Res2Net
|
||
Metadata:
|
||
FLOPs: 8130156528
|
||
Parameters: 37050000
|
||
File Size: 148603239
|
||
Architecture:
|
||
- Batch Normalization
|
||
- Convolution
|
||
- Global Average Pooling
|
||
- ReLU
|
||
- Res2Net Block
|
||
Tasks:
|
||
- Image Classification
|
||
Training Techniques:
|
||
- SGD with Momentum
|
||
- Weight Decay
|
||
Training Data:
|
||
- ImageNet
|
||
Training Resources: 4x Titan Xp GPUs
|
||
ID: res2net50_26w_6s
|
||
LR: 0.1
|
||
Epochs: 100
|
||
Crop Pct: '0.875'
|
||
Momentum: 0.9
|
||
Batch Size: 256
|
||
Image Size: '224'
|
||
Weight Decay: 0.0001
|
||
Interpolation: bilinear
|
||
Code: https://github.com/rwightman/pytorch-image-models/blob/d8e69206be253892b2956341fea09fdebfaae4e3/timm/models/res2net.py#L163
|
||
Weights: https://github.com/rwightman/pytorch-image-models/releases/download/v0.1-res2net/res2net50_26w_6s-19041792.pth
|
||
Results:
|
||
- Task: Image Classification
|
||
Dataset: ImageNet
|
||
Metrics:
|
||
Top 1 Accuracy: 78.57%
|
||
Top 5 Accuracy: 94.12%
|
||
- Name: res2net50_26w_8s
|
||
In Collection: Res2Net
|
||
Metadata:
|
||
FLOPs: 10760338992
|
||
Parameters: 48400000
|
||
File Size: 194085165
|
||
Architecture:
|
||
- Batch Normalization
|
||
- Convolution
|
||
- Global Average Pooling
|
||
- ReLU
|
||
- Res2Net Block
|
||
Tasks:
|
||
- Image Classification
|
||
Training Techniques:
|
||
- SGD with Momentum
|
||
- Weight Decay
|
||
Training Data:
|
||
- ImageNet
|
||
Training Resources: 4x Titan Xp GPUs
|
||
ID: res2net50_26w_8s
|
||
LR: 0.1
|
||
Epochs: 100
|
||
Crop Pct: '0.875'
|
||
Momentum: 0.9
|
||
Batch Size: 256
|
||
Image Size: '224'
|
||
Weight Decay: 0.0001
|
||
Interpolation: bilinear
|
||
Code: https://github.com/rwightman/pytorch-image-models/blob/d8e69206be253892b2956341fea09fdebfaae4e3/timm/models/res2net.py#L174
|
||
Weights: https://github.com/rwightman/pytorch-image-models/releases/download/v0.1-res2net/res2net50_26w_8s-2c7c9f12.pth
|
||
Results:
|
||
- Task: Image Classification
|
||
Dataset: ImageNet
|
||
Metrics:
|
||
Top 1 Accuracy: 79.19%
|
||
Top 5 Accuracy: 94.37%
|
||
- Name: res2net50_48w_2s
|
||
In Collection: Res2Net
|
||
Metadata:
|
||
FLOPs: 5375291520
|
||
Parameters: 25290000
|
||
File Size: 101421406
|
||
Architecture:
|
||
- Batch Normalization
|
||
- Convolution
|
||
- Global Average Pooling
|
||
- ReLU
|
||
- Res2Net Block
|
||
Tasks:
|
||
- Image Classification
|
||
Training Techniques:
|
||
- SGD with Momentum
|
||
- Weight Decay
|
||
Training Data:
|
||
- ImageNet
|
||
Training Resources: 4x Titan Xp GPUs
|
||
ID: res2net50_48w_2s
|
||
LR: 0.1
|
||
Epochs: 100
|
||
Crop Pct: '0.875'
|
||
Momentum: 0.9
|
||
Batch Size: 256
|
||
Image Size: '224'
|
||
Weight Decay: 0.0001
|
||
Interpolation: bilinear
|
||
Code: https://github.com/rwightman/pytorch-image-models/blob/d8e69206be253892b2956341fea09fdebfaae4e3/timm/models/res2net.py#L185
|
||
Weights: https://github.com/rwightman/pytorch-image-models/releases/download/v0.1-res2net/res2net50_48w_2s-afed724a.pth
|
||
Results:
|
||
- Task: Image Classification
|
||
Dataset: ImageNet
|
||
Metrics:
|
||
Top 1 Accuracy: 77.53%
|
||
Top 5 Accuracy: 93.56%
|
||
-->
|