1.0 KiB

Raw Blame History

PVTV2

1. 概述

PVTV2 是 VisionTransformer 系列模型，该模型基于 PVT（Pyramid Vision Transformer）改进得到，PVT 模型使用 Transformer 结构构建了特征金字塔网络。PVTV2 的主要创新点有：1. 带 overlap 的 Patch embeding；2. 结合卷积神经网络；3. 注意力模块为线性复杂度。论文地址。

2. 精度、FLOPS 和参数量

Models	Top1	Top5	Reference top1	Reference top5	FLOPS (G)	Params (M)
PVT_V2_B0	0.705	0.902	0.705	-	0.53	3.7
PVT_V2_B1	0.787	0.945	0.787	-	2.0	14.0
PVT_V2_B2	0.821	0.960	0.820	-	3.9	25.4
PVT_V2_B3	0.831	0.965	0.831	-	6.7	45.2
PVT_V2_B4	0.836	0.967	0.836	-	9.8	62.6
PVT_V2_B5	0.837	0.966	0.838	-	11.4	82.0
PVT_V2_B2_Linear	0.821	0.961	0.821	-	3.8	22.6

1.0 KiB Raw Blame History Unescape Escape

PVTV2

目录

1. 概述

2. 精度、FLOPS 和参数量

1.0 KiB

Raw Blame History