pytorch-image-models/timm/models/layers/evo_norm.py

"""EvoNormB0 (Batched) and EvoNormS0 (Sample) in PyTorch

An attempt at getting decent performing EvoNorms running in PyTorch.
While currently faster than other impl, still quite a ways off the built-in BN
in terms of memory usage and throughput (roughly 5x mem, 1/2 - 1/3x speed).

Still very much a WIP, fiddling with buffer usage, in-place/jit optimizations, and layouts.

Hacked together by / Copyright 2020 Ross Wightman
"""

import torch
import torch.nn as nn

from .trace_utils import _assert


class EvoNormBatch2d(nn.Module):
    def __init__(self, num_features, apply_act=True, momentum=0.1, eps=1e-5, drop_block=None):
        super(EvoNormBatch2d, self).__init__()
        self.apply_act = apply_act  # apply activation (non-linearity)
        self.momentum = momentum
        self.eps = eps
        self.weight = nn.Parameter(torch.ones(num_features), requires_grad=True)
        self.bias = nn.Parameter(torch.zeros(num_features), requires_grad=True)
        self.v = nn.Parameter(torch.ones(num_features), requires_grad=True) if apply_act else None
        self.register_buffer('running_var', torch.ones(num_features))
        self.reset_parameters()

    def reset_parameters(self):
        nn.init.ones_(self.weight)
        nn.init.zeros_(self.bias)
        if self.apply_act:
            nn.init.ones_(self.v)

    def forward(self, x):
        _assert(x.dim() == 4, 'expected 4D input')
        x_type = x.dtype
        if self.v is not None:
            running_var = self.running_var.view(1, -1, 1, 1)
            if self.training:
                var = x.var(dim=(0, 2, 3), unbiased=False, keepdim=True)
                n = x.numel() / x.shape[1]
                running_var = var.detach() * self.momentum * (n / (n - 1)) + running_var * (1 - self.momentum)
                self.running_var.copy_(running_var.view(self.running_var.shape))
            else:
                var = running_var
            v = self.v.to(dtype=x_type).reshape(1, -1, 1, 1)
            d = x * v + (x.var(dim=(2, 3), unbiased=False, keepdim=True) + self.eps).sqrt().to(dtype=x_type)
            d = d.max((var + self.eps).sqrt().to(dtype=x_type))
            x = x / d
        return x * self.weight.view(1, -1, 1, 1) + self.bias.view(1, -1, 1, 1)


class EvoNormSample2d(nn.Module):
    def __init__(self, num_features, apply_act=True, groups=32, eps=1e-5, drop_block=None):
        super(EvoNormSample2d, self).__init__()
        self.apply_act = apply_act  # apply activation (non-linearity)
        self.groups = groups
        self.eps = eps
        self.weight = nn.Parameter(torch.ones(num_features), requires_grad=True)
        self.bias = nn.Parameter(torch.zeros(num_features), requires_grad=True)
        self.v = nn.Parameter(torch.ones(num_features), requires_grad=True) if apply_act else None
        self.reset_parameters()

    def reset_parameters(self):
        nn.init.ones_(self.weight)
        nn.init.zeros_(self.bias)
        if self.apply_act:
            nn.init.ones_(self.v)

    def forward(self, x):
        _assert(x.dim() == 4, 'expected 4D input')
        B, C, H, W = x.shape
        _assert(C % self.groups == 0, '')
        if self.v is not None:
            n = x * (x * self.v.view(1, -1, 1, 1)).sigmoid()
            x = x.reshape(B, self.groups, -1)
            x = n.reshape(B, self.groups, -1) / (x.var(dim=-1, unbiased=False, keepdim=True) + self.eps).sqrt()
            x = x.reshape(B, C, H, W)
        return x * self.weight.view(1, -1, 1, 1) + self.bias.view(1, -1, 1, 1)
DenseNet converted to support ABN (norm + act) modules. Experimenting with EvoNorm, IABN 2020-05-09 18:26:41 -07:00			`"""EvoNormB0 (Batched) and EvoNormS0 (Sample) in PyTorch`

			`An attempt at getting decent performing EvoNorms running in PyTorch.`
			`While currently faster than other impl, still quite a ways off the built-in BN`
Monster commit, activation refactor, VoVNet, norm_act improvements, more * refactor activations into basic PyTorch, jit scripted, and memory efficient custom auto * implement hard-mish, better grad for hard-swish * add initial VovNet V1/V2 impl, fix #151 * VovNet and DenseNet first models to use NormAct layers (support BatchNormAct2d, EvoNorm, InplaceIABN) * Wrap IABN for any models that use it * make more models torchscript compatible (DPN, PNasNet, Res2Net, SelecSLS) and add tests 2020-06-01 16:59:51 -07:00			`in terms of memory usage and throughput (roughly 5x mem, 1/2 - 1/3x speed).`
DenseNet converted to support ABN (norm + act) modules. Experimenting with EvoNorm, IABN 2020-05-09 18:26:41 -07:00
Monster commit, activation refactor, VoVNet, norm_act improvements, more * refactor activations into basic PyTorch, jit scripted, and memory efficient custom auto * implement hard-mish, better grad for hard-swish * add initial VovNet V1/V2 impl, fix #151 * VovNet and DenseNet first models to use NormAct layers (support BatchNormAct2d, EvoNorm, InplaceIABN) * Wrap IABN for any models that use it * make more models torchscript compatible (DPN, PNasNet, Res2Net, SelecSLS) and add tests 2020-06-01 16:59:51 -07:00			`Still very much a WIP, fiddling with buffer usage, in-place/jit optimizations, and layouts.`
DenseNet converted to support ABN (norm + act) modules. Experimenting with EvoNorm, IABN 2020-05-09 18:26:41 -07:00
Fix some attributions, add copyrights to some file docstrings 2020-07-27 13:44:56 -07:00			`Hacked together by / Copyright 2020 Ross Wightman`
DenseNet converted to support ABN (norm + act) modules. Experimenting with EvoNorm, IABN 2020-05-09 18:26:41 -07:00			`"""`

			`import torch`
			`import torch.nn as nn`

wip - pre-rebase 2021-11-12 20:42:45 +00:00			`from .trace_utils import _assert`

DenseNet converted to support ABN (norm + act) modules. Experimenting with EvoNorm, IABN 2020-05-09 18:26:41 -07:00
			`class EvoNormBatch2d(nn.Module):`
Monster commit, activation refactor, VoVNet, norm_act improvements, more * refactor activations into basic PyTorch, jit scripted, and memory efficient custom auto * implement hard-mish, better grad for hard-swish * add initial VovNet V1/V2 impl, fix #151 * VovNet and DenseNet first models to use NormAct layers (support BatchNormAct2d, EvoNorm, InplaceIABN) * Wrap IABN for any models that use it * make more models torchscript compatible (DPN, PNasNet, Res2Net, SelecSLS) and add tests 2020-06-01 16:59:51 -07:00			`def __init__(self, num_features, apply_act=True, momentum=0.1, eps=1e-5, drop_block=None):`
DenseNet converted to support ABN (norm + act) modules. Experimenting with EvoNorm, IABN 2020-05-09 18:26:41 -07:00			`super(EvoNormBatch2d, self).__init__()`
Monster commit, activation refactor, VoVNet, norm_act improvements, more * refactor activations into basic PyTorch, jit scripted, and memory efficient custom auto * implement hard-mish, better grad for hard-swish * add initial VovNet V1/V2 impl, fix #151 * VovNet and DenseNet first models to use NormAct layers (support BatchNormAct2d, EvoNorm, InplaceIABN) * Wrap IABN for any models that use it * make more models torchscript compatible (DPN, PNasNet, Res2Net, SelecSLS) and add tests 2020-06-01 16:59:51 -07:00			`self.apply_act = apply_act # apply activation (non-linearity)`
DenseNet converted to support ABN (norm + act) modules. Experimenting with EvoNorm, IABN 2020-05-09 18:26:41 -07:00			`self.momentum = momentum`
			`self.eps = eps`
Make evonorm variables 1d to match other PyTorch norm layers, will break weight compat for any existing use (likely minimal, easy to fix). 2021-11-20 15:50:51 -08:00			`self.weight = nn.Parameter(torch.ones(num_features), requires_grad=True)`
			`self.bias = nn.Parameter(torch.zeros(num_features), requires_grad=True)`
			`self.v = nn.Parameter(torch.ones(num_features), requires_grad=True) if apply_act else None`
			`self.register_buffer('running_var', torch.ones(num_features))`
DenseNet converted to support ABN (norm + act) modules. Experimenting with EvoNorm, IABN 2020-05-09 18:26:41 -07:00			`self.reset_parameters()`

			`def reset_parameters(self):`
			`nn.init.ones_(self.weight)`
			`nn.init.zeros_(self.bias)`
Monster commit, activation refactor, VoVNet, norm_act improvements, more * refactor activations into basic PyTorch, jit scripted, and memory efficient custom auto * implement hard-mish, better grad for hard-swish * add initial VovNet V1/V2 impl, fix #151 * VovNet and DenseNet first models to use NormAct layers (support BatchNormAct2d, EvoNorm, InplaceIABN) * Wrap IABN for any models that use it * make more models torchscript compatible (DPN, PNasNet, Res2Net, SelecSLS) and add tests 2020-06-01 16:59:51 -07:00			`if self.apply_act:`
DenseNet converted to support ABN (norm + act) modules. Experimenting with EvoNorm, IABN 2020-05-09 18:26:41 -07:00			`nn.init.ones_(self.v)`

			`def forward(self, x):`
Fix FX breaking assert in evonorm 2021-11-24 09:24:47 -08:00			`_assert(x.dim() == 4, 'expected 4D input')`
Add norm_act factory method, move JIT of norm layers to factory 2020-05-09 22:07:01 -07:00			`x_type = x.dtype`
Make evonorm variables 1d to match other PyTorch norm layers, will break weight compat for any existing use (likely minimal, easy to fix). 2021-11-20 15:50:51 -08:00			`if self.v is not None:`
Fix FX breaking assert in evonorm 2021-11-24 09:24:47 -08:00			`running_var = self.running_var.view(1, -1, 1, 1)`
			`if self.training:`
			`var = x.var(dim=(0, 2, 3), unbiased=False, keepdim=True)`
			`n = x.numel() / x.shape[1]`
			`running_var = var.detach() * self.momentum * (n / (n - 1)) + running_var * (1 - self.momentum)`
			`self.running_var.copy_(running_var.view(self.running_var.shape))`
			`else:`
			`var = running_var`
Make evonorm variables 1d to match other PyTorch norm layers, will break weight compat for any existing use (likely minimal, easy to fix). 2021-11-20 15:50:51 -08:00			`v = self.v.to(dtype=x_type).reshape(1, -1, 1, 1)`
Fix a silly bug in Sample version of EvoNorm missing x* part of swish, update EvoNormBatch to accumulated unbiased variance. 2020-08-13 18:23:50 -07:00			`d = x * v + (x.var(dim=(2, 3), unbiased=False, keepdim=True) + self.eps).sqrt().to(dtype=x_type)`
Monster commit, activation refactor, VoVNet, norm_act improvements, more * refactor activations into basic PyTorch, jit scripted, and memory efficient custom auto * implement hard-mish, better grad for hard-swish * add initial VovNet V1/V2 impl, fix #151 * VovNet and DenseNet first models to use NormAct layers (support BatchNormAct2d, EvoNorm, InplaceIABN) * Wrap IABN for any models that use it * make more models torchscript compatible (DPN, PNasNet, Res2Net, SelecSLS) and add tests 2020-06-01 16:59:51 -07:00			`d = d.max((var + self.eps).sqrt().to(dtype=x_type))`
Add norm_act factory method, move JIT of norm layers to factory 2020-05-09 22:07:01 -07:00			`x = x / d`
Make evonorm variables 1d to match other PyTorch norm layers, will break weight compat for any existing use (likely minimal, easy to fix). 2021-11-20 15:50:51 -08:00			`return x * self.weight.view(1, -1, 1, 1) + self.bias.view(1, -1, 1, 1)`
DenseNet converted to support ABN (norm + act) modules. Experimenting with EvoNorm, IABN 2020-05-09 18:26:41 -07:00

			`class EvoNormSample2d(nn.Module):`
Prep a set of ResNetV2 models with GroupNorm, EvoNormB0, EvoNormS0 for BN free model experiments on TPU and IPU 2021-11-19 17:37:00 -08:00			`def __init__(self, num_features, apply_act=True, groups=32, eps=1e-5, drop_block=None):`
DenseNet converted to support ABN (norm + act) modules. Experimenting with EvoNorm, IABN 2020-05-09 18:26:41 -07:00			`super(EvoNormSample2d, self).__init__()`
Monster commit, activation refactor, VoVNet, norm_act improvements, more * refactor activations into basic PyTorch, jit scripted, and memory efficient custom auto * implement hard-mish, better grad for hard-swish * add initial VovNet V1/V2 impl, fix #151 * VovNet and DenseNet first models to use NormAct layers (support BatchNormAct2d, EvoNorm, InplaceIABN) * Wrap IABN for any models that use it * make more models torchscript compatible (DPN, PNasNet, Res2Net, SelecSLS) and add tests 2020-06-01 16:59:51 -07:00			`self.apply_act = apply_act # apply activation (non-linearity)`
DenseNet converted to support ABN (norm + act) modules. Experimenting with EvoNorm, IABN 2020-05-09 18:26:41 -07:00			`self.groups = groups`
			`self.eps = eps`
Make evonorm variables 1d to match other PyTorch norm layers, will break weight compat for any existing use (likely minimal, easy to fix). 2021-11-20 15:50:51 -08:00			`self.weight = nn.Parameter(torch.ones(num_features), requires_grad=True)`
			`self.bias = nn.Parameter(torch.zeros(num_features), requires_grad=True)`
			`self.v = nn.Parameter(torch.ones(num_features), requires_grad=True) if apply_act else None`
DenseNet converted to support ABN (norm + act) modules. Experimenting with EvoNorm, IABN 2020-05-09 18:26:41 -07:00			`self.reset_parameters()`

			`def reset_parameters(self):`
			`nn.init.ones_(self.weight)`
			`nn.init.zeros_(self.bias)`
Monster commit, activation refactor, VoVNet, norm_act improvements, more * refactor activations into basic PyTorch, jit scripted, and memory efficient custom auto * implement hard-mish, better grad for hard-swish * add initial VovNet V1/V2 impl, fix #151 * VovNet and DenseNet first models to use NormAct layers (support BatchNormAct2d, EvoNorm, InplaceIABN) * Wrap IABN for any models that use it * make more models torchscript compatible (DPN, PNasNet, Res2Net, SelecSLS) and add tests 2020-06-01 16:59:51 -07:00			`if self.apply_act:`
DenseNet converted to support ABN (norm + act) modules. Experimenting with EvoNorm, IABN 2020-05-09 18:26:41 -07:00			`nn.init.ones_(self.v)`

			`def forward(self, x):`
wip - pre-rebase 2021-11-12 20:42:45 +00:00			`_assert(x.dim() == 4, 'expected 4D input')`
Add norm_act factory method, move JIT of norm layers to factory 2020-05-09 22:07:01 -07:00			`B, C, H, W = x.shape`
wip - pre-rebase 2021-11-12 20:42:45 +00:00			`_assert(C % self.groups == 0, '')`
Make evonorm variables 1d to match other PyTorch norm layers, will break weight compat for any existing use (likely minimal, easy to fix). 2021-11-20 15:50:51 -08:00			`if self.v is not None:`
			`n = x * (x * self.v.view(1, -1, 1, 1)).sigmoid()`
Add norm_act factory method, move JIT of norm layers to factory 2020-05-09 22:07:01 -07:00			`x = x.reshape(B, self.groups, -1)`
Fix a silly bug in Sample version of EvoNorm missing x* part of swish, update EvoNormBatch to accumulated unbiased variance. 2020-08-13 18:23:50 -07:00			`x = n.reshape(B, self.groups, -1) / (x.var(dim=-1, unbiased=False, keepdim=True) + self.eps).sqrt()`
Add norm_act factory method, move JIT of norm layers to factory 2020-05-09 22:07:01 -07:00			`x = x.reshape(B, C, H, W)`
Make evonorm variables 1d to match other PyTorch norm layers, will break weight compat for any existing use (likely minimal, easy to fix). 2021-11-20 15:50:51 -08:00			`return x * self.weight.view(1, -1, 1, 1) + self.bias.view(1, -1, 1, 1)`