Ross Wightman
|
f77c04ff36
|
TorchScript fixes/hacks for rms_norm, refactor ParallelScalingBlock with manual combination of input projections, closer paper match
|
2023-02-16 16:57:42 -08:00 |
Ross Wightman
|
621e1b2182
|
Add ideas from 'Scaling ViT to 22-B Params', testing PyTorch 2.0 fused F.scaled_dot_product_attention impl in vit, vit_relpos, maxxvit / coatnet.
|
2023-02-16 16:57:42 -08:00 |
Ross Wightman
|
927f031293
|
Major module / path restructure, timm.models.layers -> timm.layers, add _ prefix to all non-model modules in timm.models
|
2022-12-06 15:00:06 -08:00 |