pytorch-image-models

mirror of https://github.com/huggingface/pytorch-image-models.git synced 2025-06-03 15:01:08 +08:00

Author	SHA1	Message	Date
Ross Wightman	7c7ecd2492	Add --use-train-size flag to force use of train input_size (over test input size) for validation. Default test-time pooling to use train input size (fixes issues).	2022-07-07 22:01:24 -07:00
Ross Wightman	eca09b8642	Add MobileVitV2 support. Fix #1332 . Move GroupNorm1 to common layers (used in poolformer + mobilevitv2). Keep ol custom ConvNeXt LayerNorm2d impl as LayerNormExp2d for reference.	2022-07-07 14:41:01 -07:00
Ross Wightman	7a9c6811c9	Add eps arg to LayerNorm2d, add 'tf' (tensorflow) variant of trunc_normal_ that applies scale/shift after sampling (instead of needing to move a/b)	2022-07-01 15:15:39 -07:00
Ross Wightman	82c311d082	Add more experimental darknet and 'cs2' darknet variants (different cross stage setup, closer to newer YOLO backbones) for train trials.	2022-07-01 15:14:01 -07:00
Ross Wightman	07d0c4ae96	Improve repr for DropPath module	2022-06-24 14:58:15 -07:00
Ross Wightman	e27c16b8a0	Remove unecessary code for synbn guard	2022-06-24 14:57:42 -07:00
Ross Wightman	0da3c9ebbf	Remove SiLU layer in default args that breaks import on old old PyTorch	2022-06-24 14:56:58 -07:00
Ross Wightman	879df47c0a	Support BatchNormAct2d for sync-bn use. Fix #1254	2022-06-24 14:51:26 -07:00
Ross Wightman	4b30bae67b	Add updated vit_relpos weights, and impl w/ support for official swin-v2 differences for relpos. Add bias control support for MLP layers	2022-05-13 13:53:57 -07:00
jjsjann123	f88c606fcf	fixing channels_last on cond_conv2d; update nvfuser debug env variable	2022-04-25 12:41:46 -07:00
Ross Wightman	f670d98cb8	Make a few more layers symbolically traceable (remove from FX leaf modules) * remove dtype kwarg from .to() calls in EvoNorm as it messed up script + trace combo * BatchNormAct2d always uses custom forward (cut & paste from original) instead of super().forward. Fixes #1176 * BlurPool groups==channels, no need to use input.dim[1]	2022-03-24 21:43:56 -07:00
Ross Wightman	b049a5c5c6	Merge remote-tracking branch 'origin/master' into norm_norm_norm	2022-03-21 13:41:43 -07:00
Ross Wightman	9440a50c95	Merge branch 'mrT23-master'	2022-03-21 12:30:02 -07:00
Ross Wightman	372ad5fa0d	Significant model refactor and additions: * All models updated with revised foward_features / forward_head interface * Vision transformer and MLP based models consistently output sequence from forward_features (pooling or token selection considered part of 'head') * WIP param grouping interface to allow consistent grouping of parameters for layer-wise decay across all model types * Add gradient checkpointing support to a significant % of models, especially popular architectures * Formatting and interface consistency improvements across models * layer-wise LR decay impl part of optimizer factory w/ scale support in scheduler * Poolformer and Volo architectures added	2022-02-28 13:56:23 -08:00
Ross Wightman	95cfc9b3e8	Merge remote-tracking branch 'origin/master' into norm_norm_norm	2022-01-25 22:20:45 -08:00
Ross Wightman	656757d26b	Fix MobileNetV2 head conv size for multiplier < 1.0. Add some missing modification copyrights, fix starting date of some old ones.	2022-01-14 16:28:27 -08:00
Ross Wightman	b27c21b09a	Update drop_path and drop_block (fast impl) to be symbolically traceable, slightly faster	2022-01-06 16:04:58 -08:00
Ross Wightman	214c84a235	Disable use of timm nn.Linear wrapper since AMP autocast + torchscript use appears fixed	2022-01-06 16:01:51 -08:00
Ross Wightman	a52a614475	Remove layer experiment which should not have been added	2021-12-14 14:29:32 -08:00
Ross Wightman	ab49d275de	Significant norm update * ConvBnAct layer renamed -> ConvNormAct and ConvNormActAa for anti-aliased * Significant update to EfficientNet and MobileNetV3 arch to support NormAct layers and grouped conv (as alternative to depthwise) * Update RegNet to add Z variant * Add Pre variant of XceptionAligned that works with NormAct layers * EvoNorm matches bits_and_tpu branch for merge	2021-12-14 13:48:30 -08:00
Ross Wightman	d04f2f1377	Update drop_path and drop_block (fast impl) to be symbolically traceable, slightly faster	2021-12-05 15:36:56 -08:00
Ross Wightman	834a9ec721	Disable use of timm nn.Linear wrapper since AMP autocast + torchscript use appears fixed	2021-12-01 14:58:09 -08:00
Ross Wightman	78912b6375	Updated EvoNorm implementations with some experimentation. Add FilterResponseNorm. Updated RegnetZ and ResNetV2 model defs for trials.	2021-12-01 12:09:01 -08:00
talrid	c11f4c3218	support CNNs	2021-11-30 08:48:08 +02:00
mrT23	d6701d8a81	Merge branch 'rwightman:master' into master	2021-11-30 08:07:44 +02:00
Ross Wightman	480c676ffa	Fix FX breaking assert in evonorm	2021-11-24 09:24:47 -08:00
talrid	41559247e9	use_ml_decoder_head	2021-11-22 17:50:39 +02:00
Ross Wightman	93cc08fdc5	Make evonorm variables 1d to match other PyTorch norm layers, will break weight compat for any existing use (likely minimal, easy to fix).	2021-11-20 15:50:51 -08:00
Ross Wightman	af607b75cc	Prep a set of ResNetV2 models with GroupNorm, EvoNormB0, EvoNormS0 for BN free model experiments on TPU and IPU	2021-11-19 17:37:00 -08:00
Ross Wightman	c976a410d9	Add ResNet-50 w/ GN (resnet50_gn) and SEBotNet-33-TS (sebotnet33ts_256) model defs and weights. Update halonet50ts weights w/ slightly better variant in1k val, more robust to test sets.	2021-11-19 14:24:43 -08:00
Alexander Soare	b25ff96768	wip - pre-rebase	2021-11-12 20:45:05 +00:00
Alexander Soare	e051dce354	Make all models FX traceable	2021-11-12 20:45:05 +00:00
Alexander Soare	0149ec30d7	wip - attempting to rebase	2021-11-12 20:45:05 +00:00
Alexander Soare	bc3d4eb403	wip -rebase	2021-11-12 20:45:05 +00:00
Ross Wightman	2ddef942b9	Better fix for #954 that doesn't break torchscript, pull torch._assert into timm namespace when it exists	2021-11-02 11:22:33 -07:00
Ross Wightman	4f0f9cb348	Fix #954 by bringing traceable _assert into timm to allow compat w/ PyTorch < 1.8	2021-11-02 09:21:40 -07:00
Ross Wightman	b745d30a3e	Fix formatting of last commit	2021-10-25 15:15:14 -07:00
Ross Wightman	3478f1d7f1	Traceability fix for vit models for some experiments	2021-10-25 15:13:08 -07:00
Ross Wightman	f658a72e72	Cleanup re-use of Dropout modules in Mlp modules after some twitter feedback :p	2021-10-25 00:40:59 -07:00
Ross Wightman	c02334d9fa	Add weights for regnetz_d and haloregnetz_c, update regnetz_c weights. Add commented PyTorch XLA code for halo attention	2021-10-19 12:32:09 -07:00
Ross Wightman	02daf2ab94	Add option to include relative pos embedding in the attention scaling as per references. See discussion #912	2021-10-12 15:37:01 -07:00
Ross Wightman	e2b8d44ff0	Halo, bottleneck attn, lambda layer additions and cleanup along w/ experimental model defs * align interfaces of halo, bottleneck attn and lambda layer * add qk_ratio to all of above, control q/k dim relative to output dim * add experimental haloregnetz, and trionet (lambda + halo + bottle) models	2021-10-06 16:32:48 -07:00
Ross Wightman	007bc39323	Some halo and bottleneck attn code cleanup, add halonet50ts weights, use optimal crop ratios	2021-10-02 15:51:42 -07:00
Ross Wightman	b1c2e3eb92	Match rel_pos_indices attr rename in conv branch	2021-09-30 23:19:05 -07:00
Ross Wightman	b49630a138	Add relative pos embed option to LambdaLayer, fix last transpose/reshape.	2021-09-30 22:45:09 -07:00
Ross Wightman	b81e79aae9	Fix bottleneck attn transpose typo, hopefully these train better now..	2021-09-28 16:38:41 -07:00
Ross Wightman	515121cca1	Use reshape instead of view in std_conv, causing issues in recent PyTorch in channels_last	2021-09-23 15:43:48 -07:00
Ross Wightman	5bd04714e4	Cleanup weight init for byob/byoanet and related	2021-09-05 15:34:05 -07:00
Ross Wightman	8642401e88	Swap botnet 26/50 weights/models after realizing a mistake in arch def, now figuring out why they were so low...	2021-09-05 15:17:19 -07:00
Ross Wightman	5f12de4875	Add initial AttentionPool2d that's being trialed. Fix comment and still trying to improve reliability of sgd test.	2021-09-05 12:41:14 -07:00

1 2 3 4

167 Commits