pytorch-image-models

mirror of https://github.com/huggingface/pytorch-image-models.git synced 2025-06-03 15:01:08 +08:00

Author	SHA1	Message	Date
Ross Wightman	43aa84e861	Add 'fast' layer norm that doesn't cast to float32, support APEX LN impl for slight speed gain, update norm and act factories, tweak SE for ability to disable bias (needed by GCVit)	2022-08-17 14:32:58 -07:00
Ross Wightman	c486aa71f8	Add GCViT	2022-08-17 14:29:18 -07:00
Ross Wightman	fba6ecd39b	Add EfficientFormer	2022-08-17 14:08:53 -07:00
Ross Wightman	ff4a38e2c3	Add PyramidVisionTransformerV2	2022-08-17 12:06:05 -07:00
Ross Wightman	1d8ada359a	Add timm ConvNeXt 'atto' weights, change test resolution for FB ConvNeXt 224x224 weights, add support for different dw kernel_size	2022-08-15 17:56:08 -07:00
Ross Wightman	2544d3b80f	ConvNeXt pico, femto, and nano, pico, femto ols (overlapping stem) weights and model defs	2022-08-05 17:05:50 -07:00
Ross Wightman	13565aad50	Add edgenext_base model def & weight link, update to improve ONNX export #1385	2022-08-05 16:58:34 -07:00
Ross Wightman	8ad4bdfa06	Allow ntuple to be used with string values	2022-07-28 16:18:18 -07:00
Christoph Reich	faae93e62d	Fix typo in PositionalEncodingFourier	2022-07-28 19:08:08 -04:00
Ross Wightman	ec6a28830f	Add DeiT-III 'medium' model defs and weights	2022-07-28 15:03:20 -07:00
Ross Wightman	6f103a442b	Add convnext_nano weights, 80.8 @ 224, 81.5 @ 288	2022-07-26 16:40:27 -07:00
Ross Wightman	4042a94f8f	Add weights for two 'Edge' block (3x3->1x1) variants of CS3 networks.	2022-07-26 16:40:27 -07:00
Ross Wightman	c8f69e04a9	Merge pull request #1365 from veritable-tech/fix-resize-pos-embed Take `no_emb_class` into account when calling `resize_pos_embed`	2022-07-24 21:03:01 -07:00
Ceshine Lee	0b64117592	Take `no_emb_class` into account when calling `resize_pos_embed`	2022-07-24 19:11:45 +08:00
Jasha10	56c3a84db3	Update type hint for `register_notrace_module` register_notrace_module is used to decorate types (i.e. subclasses of nn.Module). It is not called on module instances.	2022-07-22 16:59:55 -05:00
Ross Wightman	1b278136c3	Change models with mean 0,0,0 std 1,1,1 from int to float for consistency as mentioned in #1355	2022-07-21 17:36:15 -07:00
Ross Wightman	909705e7ff	Remove some redundant requires_grad=True from nn.Parameter in third party code	2022-07-20 12:37:41 -07:00
Ross Wightman	c5e0d1c700	Add dilation support to convnext, allows output_stride=8 and 16 use. Fix #1341	2022-07-19 17:52:10 -07:00
Ross Wightman	dc376e3676	Ensure all model entrypoint fn default to `pretrained=False` (a few didn't)	2022-07-19 13:58:41 -07:00
Ross Wightman	23b102064a	Add cs3sedarknet_x weights w/ 82.65 @ 288 top1. Add 2 cs3 edgenet models (w/ 3x3-1x1 block), remove aa from cspnet blocks (not needed)	2022-07-19 13:56:44 -07:00
Ross Wightman	05313940e2	Add cs3darknet_x, cs3sedarknet_l, and darknetaa53 weights from TPU sessions. Move SE btwn conv1 & conv2 in DarkBlock. Improve SE/attn handling in Csp/DarkNet. Fix leaky_relu bug on older csp models.	2022-07-15 16:55:16 -07:00
nateraw	51cca82aa1	👽 use hf_hub_download instead of cached_download	2022-07-14 16:41:45 -04:00
Ross Wightman	a45b4bce9a	x and xx small edgenext models do benefit from larger test input size	2022-07-08 10:53:27 -07:00
Ross Wightman	a8e34051c1	Unbreak gamma remap impacting beit checkpoint load, version bump to 0.6.4	2022-07-07 23:07:43 -07:00
Ross Wightman	a1cb25066e	Add edgnext_small_rw weights trained with swin like recipe. Better than original 'small' but not the recent 'USI' distilled weights.	2022-07-07 22:02:57 -07:00
Ross Wightman	7c7ecd2492	Add --use-train-size flag to force use of train input_size (over test input size) for validation. Default test-time pooling to use train input size (fixes issues).	2022-07-07 22:01:24 -07:00
Ross Wightman	ce65a7b29f	Update vit_relpos w/ some additional weights, some cleanup to match recent vit updates, more MLP log coord experiments.	2022-07-07 21:33:25 -07:00
Ross Wightman	58621723bd	Add CrossStage3 DarkNet (cs3) weights	2022-07-07 17:43:38 -07:00
Ross Wightman	db0cee9910	Refactor cspnet configuration using dataclasses, update feature extraction for new cs3 variants.	2022-07-07 14:43:27 -07:00
Ross Wightman	eca09b8642	Add MobileVitV2 support. Fix #1332 . Move GroupNorm1 to common layers (used in poolformer + mobilevitv2). Keep ol custom ConvNeXt LayerNorm2d impl as LayerNormExp2d for reference.	2022-07-07 14:41:01 -07:00
Ross Wightman	06307b8b41	Remove experimental downsample in block support in ConvNeXt. Experiment further before keeping it in.	2022-07-07 14:37:58 -07:00
Ross Wightman	7d4b3807d5	Support DeiT-3 (Revenge of the ViT) checkpoints. Add non-overlapping (w/ class token) pos-embed support to vit.	2022-07-04 22:25:22 -07:00
Ross Wightman	d0c5bd5722	Rename cs2->cs3 for darknets. Fix features_only for cs3 darknets.	2022-07-03 08:32:41 -07:00
Ross Wightman	d765305821	Remove first_conv for resnetaa50 def	2022-07-02 15:56:17 -07:00
Ross Wightman	dd9b8f57c4	Add feature_info to edgenext for features_only support, hopefully fix some fx / test errors	2022-07-02 15:20:45 -07:00
Ross Wightman	377e9bfa21	Add TPU trained darknet53 weights. Add mising pretrain_cfg for some csp/darknet models.	2022-07-02 15:18:52 -07:00
Ross Wightman	c170ba3173	Add weights for resnet10t, resnet14t, and resnetaa50 models. Fix #1314	2022-07-02 15:18:06 -07:00
Ross Wightman	188c194b0f	Left some experiment stem code in convnext by mistake	2022-07-02 15:17:28 -07:00
Ross Wightman	6064d16a2d	Add initial EdgeNeXt import. Significant cleanup / reorg (like ConvNeXt). Fix #1320 * edgenext refactored for torchscript compat, stage base organization * slight refactor of ConvNeXt to match some EdgeNeXt additions * remove use of funky LayerNorm layer in ConvNeXt and just use nn.LayerNorm and LayerNorm2d (permute)	2022-07-01 15:18:42 -07:00
Ross Wightman	7a9c6811c9	Add eps arg to LayerNorm2d, add 'tf' (tensorflow) variant of trunc_normal_ that applies scale/shift after sampling (instead of needing to move a/b)	2022-07-01 15:15:39 -07:00
Ross Wightman	82c311d082	Add more experimental darknet and 'cs2' darknet variants (different cross stage setup, closer to newer YOLO backbones) for train trials.	2022-07-01 15:14:01 -07:00
Ross Wightman	a050fde5cd	Add resnet10t (basic block) and resnet14t (bottleneck) with 1,1,1,1 repeats	2022-07-01 15:03:28 -07:00
Ross Wightman	e6d7df40ec	no longer a point using kwargs for pretrain_cfg resolve, just pass explicit arg	2022-06-24 21:36:23 -07:00
Ross Wightman	07d0c4ae96	Improve repr for DropPath module	2022-06-24 14:58:15 -07:00
Ross Wightman	e27c16b8a0	Remove unecessary code for synbn guard	2022-06-24 14:57:42 -07:00
Ross Wightman	0da3c9ebbf	Remove SiLU layer in default args that breaks import on old old PyTorch	2022-06-24 14:56:58 -07:00
Ross Wightman	7d657d2ef4	Improve resolve_pretrained_cfg behaviour when no cfg exists, warn instead of crash. Improve usability ex #1311	2022-06-24 14:55:25 -07:00
Ross Wightman	879df47c0a	Support BatchNormAct2d for sync-bn use. Fix #1254	2022-06-24 14:51:26 -07:00
Ross Wightman	4b30bae67b	Add updated vit_relpos weights, and impl w/ support for official swin-v2 differences for relpos. Add bias control support for MLP layers	2022-05-13 13:53:57 -07:00
Ross Wightman	d4c0588012	Remove persistent buffers from Swin-V2. Change SwinV2Cr cos attn + tau/logit_scale to match official, add ckpt convert, init_value zeros resid LN weight by default	2022-05-13 10:50:59 -07:00

... 6 7 8 9 10 ...

1066 Commits