1778 Commits (f63a11cf81f8f96fcd9c89adc1462b7607e3efc7)

Author SHA1 Message Date
Ross Wightman 79abc25f55 Add ADOPT optimizer 2024-11-12 20:49:01 -08:00
Ross Wightman 36a45e5d94 Improve row/col dim var name 2024-11-12 20:49:01 -08:00
Ross Wightman e7b0480381 Cleanup original adafactor impl, add row/col dim heuristic that works with both conv and linear layers 2024-11-12 20:49:01 -08:00
Ross Wightman 1409ce2dbe Change eps defaults in adafactor_bv again after some checking 2024-11-12 20:49:01 -08:00
Ross Wightman 9d8ccd2ba7 A bit of lars/lamb cleanup, torch.where supports scalars properly now, make lamb grad clipping optional, clean it up a bit 2024-11-12 20:49:01 -08:00
Ross Wightman 7cfaeced67 Change adafactor_bv epsilon default 2024-11-12 20:49:01 -08:00
Ross Wightman 0b5ae49251 Remove adafactorbv numpy dep, hack fix for loading optimizer state w/ half prec momentum (need better one) 2024-11-12 20:49:01 -08:00
Ross Wightman 19090ea966 Need to init momentum with correct dtype 2024-11-12 20:49:01 -08:00
Ross Wightman 484a88f4b4 Remove unused beta2 fn, make eps grad^2 handling same across factorized and non-factorized cases 2024-11-12 20:49:01 -08:00
Ross Wightman 7c16adca83 An impl of adafactor as per big vision (scaling vit) changes 2024-11-12 20:49:01 -08:00
Ross Wightman 363b043c13 Extend train epoch schedule by warmup_epochs if warmup_prefix enabled, allows schedule to reach end w/ prefix enabled 2024-11-08 11:01:11 -08:00
Augustin Godinot 7f0c1b1f30 Add trust_remote_code argument to ReaderHfds 2024-11-08 08:16:36 -08:00
Wojtek Jasiński eb94efb218 fix pos embed dynamic resampling for eva 2024-11-06 16:03:27 -08:00
Wojtek Jasiński 3c7822c621 fix pos embed dynamic resampling for deit 2024-11-06 16:03:27 -08:00
Wojtek Jasiński 3ae3f44288 Fix positional embedding resampling for non-square inputs in ViT 2024-11-06 16:03:27 -08:00
Ross Wightman d4dde48dd5 Missed first_conv from resnet18d 2024-10-31 19:29:53 -07:00
Ross Wightman e6263bf64d Add resnet and resnet-v2 18/34 weights trained with mnv4 small based recipe 2024-10-31 16:39:35 -07:00
Ross Wightman f5b58e31a2 Allow non train mode for wds reader to operate w/o sample count, exhaust iterator 2024-10-31 16:39:35 -07:00
Ross Wightman f689c850b9 One more small c&p issue 2024-10-23 21:51:09 -07:00
Ross Wightman baa7242dd3 Fix c&p error, slight reformat 2024-10-23 21:51:09 -07:00
Ross Wightman 1b5cae681c Update some clip pretrained weights to point to new hub locations, add a few missing weights 2024-10-23 21:51:09 -07:00
Ross Wightman 310ffa32c5
Update version.py
dev version 1.0.12.dev0
2024-10-19 09:56:17 -07:00
Ross Wightman 015fbe457a Merge branch 'MengqingCao-npu_support' into device_amp_cleanup 2024-10-18 14:50:44 -07:00
Ross Wightman 81b59faf77 Merge branch 'npu_support' of github.com:MengqingCao/pytorch-image-models into MengqingCao-npu_support 2024-10-18 14:50:00 -07:00
Ross Wightman 1766a01f96 Cleanup some amp related behaviour to better support different (non-cuda) devices 2024-10-18 13:54:16 -07:00
MengqingCao 37c731ca37 fix device check 2024-10-17 12:38:02 +00:00
Feraidoon Mehri ca20e102fe mambaout.py: fixed bug 2024-10-17 01:03:28 +03:30
Ross Wightman 8cb2548962 Version 1.0.11 2024-10-16 14:14:44 -07:00
Ross Wightman 89dffc5ff0 Another small fix for original mambaout models, no classifier nn.Linear when num_classes=0 on init 2024-10-16 12:36:36 -07:00
Ross Wightman fad4538801 Elevate import deprecation warnings from DeprecationWarning to FutureWarning so messages are now seen 2024-10-16 11:30:01 -07:00
Ross Wightman a1f379e712 Add intern300m vit w/ converted timm weights. Fix #2300 2024-10-16 10:29:06 -07:00
MengqingCao 234f975787 add npu support 2024-10-16 07:13:45 +00:00
Ross Wightman 60f517c883 Fix wrong name in __all__ for models._registry 2024-10-15 07:39:46 -07:00
Ross Wightman b4a9a166c3 Version 1.0.10 2024-10-14 21:40:30 -07:00
Ross Wightman c3052fa19e
Merge pull request #2298 from huggingface/preact_resnet18
Add resnet18/18d pre-act model configs for potential training.
2024-10-14 19:39:04 -07:00
Ross Wightman abdf33145c Add 34/34d pre-act resnet variants 2024-10-14 13:23:50 -07:00
Ross Wightman c82ce86f8f Add 384x384 mambaout_base_plus model weights 2024-10-14 12:28:57 -07:00
Ross Wightman 82ae247879 MambaOut weights on hub, configs finalized 2024-10-11 11:07:40 -07:00
Ross Wightman 7efb60c299 Add first_conv for mambaout 2024-10-09 14:11:40 -07:00
Ross Wightman 5dc5ee5b42 Add global_pool to mambaout __init__ and pass to heads 2024-10-09 14:11:40 -07:00
Ross Wightman 9d1dfe8dbe Incorrectly named head_hidden_size 2024-10-09 14:11:40 -07:00
Ross Wightman 91e743f2dd Mambaout tweaks 2024-10-09 14:11:40 -07:00
Ross Wightman 4542cf03f9 Add features_only, other bits to mambaout, define different base alternatives 2024-10-09 14:11:40 -07:00
Ross Wightman c2da12c7e1 Update rw models, fix heads 2024-10-09 14:11:40 -07:00
Ross Wightman f2086f51a0 Add mambaout builder support, pretrained weight remap 2024-10-09 14:11:40 -07:00
Ross Wightman c6ef54eefa Initial mambaout work 2024-10-09 14:11:40 -07:00
Ross Wightman d9321b0e10 Add weights for fine-tuned siglip so400m. Add webli_i18n pretrained tags for the multi-lingual model variants (incl older base) 2024-10-09 09:04:44 -07:00
Ross Wightman 01b62264af Add i18n variant of so400m model w/ weights. Add two in1k fine-tunes of original so400m 384x384 but at 378x378 (better matches patch14) 2024-10-08 23:40:24 -07:00
Ross Wightman 72f0edb7e8 missed first_conv for rnv2 18d 2024-10-08 12:38:54 -07:00
Ross Wightman 3ed603a2ce Add resnet18/18d pre-act model configs for potential training. Fix #2289 2024-10-08 11:28:07 -07:00
Ross Wightman 41a79e0fcb Add overlapped stem convnext zepto weights 2024-10-08 11:26:34 -07:00
Ross Wightman 545bd4056c Tag along test_vit3 weights 2024-09-30 12:03:32 -07:00
Ross Wightman 69b687d4cc Add zepto weights 2024-09-30 11:43:23 -07:00
Ross Wightman c6e5557a5a Mismatch pretrained_cfg 2024-09-30 11:43:23 -07:00
Ross Wightman 5d7bd2973e convnext zepto, rmsnorm experiments 2024-09-30 11:43:23 -07:00
Ross Wightman e3242a5258
Merge pull request #2277 from huggingface/more_tiny_test_models
Adding some more tiny test models to train...
2024-09-22 10:28:29 -07:00
Ross Wightman c1cb5641c7 Add weights for mobilenetv4 small 0.5, change 0.25 -> 0.35 2024-09-22 10:27:01 -07:00
Ross Wightman a22ce0a329 Merge branch 'patch-1' of https://github.com/baorepo/pytorch-image-models into baorepo-patch-1 2024-09-22 10:14:35 -07:00
Ross Wightman 9067be6a30 Add weights for new tiny test models 2024-09-22 07:59:23 -07:00
Ross Wightman 65564f7da5 Fix reversed H & W padding for swin patch merging 2024-09-21 16:51:02 -07:00
Ross Wightman a2f539f055 Add a few more test model defs in prep for weight upload 2024-09-21 11:38:38 -07:00
Ross Wightman 6ab2af610d Adding some more tiny test models to train 2024-09-06 15:35:57 -07:00
alias pillar1989 d6b8816eda MobilenetV4: add two more lightweight models
Mobilenetv4 is very fast and ideal for embedded devices. However, for many low-cost, low-power embedded MCU devices, smaller models are required. Hopefully this PR will merge.
2024-09-05 02:34:11 +00:00
Ross Wightman f81cbdcca9
Merge pull request #2274 from huggingface/bulk_runner_tweaks
Better all res resolution for bulk runner
2024-09-03 12:11:56 -07:00
Ross Wightman 6c42299b3e Back to dev version 2024-09-03 10:29:05 -07:00
Ross Wightman a50713ce6e Fix #2272 2024-09-02 13:20:05 -07:00
Ross Wightman ebbe530ee4 Add MobileNetV3 RA4 (mnv4 recipe) weights 2024-09-02 13:10:34 -07:00
Ross Wightman fa4a1e597f Better all res resolution for bulk runner 2024-08-26 22:28:01 -07:00
Ross Wightman 072783331f Version 1.0.9 2024-08-23 11:47:42 -07:00
Ross Wightman 76b0e9931a Placeholder for new mnv3 model 2024-08-23 10:11:20 -07:00
Ross Wightman 39e92f0c0d mobilenet_edgetpu can use group_size override, more consistency in arg wrap/sadface w/ extra group_size arg 2024-08-22 11:44:02 -07:00
Ross Wightman b9f020a509 Allow group_size override for more efficientnet and mobilenetv3 based models 2024-08-21 16:51:38 -07:00
Ross Wightman 17923a66bb Add layer scale to hieradet 2024-08-21 11:23:39 -07:00
Ross Wightman 47e6958263 Add hierdet_small (non sam) model def 2024-08-21 11:05:54 -07:00
Ross Wightman 9fcbf39cdc Add remaining sbb vit betwixt/mediumd fine-tunes 2024-08-21 10:09:38 -07:00
Ross Wightman dc94cca0e5 Remaining Hiera sbb weights uploaded 2024-08-21 10:06:27 -07:00
Ross Wightman a256e50457 Move padding back in front of windowing 2024-08-17 11:22:53 -07:00
Ross Wightman 7d83749207 pool size test fixes 2024-08-17 08:27:13 -07:00
Ross Wightman 1bd92bca0e Add fused_attn flag to HieraDet attn block 2024-08-16 22:57:49 -07:00
Ross Wightman 691bb54443 Larger min input size needed 2024-08-16 17:09:19 -07:00
Ross Wightman de3a91a7a0 Add min_input_size of 128 for hieradet/sam2 2024-08-16 15:13:56 -07:00
Ross Wightman 0b05122cda Fixing hieradet (sam2) tests 2024-08-16 14:33:40 -07:00
Ross Wightman e035381171 Move padding out of windowing code for hieradet, fix torchscript typing issues, make pooling MaxPool unique instances across two modules 2024-08-16 13:36:33 -07:00
Ross Wightman 146c2fbe34 Add resnet50d and efficientnet_b1 ra4 (mnv4) hparam weights 2024-08-16 12:10:00 -07:00
Ross Wightman 962958723c More Hiera updates. Add forward_intermediates to hieradet/sam2 impl. Make both use same classifier module. Add coarse bool to intermediates. 2024-08-16 11:10:04 -07:00
Ross Wightman f2cfb4c677 Add WIP HieraDet impl (SAM2 backbone support) 2024-08-15 17:58:15 -07:00
Ross Wightman a50e53d41f Rename global pos embed for Hiera abswin, factor out commonly used vit weight init fns to layers. Add a channels-last ver of normmlp head. 2024-08-15 17:46:36 -07:00
Ross Wightman 2f3fed43b8 Fix hiera init with num_classes=0, fix weight tag names for sbb2 hiera/vit weights, add LayerScale/LayerScale2d to layers 2024-08-15 11:14:38 -07:00
Ross Wightman fee91fdd41 Update Hiera model for abswin, more stable weight init, layer-scale. ImageNet-12k weights for hiera_small_abswin, and two of the sbb vits with improved reg4 init. 2024-08-14 12:22:40 -07:00
Ross Wightman ac3470188b Add mobilenetv1_125 weights from ra4 / mnv4 hparam series 2024-08-14 08:47:02 -07:00
Ross Wightman a7b0bfc376 exception for unknown arg should have been TypeError 2024-08-12 16:09:36 -07:00
Ross Wightman bd0f79153b Add weights only flag to avoid warning, try to keep bwd compat. Default to True for remote load of pretrained weights, keep False for local checkpoint load to avoid training checkpoint breaks. Fix #2249 2024-08-12 11:41:41 -07:00
Ross Wightman 531215eded Bring changelog up to current, back to dev version, 1.0.9.dev0 2024-08-08 09:17:23 -07:00
Ross Wightman bf781bc3c2
Merge pull request #2245 from dhkim0225/feature/rdnet
Feature: Add RDNet
2024-08-07 16:19:22 -07:00
Ross Wightman 49f6e1da96 Remove eval to avoid safety concerns 2024-08-07 14:54:13 -07:00
Ross Wightman 2d5c9bf60d
Merge pull request #2253 from brendanartley/hgnet-grad-checkpointing
Add gradient checkpointing to hgnets
2024-08-07 12:45:14 -07:00
Ross Wightman e9ef9424f0 Add a few missing __all__ entries. 2024-08-07 09:35:51 -07:00
brendanartley 232a459e36 Added gradient checkpointing to hgnet 2024-08-07 09:22:51 -07:00
dong-hyun ea33d6a74d bugfix pool_size 2024-08-03 09:50:20 +09:00
Ross Wightman 10344625be proj_drop wasn't being called in sam vit 2024-08-02 16:12:02 -07:00
dong-hyun 6fdc9d8243 update code for ci pass 2024-08-02 15:14:40 +09:00
dong-hyun 7a866b6521 update code for torchscript 2024-08-02 09:58:13 +09:00
dong-hyun 8248122f82 add rdnet 2024-08-01 14:54:29 +09:00
dong-hyun 025259024d add rdnet 2024-08-01 14:51:15 +09:00
dong-hyun 225f4f92b3 add rdnet 2024-08-01 14:49:21 +09:00
Ross Wightman 4a10302754 Add mobilenet_edgetpu_v2_m weights 2024-07-28 17:19:36 -07:00
Ross Wightman 70182ea0cd Version 1.0.8 2024-07-26 17:05:36 -07:00
Ross Wightman ab8cb070fc Add xavier_uniform init of MNVC hybrid attention modules. Small improvement in training stability. 2024-07-26 17:03:40 -07:00
Ross Wightman cec70b6779
Merge pull request #2225 from huggingface/small_things
Small things
2024-07-25 20:29:13 -07:00
Ross Wightman 61df3fde89 Wrong hybrid_medium in12k pool sizes 2024-07-25 15:39:21 -07:00
Ross Wightman 9aa2930760 Add latest mobilenetv4 and baseline updates for mobilenetv1 and efficientnet_b0 weights 2024-07-25 14:20:54 -07:00
Ross Wightman 7b6a406474 remove swin debug prints 2024-07-24 21:05:56 -07:00
Ross Wightman 4c531be479 set_input_size(), always_partition, strict_img_size, dynamic mask option for all swin models. More flexibility in resolution, window resizing. 2024-07-24 16:41:31 -07:00
Ross Wightman 8efdc38213 Fix #2242 add checks for out indices with intermediate getter mode 2024-07-23 08:19:09 -07:00
Ross Wightman d2240745d3 Fix issue where feature out_indices out of order after wrapping with FeatureGetterNet due to use of set() 2024-07-22 13:33:30 -07:00
Ross Wightman 2b3f1a4633 Make channels for classic resnet configurable 2024-07-22 10:47:40 -07:00
Ross Wightman 9b2b8014e8 Add weights for test models 2024-07-22 10:08:57 -07:00
Ross Wightman 7e0caa1ba3 Padding helpers work if tuples/lists passed 2024-07-19 14:28:03 -07:00
Ross Wightman 2180800646 MQA query_strides bugs fix #2237. No padding for avg_pool2d if not 'same', use scale_factor for Upsample. 2024-07-19 14:26:54 -07:00
Ross Wightman 1a05ed29a1 Add to 'abswin' hiera models for train trials 2024-07-19 11:05:31 -07:00
Ross Wightman 0cbf4fa586 _orig_mod still causing issues even though I thought it was fixed in pytorch, add unwrap / clean helpers 2024-07-19 11:03:45 -07:00
Feraidoon Mehri 4cca568bd8
eva.py: fixed bug in applying attention mask
The mask should be applied before the softmax.
2024-07-19 15:12:04 +03:30
Ross Wightman 3a8a965891 Implement absolute+window pos embed for hiera, resizable but needs new weights 2024-07-18 21:43:37 -07:00
Ross Wightman 7160af4a24
Merge pull request #2229 from Promisery/reg_token
Initialize weights of reg_token for ViT
2024-07-18 09:25:29 -07:00
Ross Wightman 392b78aee7 set_input_size initial impl for vit & swin v1. Move HybridEmbed to own location in timm/layers 2024-07-17 15:25:48 -07:00
Ross Wightman 34c9fee554 Fix pass through of input / target keys to ImageDataset readers so args work with hfds instead of just hfids (iterable) 2024-07-17 10:11:46 -07:00
Promisery 417cf7f871 Initialize weights of reg_token for ViT 2024-07-13 11:11:42 +08:00
Ross Wightman f920119f3b Fixing tests 2024-07-09 14:53:20 -07:00
Ross Wightman 644abf9588 Fix default_cfg test for mobilenet_100 2024-07-09 12:52:24 -07:00
Ross Wightman d5afe106dc Merge remote-tracking branch 'origin/tiny_test_models' into small_things 2024-07-09 12:49:57 -07:00
Ross Wightman 55101028bb Rename test_tiny* -> test*. Fix ByobNet BasicBlock attn location and add test_byobnet model. 2024-07-09 11:53:11 -07:00
Ross Wightman 1334598462 Add support back to EfficientNet to disable head_conv / bn2 so mobilenetv1 can be implemented properly 2024-07-08 13:51:26 -07:00
Ross Wightman 800405d941 Add conv_large mobilenetv3 aa/blur model defs 2024-07-08 13:50:05 -07:00
Ross Wightman f81b094aaa Add 'qkv_bias_separate' flag for EVA/beit/swinv2 attn modules to allow an override for easy quantization wrappers. Fix #2098 2024-07-08 13:48:38 -07:00
Ross Wightman 83c2c2f0c5 Add 'Maybe' PIL / image tensor conversions in case image already in tensor format 2024-07-08 13:43:51 -07:00
Steffen Schneider c01a47c9e7 Fix typo in type annotations in timm.models.hrnet 2024-07-08 00:53:16 +02:00
Daniel Suess 197c10463b Fix jit.script breaking with features_fx 2024-06-28 03:58:51 +00:00
Ross Wightman b751da692d Add latest ix (xavier init for mqa) hybrid medium & large weights for MobileNetV4 2024-06-24 13:49:55 -07:00
Ross Wightman d4d4d84fda Dev version 1.0.8.dev0 2024-06-24 11:34:13 -07:00
Ross Wightman f8342a045a
Merge pull request #2213 from huggingface/florence2
Fix #2212 map florence2 image tower to davit with a few changes
2024-06-24 11:01:08 -07:00
Sejik c33a001397 Fix typo 2024-06-24 21:54:38 +09:00
Ross Wightman 02d0f27721 cleanup davit padding 2024-06-22 12:06:46 -07:00
Ross Wightman c715c724e7 Fix tracing by removing float cast, should end up float anyways 2024-06-22 08:35:30 -07:00
Ross Wightman fb58a73033 Fix #2212 map florence2 image tower to davit with a few changes 2024-06-21 15:31:29 -07:00
Ross Wightman b28945ff05 Version 1.0.7, prep for release 2024-06-18 16:19:43 -07:00
Ross Wightman fb13e6385e
Merge pull request #2203 from huggingface/more_mobile
Add mobilenet edgetpu defs for exp, add ol mobilenet v1 back for comp…
2024-06-18 15:20:01 -07:00
Ross Wightman 16e082e1c2 Add mobilenetv4 hybrid-large weights 2024-06-17 11:08:31 -07:00
Ross Wightman e41125cc83
Merge pull request #2209 from huggingface/fcossio-vit-maxpool
ViT pooling refactor
2024-06-17 07:51:12 -07:00
Ross Wightman a22466852d Add 2400 epoch mobilenetv4 small weights, almost at paper, rounds to 73.8 2024-06-16 10:51:00 -07:00
Ross Wightman b1a6f4a946 Some missed reset_classifier() type annotations 2024-06-16 10:39:27 -07:00