2767 Commits

Author SHA1 Message Date
Ross Wightman
72858c193c Add siglip2 compatible naflex encoders. Add support to factorized pos embeds and 'aspect preserving mode' to Flex Embeds. Some more docstrings and typing. 2025-05-30 16:15:37 -07:00
Ross Wightman
b7ced7c40c torch.fx.wrap not working with older pytorch, trying register_notrace instead 2025-05-25 14:13:36 -07:00
Ross Wightman
842a786626 A few more maybe_add_mask situations 2025-05-25 08:51:56 -07:00
Ross Wightman
dd2c1418d0 Fix tracing of attention module with attn_mask support 2025-05-24 21:13:01 -07:00
Ross Wightman
162f49295e Move naflex global pool into one fn that can be marked notrace 2025-05-24 14:06:12 -07:00
Ross Wightman
2ad75e8023 Fix issue w/ MAP attention mask and no patch_valid 2025-05-24 12:37:39 -07:00
Ross Wightman
d7d3538335 Add so400m model size for test, few tweaks. 2025-05-23 18:29:30 -07:00
Ross Wightman
7bfe606d9f Merge remote-tracking branch 'origin/main' into naflex 2025-05-23 09:41:49 -07:00
Ross Wightman
e7925ea6d6 Add explicit disable of fc_norm to pe giant spatial variant 2025-05-21 07:31:36 -07:00
Ross Wightman
f001b15ed3 NaFlex random erasing performance improvements, python loops were slow. Remove subregion mode, not going to be worth it. 2025-05-20 17:03:46 -07:00
Ross Wightman
7624389fc9 Mixup cleanup, add prob support and train script integration. Add working loader based patch compatible RandomErasing for NaFlex mode. 2025-05-20 14:38:50 -07:00
Ross Wightman
edc37be1c3 Remove default 1000 classes from onnx export to use model default 2025-05-15 14:14:53 -07:00
Ross Wightman
f14f6507ab
Merge pull request #2487 from huggingface/eva_pe_integration
Add EVA ViT based PE (Perceptual Encoder) impl
2025-05-15 12:56:03 -07:00
Ross Wightman
cabd26d018 Fix missing device arg in train.py, fix #2488 2025-05-14 13:51:43 -07:00
Ross Wightman
6b302f27a3
Merge pull request #2480 from brianhou0208/tnt
Update TNT-(S/B) model weights and add feature extraction support
2025-05-14 12:27:21 -07:00
Ross Wightman
16d0b26e19 Fix torchscript issue with legacy tnt 2025-05-14 09:33:41 -07:00
Ross Wightman
74ad32a67e Updated tnt model weights on hub, add back legacy model in case bwd compat 2025-05-14 08:40:53 -07:00
Ryan
69b1fbcdc1 Merge branch 'main' into tnt 2025-05-11 22:45:48 +08:00
Ross Wightman
88b7ef6035 Disable dynamic_img_size default on PE models for now 2025-05-10 15:00:29 -07:00
Ross Wightman
8fcbceb609 Add a WIP NaFlex compatible mixup/cutmix for testing 2025-05-10 14:59:37 -07:00
Ross Wightman
fa7287faee Add EVA ViT based PE (Perceptual Encoder) impl 2025-05-10 13:13:12 -07:00
Ryan Ramos
81900a6bae remove download argument from torch_kwargs 2025-05-09 07:34:32 -07:00
atharva-pathak
59e59ff040 Update README.md
The previous link to Chris Hughes blog post was wrong, so updated the correct link
2025-05-08 08:43:18 -07:00
Ryan
d1140c1a0f fix pit & add to test 2025-05-08 07:17:19 -07:00
Ryan
2e9b2a76fb update some model 2025-05-08 07:17:19 -07:00
Ryan
99c25fa5c0 fix nest type error 2025-05-08 07:17:19 -07:00
Ryan
72b2a09106 fix final norm only apply at last indice 2025-05-08 07:17:19 -07:00
Ryan
e0ae4db8fc update FEAT_INTER_FILTERS list 2025-05-08 07:17:19 -07:00
Ryan
98d9cde8b4 support efficientformer_v2 2025-05-08 07:17:19 -07:00
Ryan
12def0d118 support efficientvit, edgenext, davit 2025-05-08 07:17:19 -07:00
Ryan
9aedecbb5f fix norm at last feat_idx 2025-05-08 07:17:19 -07:00
Ryan
e16d385592 support more models
inception_next, hgnet, gcvit, focalnet, inception_v4
2025-05-08 07:17:19 -07:00
Ryan
6b6beffa6b support mambaout, metaformer, nest, nextvit, pvt_v2 2025-05-08 07:17:19 -07:00
Ryan
2d3155908c fix tresnet and rdnet 2025-05-08 07:17:19 -07:00
Ryan
f8be741f0f support rexnet, resnetv2, repvit and repghostnet 2025-05-08 07:17:19 -07:00
Ryan
5e8cc616d4 support tiny_vit 2025-05-08 07:17:19 -07:00
Ryan
8befebd93c support vovnet 2025-05-08 07:17:19 -07:00
Ryan
411b892dbc support tresnet 2025-05-08 07:17:19 -07:00
Ross Wightman
fdcf7cf5c3 Check forward_intermediates features against forward_features output 2025-05-06 13:48:51 -07:00
Ryan
37bbac1256 Fix checkpoint_filter_fn 2025-05-03 15:41:18 +08:00
Ryan
fc0b6ad183 Fix default_cfgs 2025-05-03 08:40:07 +08:00
Ryan
848b8c3e57 Support features_only 2025-05-02 20:59:05 +08:00
Ryan
b37f0f7a76 Update tnt.py 2025-05-02 20:34:31 +08:00
Ross Wightman
e2073e32d0 Move NaFlexCollate with dataset, remove stand alone collate_fn and remove redundancy 2025-04-29 10:44:46 -07:00
Ross Wightman
39eb56f875 Starting to test distributed train, fix issue with batch_size reduce 2025-04-28 16:48:06 -07:00
Ross Wightman
ee27b73da4 Further pos embed tweaks, rejig model defs for testing 2025-04-28 09:15:11 -07:00
Ross Wightman
3dc90ed7a7 Add naflex loader support to validate.py, fix bug in naflex pos embed add, classic vit weight loading for naflex model 2025-04-25 16:00:54 -07:00
Yutong Xiang
c8c4f256b8
add-lightly-train-to-training-frameworks-readme (#2474) 2025-04-22 08:50:11 -07:00
Ross Wightman
c527c37969 Optimizations for pos embed resize, merge different mask helper fns 2025-04-21 14:05:18 -07:00
Xihua Dong
0c136f7dab fix img_size type 2025-04-18 21:05:05 -07:00