2719 Commits

Author SHA1 Message Date
Ross Wightman
8fcbceb609 Add a WIP NaFlex compatible mixup/cutmix for testing 2025-05-10 14:59:37 -07:00
Ross Wightman
e2073e32d0 Move NaFlexCollate with dataset, remove stand alone collate_fn and remove redundancy 2025-04-29 10:44:46 -07:00
Ross Wightman
39eb56f875 Starting to test distributed train, fix issue with batch_size reduce 2025-04-28 16:48:06 -07:00
Ross Wightman
ee27b73da4 Further pos embed tweaks, rejig model defs for testing 2025-04-28 09:15:11 -07:00
Ross Wightman
3dc90ed7a7 Add naflex loader support to validate.py, fix bug in naflex pos embed add, classic vit weight loading for naflex model 2025-04-25 16:00:54 -07:00
Ross Wightman
c527c37969 Optimizations for pos embed resize, merge different mask helper fns 2025-04-21 14:05:18 -07:00
Ross Wightman
ea728f67fa Improve several typing issues for flex vit, can (almost) work with jit if we bash h,w key into an int or str 2025-04-14 11:01:56 -07:00
Ross Wightman
97341fec51 A much faster resample_patch_embed, can be used at train/validation time 2025-04-10 15:58:24 -07:00
Ross Wightman
b4bb0f452a Exclude embeds module and mask attn functions from tracing 2025-04-09 15:34:15 -07:00
Ross Wightman
13e0f3a4a3 Add loss scale arg, initial distributed loss scale. Maybe fix FX for the model. 2025-04-08 20:47:57 -07:00
Ross Wightman
6675590264 Fix ParallelThingsBlock w/ attn_mask 2025-04-08 09:35:34 -07:00
Ross Wightman
9b23d6dea2 Exclude naflex models from jit tests 2025-04-08 07:59:19 -07:00
Ross Wightman
825edccf19 Type fixes, remove old comments 2025-04-07 21:35:03 -07:00
Ross Wightman
0893f5d296 Initial NaFlex ViT model and training support 2025-04-07 21:27:10 -07:00
Ross Wightman
e44f14d7d2 Update README v1.0.15 2025-02-22 21:04:13 -08:00
Ross Wightman
98e9651952
Update version.py
Version 1.0.15, prep for a release
2025-02-22 10:50:21 -08:00
Ross Wightman
e76ea5474d
Update README.md 2025-02-21 16:09:42 -08:00
Adam J. Stewart
92682d8d4d timm.models: explicitly export attributes 2025-02-21 14:19:39 -08:00
Ross Wightman
a667d3d8f0 siglip2 weights on hub, fix forward_intermediates when no prefix tokens (& return prefix selected) 2025-02-21 13:10:51 -08:00
Ross Wightman
f63a11cf81 Remove duplicate so400m/16 @ 256 model def 2025-02-21 13:10:51 -08:00
Ross Wightman
9758e0b8b0 Prep for siglip2 release 2025-02-21 13:10:51 -08:00
Adam J. Stewart
c68d724e9c adapt_input_conv: add type hints 2025-02-21 12:28:22 -08:00
Ross Wightman
105a667baa Dev version 1.0.15.dev0 2025-02-17 15:50:12 -08:00
Ross Wightman
7234f5c6c5 Add 448 so150m2 weight/model, add updated internvit 300m weight 2025-02-17 12:59:10 -08:00
Ross Wightman
9ce824c39a Add vit so150m2 weights 2025-02-14 15:55:51 -08:00
Ross Wightman
a49b020eff Merge branch 'ClashLuke-patch-1' 2025-01-31 12:53:29 -08:00
Ross Wightman
490d222dd8 Fix issue taking device from V before V exists 2025-01-31 12:52:47 -08:00
Ross Wightman
875c19d0c9 Merge branch 'patch-1' of github.com:ClashLuke/pytorch-image-models into ClashLuke-patch-1 2025-01-31 12:43:28 -08:00
Ross Wightman
8b3c07a841
Update README.md 2025-01-31 10:37:32 -08:00
Lucas Nestler
e025328f96
simplify RNG 2025-01-31 17:26:14 +01:00
Lucas Nestler
6367267298
unify RNG 2025-01-31 17:23:53 +01:00
Ross Wightman
872978ccfe Fix comment, add 'stochastic weight decay' idea because why not 2025-01-30 18:22:36 -08:00
Ross Wightman
510bbd5389 Change start/end args 2025-01-30 18:22:36 -08:00
Ross Wightman
31831f5948 Change flattening behaviour in Kron 2025-01-30 18:22:36 -08:00
Ross Wightman
cdbafd9057 Try to force numpy<2.0 for torch 1.13 tests, update newest tested torch to 2.5.1 2025-01-28 20:56:30 -08:00
Ross Wightman
b1752eefb5 Fix missing model key in bulk validate results on error 2025-01-28 13:20:40 -08:00
Ross Wightman
b3a83b81d6 Prep Kron for merge, add detail to attributions note, README. 2025-01-27 21:02:26 -08:00
Ross Wightman
67ef6f0a92 Move opt_einsum import back out of class __init__ 2025-01-27 21:02:26 -08:00
Ross Wightman
9ab5464e4d More additions to Kron 2025-01-27 21:02:26 -08:00
Ross Wightman
5f10450235 Some more kron work. Figured out why some tests fail, implemented a deterministic rng state load but too slow so skipping some tests for now. 2025-01-27 21:02:26 -08:00
Ross Wightman
cd21e80d03 Fiddling with Kron (PSGD) 2025-01-27 21:02:26 -08:00
Adam J. Stewart
d81da93c16 Use import alias 2025-01-22 10:27:17 -08:00
Adam J. Stewart
4de1abf837 timm: add __all__ to __init__ 2025-01-22 10:27:17 -08:00
Ryan
bda46f8e6f Add num_classes assertion after reset_classifier 2025-01-21 11:52:05 -08:00
Ryan
17eabaad17 Fix RDNet forward call 2025-01-21 11:52:05 -08:00
Ryan
80a4877376 Fix self.reset_classifier num_classes update 2025-01-21 11:52:05 -08:00
Collin McCarthy
84631cb5c6 Add missing training flag to convert_sync_batchnorm 2025-01-21 11:51:55 -08:00
Josua Rieder
cb4cea561a add arguments to the respective argument groups 2025-01-20 10:54:35 -08:00
Josua Rieder
634b68ae50 Fix metavar for --input-size 2025-01-20 10:53:46 -08:00
Ross Wightman
5d535d7a2d Version 1.0.14, update README & changelog v1.0.14 2025-01-19 13:53:09 -08:00