1786 Commits (c8c4f256b8c279575ed87012f38620a4cd25df57)

Author SHA1 Message Date
Xihua Dong 0c136f7dab fix img_size type 2025-04-18 21:05:05 -07:00
Ross Wightman 3ff3899026 Add local-dir: schema support for model loading (config + weights) from folder 2025-04-17 12:32:19 -07:00
Ross Wightman ceca5efdec Remove torch_out from onnx export, no point without the export_ fn 2025-04-15 14:01:17 -07:00
Ross Wightman 0cae8a4cd8 Fix #2472, torch.onnx.export_ (with return output) finally removed :( 2025-04-15 14:01:17 -07:00
Ross Wightman 681be882e8 Fix arg merging of sknet, old seresnet. Fix #2470 2025-04-14 10:32:26 -07:00
Ross Wightman 98e9651952 Update version.py: Version 1.0.15, prep for a release 2025-02-22 10:50:21 -08:00
Adam J. Stewart 92682d8d4d timm.models: explicitly export attributes 2025-02-21 14:19:39 -08:00
Ross Wightman a667d3d8f0 siglip2 weights on hub, fix forward_intermediates when no prefix tokens (& return prefix selected) 2025-02-21 13:10:51 -08:00
Ross Wightman f63a11cf81 Remove duplicate so400m/16 @ 256 model def 2025-02-21 13:10:51 -08:00
Ross Wightman 9758e0b8b0 Prep for siglip2 release 2025-02-21 13:10:51 -08:00
Adam J. Stewart c68d724e9c adapt_input_conv: add type hints 2025-02-21 12:28:22 -08:00
Ross Wightman 105a667baa Dev version 1.0.15.dev0 2025-02-17 15:50:12 -08:00
Ross Wightman 7234f5c6c5 Add 448 so150m2 weight/model, add updated internvit 300m weight 2025-02-17 12:59:10 -08:00
Ross Wightman 9ce824c39a Add vit so150m2 weights 2025-02-14 15:55:51 -08:00
Ross Wightman 490d222dd8 Fix issue taking device from V before V exists 2025-01-31 12:52:47 -08:00
Lucas Nestler e025328f96 simplify RNG 2025-01-31 17:26:14 +01:00
Lucas Nestler 6367267298 unify RNG 2025-01-31 17:23:53 +01:00
Ross Wightman 872978ccfe Fix comment, add 'stochastic weight decay' idea because why not 2025-01-30 18:22:36 -08:00
Ross Wightman 510bbd5389 Change start/end args 2025-01-30 18:22:36 -08:00
Ross Wightman 31831f5948 Change flattening behaviour in Kron 2025-01-30 18:22:36 -08:00
Ross Wightman b3a83b81d6 Prep Kron for merge, add detail to attributions note, README. 2025-01-27 21:02:26 -08:00
Ross Wightman 67ef6f0a92 Move opt_einsum import back out of class __init__ 2025-01-27 21:02:26 -08:00
Ross Wightman 9ab5464e4d More additions to Kron 2025-01-27 21:02:26 -08:00
Ross Wightman 5f10450235 Some more kron work. Figured out why some tests fail, implemented a deterministic rng state load but too slow so skipping some tests for now. 2025-01-27 21:02:26 -08:00
Ross Wightman cd21e80d03 Fiddling with Kron (PSGD) 2025-01-27 21:02:26 -08:00
Adam J. Stewart d81da93c16 Use import alias 2025-01-22 10:27:17 -08:00
Adam J. Stewart 4de1abf837 timm: add __all__ to __init__ 2025-01-22 10:27:17 -08:00
Ryan 17eabaad17 Fix RDNet forward call 2025-01-21 11:52:05 -08:00
Ryan 80a4877376 Fix self.reset_classifier num_classes update 2025-01-21 11:52:05 -08:00
Collin McCarthy 84631cb5c6 Add missing training flag to convert_sync_batchnorm 2025-01-21 11:51:55 -08:00
Ross Wightman 5d535d7a2d Version 1.0.14, update README & changelog 2025-01-19 13:53:09 -08:00
Ross Wightman aa333079da Tweak so150m2 def 2025-01-19 13:40:53 -08:00
Josua Rieder 8d81fdf3d9 Fix typos 2025-01-19 13:39:40 -08:00
Ross Wightman 3677f67902 Add the 256x256 in1k ft of the so150m, add an alternate so150m def 2025-01-18 15:51:57 -08:00
Ross Wightman 2a84d68d02 Add some so150m vit w/ sbb recipe weights, and a ese_vovnet57b model with RA4 recipe 2025-01-18 15:51:57 -08:00
Ross Wightman 9265d54a3a LeViT safetensors load is broken by conversion code that wasn't deactivated 2025-01-16 11:37:00 -08:00
Ross Wightman 21e75a9d25 Update version.py: Back to dev version 2025-01-16 11:23:17 -08:00
Adam J. Stewart 6d21eb0d37 VGG ConvMlp: fix layer defaults/types 2025-01-15 12:11:56 +01:00
Adam J. Stewart f5c4d5cbb7 Add missing imports 2025-01-11 15:13:16 +01:00
Adam J. Stewart 19aaea3c8f Fix nn.Module type hints 2025-01-11 15:09:21 +01:00
Ross Wightman 47811bc05a Update README, bump version to 1.0.13 non-dev 2025-01-09 09:33:59 -08:00
Ross Wightman deb9895600 Update checkpoint save to fix old hard-link + fuse issue I ran into again... fix #340 2025-01-08 15:36:58 -08:00
Ross Wightman 92f610c982 Add half-precision (bfloat16, float16) support to train & validate scripts. Should push dtype handling into model factory / pretrained load at some point... 2025-01-07 10:25:14 -08:00
Ross Wightman 155f6e7fea Update README, few minor fixups. 2025-01-06 13:09:15 -08:00
Ross Wightman 2b251fb291 Wrap torch checkpoint() fn to default use_reentrant flag to False and allow env var override 2025-01-06 11:28:39 -08:00
Ross Wightman 131518c15c Add comments to MLP layers re expected layouts 2025-01-02 09:41:35 -08:00
Louis Lac 2d5277e858 Merge branch 'main' into fix-mqa-v2 2025-01-02 00:11:22 +01:00
Louis Lac 2d734d9058 Fixed unfused attn2d scale 2025-01-01 12:34:07 -08:00
Louis Lac 6171e756d3 Fix MQA V2 scale and out shape 2025-01-01 15:37:28 +01:00
Ross Wightman e846b2cf28 Add 384x384 in12k pretrain and finetune for convnext_nano 2024-12-31 13:16:43 -08:00