Xihua Dong
|
0c136f7dab
|
fix img_size type
|
2025-04-18 21:05:05 -07:00 |
Ross Wightman
|
3ff3899026
|
Add local-dir: schema support for model loading (config + weights) from folder
|
2025-04-17 12:32:19 -07:00 |
Ross Wightman
|
ceca5efdec
|
Remove torch_out from onnx export, no point without the export_ fn
|
2025-04-15 14:01:17 -07:00 |
Ross Wightman
|
0cae8a4cd8
|
Fix #2472, torch.onnx.export_ (with return output) finally removed :(
|
2025-04-15 14:01:17 -07:00 |
Ross Wightman
|
681be882e8
|
Fix arg merging of sknet, old seresnet. Fix #2470
|
2025-04-14 10:32:26 -07:00 |
Ross Wightman
|
98e9651952
|
Update version.py
Version 1.0.15, prep for a release
|
2025-02-22 10:50:21 -08:00 |
Adam J. Stewart
|
92682d8d4d
|
timm.models: explicitly export attributes
|
2025-02-21 14:19:39 -08:00 |
Ross Wightman
|
a667d3d8f0
|
siglip2 weights on hub, fix forward_intermediates when no prefix tokens (& return prefix selected)
|
2025-02-21 13:10:51 -08:00 |
Ross Wightman
|
f63a11cf81
|
Remove duplicate so400m/16 @ 256 model def
|
2025-02-21 13:10:51 -08:00 |
Ross Wightman
|
9758e0b8b0
|
Prep for siglip2 release
|
2025-02-21 13:10:51 -08:00 |
Adam J. Stewart
|
c68d724e9c
|
adapt_input_conv: add type hints
|
2025-02-21 12:28:22 -08:00 |
Ross Wightman
|
105a667baa
|
Dev version 1.0.15.dev0
|
2025-02-17 15:50:12 -08:00 |
Ross Wightman
|
7234f5c6c5
|
Add 448 so150m2 weight/model, add updated internvit 300m weight
|
2025-02-17 12:59:10 -08:00 |
Ross Wightman
|
9ce824c39a
|
Add vit so150m2 weights
|
2025-02-14 15:55:51 -08:00 |
Ross Wightman
|
490d222dd8
|
Fix issue taking device from V before V exists
|
2025-01-31 12:52:47 -08:00 |
Lucas Nestler
|
e025328f96
|
simplify RNG
|
2025-01-31 17:26:14 +01:00 |
Lucas Nestler
|
6367267298
|
unify RNG
|
2025-01-31 17:23:53 +01:00 |
Ross Wightman
|
872978ccfe
|
Fix comment, add 'stochastic weight decay' idea because why not
|
2025-01-30 18:22:36 -08:00 |
Ross Wightman
|
510bbd5389
|
Change start/end args
|
2025-01-30 18:22:36 -08:00 |
Ross Wightman
|
31831f5948
|
Change flattening behaviour in Kron
|
2025-01-30 18:22:36 -08:00 |
Ross Wightman
|
b3a83b81d6
|
Prep Kron for merge, add detail to attributions note, README.
|
2025-01-27 21:02:26 -08:00 |
Ross Wightman
|
67ef6f0a92
|
Move opt_einsum import back out of class __init__
|
2025-01-27 21:02:26 -08:00 |
Ross Wightman
|
9ab5464e4d
|
More additions to Kron
|
2025-01-27 21:02:26 -08:00 |
Ross Wightman
|
5f10450235
|
Some more kron work. Figured out why some tests fail, implemented a deterministic rng state load but too slow so skipping some tests for now.
|
2025-01-27 21:02:26 -08:00 |
Ross Wightman
|
cd21e80d03
|
Fiddling with Kron (PSGD)
|
2025-01-27 21:02:26 -08:00 |
Adam J. Stewart
|
d81da93c16
|
Use import alias
|
2025-01-22 10:27:17 -08:00 |
Adam J. Stewart
|
4de1abf837
|
timm: add __all__ to __init__
|
2025-01-22 10:27:17 -08:00 |
Ryan
|
17eabaad17
|
Fix RDNet forward call
|
2025-01-21 11:52:05 -08:00 |
Ryan
|
80a4877376
|
Fix self.reset_classifier num_classes update
|
2025-01-21 11:52:05 -08:00 |
Collin McCarthy
|
84631cb5c6
|
Add missing training flag to convert_sync_batchnorm
|
2025-01-21 11:51:55 -08:00 |
Ross Wightman
|
5d535d7a2d
|
Version 1.0.14, update README & changelog
|
2025-01-19 13:53:09 -08:00 |
Ross Wightman
|
aa333079da
|
Tweak so150m2 def
|
2025-01-19 13:40:53 -08:00 |
Josua Rieder
|
8d81fdf3d9
|
Fix typos
|
2025-01-19 13:39:40 -08:00 |
Ross Wightman
|
3677f67902
|
Add the 256x256 in1k ft of the so150m, add an alternate so150m def
|
2025-01-18 15:51:57 -08:00 |
Ross Wightman
|
2a84d68d02
|
Add some so150m vit w/ sbb recipe weights, and a ese_vovnet57b model with RA4 recipe
|
2025-01-18 15:51:57 -08:00 |
Ross Wightman
|
9265d54a3a
|
LeViT safetensors load is broken by conversion code that wasn't deactivated
|
2025-01-16 11:37:00 -08:00 |
Ross Wightman
|
21e75a9d25
|
Update version.py
Back to dev version
|
2025-01-16 11:23:17 -08:00 |
Adam J. Stewart
|
6d21eb0d37
|
VGG ConvMlp: fix layer defaults/types
|
2025-01-15 12:11:56 +01:00 |
Adam J. Stewart
|
f5c4d5cbb7
|
Add missing imports
|
2025-01-11 15:13:16 +01:00 |
Adam J. Stewart
|
19aaea3c8f
|
Fix nn.Module type hints
|
2025-01-11 15:09:21 +01:00 |
Ross Wightman
|
47811bc05a
|
Update README, bump version to 1.0.13 non-dev
|
2025-01-09 09:33:59 -08:00 |
Ross Wightman
|
deb9895600
|
Update checkpoint save to fix old hard-link + fuse issue I ran into again... fix #340
|
2025-01-08 15:36:58 -08:00 |
Ross Wightman
|
92f610c982
|
Add half-precision (bfloat16, float16) support to train & validate scripts. Should push dtype handling into model factory / pretrained load at some point...
|
2025-01-07 10:25:14 -08:00 |
Ross Wightman
|
155f6e7fea
|
Update README, few minor fixups.
|
2025-01-06 13:09:15 -08:00 |
Ross Wightman
|
2b251fb291
|
Wrap torch checkpoint() fn to default use_reentrant flag to False and allow env var override
|
2025-01-06 11:28:39 -08:00 |
Ross Wightman
|
131518c15c
|
Add comments to MLP layers re expected layouts
|
2025-01-02 09:41:35 -08:00 |
Louis Lac
|
2d5277e858
|
Merge branch 'main' into fix-mqa-v2
|
2025-01-02 00:11:22 +01:00 |
Louis Lac
|
2d734d9058
|
Fixed unfused attn2d scale
|
2025-01-01 12:34:07 -08:00 |
Louis Lac
|
6171e756d3
|
Fix MQA V2 scale and out shape
|
2025-01-01 15:37:28 +01:00 |
Ross Wightman
|
e846b2cf28
|
Add 384x384 in12k pretrain and finetune for convnext_nano
|
2024-12-31 13:16:43 -08:00 |