2665 Commits (2a84d68d02bc1e84eeec6ce3a0e8febc9e3d7e35)
 

Author SHA1 Message Date
Ross Wightman 2a84d68d02 Add some so150m vit w/ sbb recipe weights, and a ese_vovnet57b model with RA4 recipe 2025-01-18 15:51:57 -08:00
Ross Wightman 9265d54a3a LeViT safetensors load is broken by conversion code that wasn't deactivated 2025-01-16 11:37:00 -08:00
Ross Wightman 21e75a9d25 Update version.py; Back to dev version 2025-01-16 11:23:17 -08:00
Ross Wightman c96e9e7ce0 Merge pull request #2408 from JosuaRieder/no_console_results; Implement --no-console-results in inference.py 2025-01-15 08:11:31 -08:00
Ross Wightman 63b2de7a6c Merge pull request #2409 from adamjstewart/models/vgg; VGG ConvMlp: fix layer defaults/types 2025-01-15 07:37:26 -08:00
Adam J. Stewart 6d21eb0d37 VGG ConvMlp: fix layer defaults/types 2025-01-15 12:11:56 +01:00
Josua Rieder 47790637f2 Implement --no-console-results in inference.py 2025-01-15 11:37:11 +01:00
Ross Wightman ef7dec8e43 Merge pull request #2406 from JosuaRieder/fix_latex; fix incorrect LaTeX formulas 2025-01-14 11:03:15 -08:00
Ross Wightman 1572769059 Merge pull request #2402 from JosuaRieder/fix_inference_csv_export; disable abbreviating csv inference output with ellipses 2025-01-14 11:01:48 -08:00
Ross Wightman fc0609bcb6 Add --model-dtype (pure bfloat16/float16) support to inference.py 2025-01-14 11:00:16 -08:00
Ross Wightman 53c3c89a86 Merge pull request #2403 from JosuaRieder/efficientnet_typo; fix typo in EfficientNet docs 2025-01-14 10:57:46 -08:00
Ross Wightman 5e0df16fc0 Merge pull request #2404 from JosuaRieder/fix_training_scripts_doc_link; fix 'timm recipe scripts' link 2025-01-14 10:57:25 -08:00
Ross Wightman 7fa37c1f26 Merge pull request #2405 from JosuaRieder/add_missing_paper_title; Add missing paper title 2025-01-14 10:57:02 -08:00
Josua Rieder adf9efeac7 fix incorrect LaTeX formulas 2025-01-14 19:17:52 +01:00
Josua Rieder 1dab96c637 fix small copy paste mistake 2025-01-14 18:58:54 +01:00
Josua Rieder cac1899ba1 add missing paper title DLA: Deep Layer Aggregation 2025-01-14 18:56:23 +01:00
Josua Rieder 676c734796 fix 'timm recipe scripts' link 2025-01-14 18:49:45 +01:00
Josua Rieder 17a1abfc0d fix typo in EfficientNet docs 2025-01-14 18:36:51 +01:00
Josua Rieder 8ce197e33a disable abbreviating csv inference output with ellipses 2025-01-14 17:39:48 +01:00
Ross Wightman ff77dfa825 Merge pull request #2400 from adamjstewart/types/nn-module; Fix nn.Module type hints 2025-01-14 08:23:57 -08:00
Adam J. Stewart f5c4d5cbb7 Add missing imports 2025-01-11 15:13:16 +01:00
Adam J. Stewart 19aaea3c8f Fix nn.Module type hints 2025-01-11 15:09:21 +01:00
Ross Wightman 47811bc05a Update README, bump version to 1.0.13 non-dev 2025-01-09 09:33:59 -08:00
Ross Wightman eeee38e972 Avoid unecessary compat break btw train script and nearby timm versions w/ dtype addition. 2025-01-08 21:10:15 -08:00
Ross Wightman deb9895600 Update checkpoint save to fix old hard-link + fuse issue I ran into again... fix #340 2025-01-08 15:36:58 -08:00
Ross Wightman c4fb98f399 Merge pull request #2398 from huggingface/caojiaolong-main; Merging wandb project name chages w/ addition 2025-01-08 10:15:09 -08:00
Ross Wightman c173886e75 Merge branch 'main' into caojiaolong-main 2025-01-08 09:11:50 -08:00
Ross Wightman 2d0ac6f567 Merge pull request #2397 from huggingface/half_prec_trainval; Add half-precision (bfloat16, float16) support to train & validate scripts 2025-01-07 11:48:02 -08:00
Ross Wightman 1969528296 Fix dtype log when default (None) is used w/o AMP 2025-01-07 11:47:22 -08:00
Ross Wightman 92f610c982 Add half-precision (bfloat16, float16) support to train & validate scripts. Should push dtype handling into model factory / pretrained load at some point... 2025-01-07 10:25:14 -08:00
Jiao-Long Cao 40c19f3939 Add wandb project name argument and allow change wandb run name 2025-01-07 16:43:34 +08:00
Ross Wightman 6f80214e80 Merge pull request #2394 from huggingface/non_reentrant_ckpt; Wrap torch checkpoint() fn to default use_reentrant flag to False and allow env var override 2025-01-06 14:44:06 -08:00
Ross Wightman 155f6e7fea Update README, few minor fixups. 2025-01-06 13:09:15 -08:00
Ross Wightman 2b251fb291 Wrap torch checkpoint() fn to default use_reentrant flag to False and allow env var override 2025-01-06 11:28:39 -08:00
Ross Wightman 131518c15c Add comments to MLP layers re expected layouts 2025-01-02 09:41:35 -08:00
Ross Wightman d23facd697 Merge pull request #2388 from laclouis5/fix-mqa-v2; Fix MQA V2 2025-01-02 07:48:35 -08:00
Louis Lac 2d5277e858 Merge branch 'main' into fix-mqa-v2 2025-01-02 00:11:22 +01:00
Louis Lac 2d734d9058 Fixed unfused attn2d scale 2025-01-01 12:34:07 -08:00
Louis Lac 6171e756d3 Fix MQA V2 scale and out shape 2025-01-01 15:37:28 +01:00
Ross Wightman 851e0746a9 Update README.md 2024-12-31 14:12:16 -08:00
Ross Wightman e846b2cf28 Add 384x384 in12k pretrain and finetune for convnext_nano 2024-12-31 13:16:43 -08:00
Ross Wightman dafe866047 Update README.md 2024-12-31 10:19:43 -08:00
Ross Wightman 52595a9641 Update README.md 2024-12-31 10:10:52 -08:00
Ruida Zeng 1245b83924 fix: minor typos in UPGRADING 2024-12-31 09:26:13 -08:00
Ruida Zeng 8fd2f48b65 fix: minor typos in README 2024-12-31 09:26:13 -08:00
Ross Wightman b0068ba5d0 Switch hf hub entries for new aimv2 / dfn weights to point to timm locations. Undo forced device for SDR linspace, part of another change. 2024-12-30 19:24:21 -08:00
Ross Wightman cc7fd34015 test filter tweaks 2024-12-30 19:24:21 -08:00
Ross Wightman 1bf84b35c3 Update tests for aimv2 filtering 2024-12-30 19:24:21 -08:00
Ross Wightman b33418713a Add (almost) full set of aimv2 model instances. Switch back to unpacked SwiGLU. Verify correctness. Add DFN L/14 39B weight. 2024-12-30 19:24:21 -08:00
Ross Wightman de35fd87f5 Add SimpleNorm to create_norm factory 2024-12-30 19:24:21 -08:00