Ross Wightman
ac1b08deb6
fix_init on vit & relpos vit
2024-02-10 20:15:37 -08:00
Ross Wightman
935950cc11
Fix F.sdpa attn drop prob
2024-02-10 20:14:47 -08:00
Ross Wightman
0737cf231d
Add Next-ViT
2024-02-10 17:05:16 -08:00
Ross Wightman
d6c2cc91af
Make NormMlpClassifier head reset args consistent with ClassifierHead
2024-02-10 16:25:33 -08:00
Ross Wightman
87fec3dc14
Update experimental vit model configs
2024-02-10 16:05:58 -08:00
Ross Wightman
7d3c2dc993
Add group_matcher for DaViT
2024-02-10 14:58:45 -08:00
Ross Wightman
7bc7798d0e
Type annotation correctness for create_act
2024-02-10 14:57:58 -08:00
Ross Wightman
7d121ac2ef
Small tweak of timm ToTensor for clarity
2024-02-10 14:57:40 -08:00
Ross Wightman
5a58f4d3dc
Remove test MESA support, no signal that it's helpful so far
2024-02-10 14:38:01 -08:00
Ross Wightman
c7ac37693d
Add device arg to validate() calls in train.py
2024-02-04 10:14:57 -08:00
Ross Wightman
a08b57e801
Fix distributed flag bug w/ flex device handling
2024-02-03 16:26:15 -08:00
Ross Wightman
bee0471f91
forward() pass through for ema model, flag for ema warmup, comment about warmup
2024-02-03 16:24:45 -08:00
Ross Wightman
5e4a4b2adc
Merge branch 'device_flex' into mesa_ema
2024-02-02 09:45:30 -08:00
Ross Wightman
dd84ef2cd5
ModelEmaV3 and MESA experiments
2024-02-02 09:45:04 -08:00
Ross Wightman
d0ff315eed
Merge remote-tracking branch 'emav3/faster_ema' into mesa_ema
2024-01-27 14:52:10 -08:00
Ross Wightman
88889de923
Fix meshgrid deprecation warnings and backward compat with explicit 'ndgrid' and 'meshgrid' fn w/o indexing arg
2024-01-27 13:48:33 -08:00
Ross Wightman
fa247fd9ba
Update README.md
2024-01-27 10:54:10 -08:00
Ross Wightman
dea2dd5d25
Update README.md
...
Update optimizer list and references
2024-01-27 10:46:41 -08:00
Ross Wightman
2633fd4c81
Update changes.md
2024-01-27 10:31:12 -08:00
Ross Wightman
ef5485d609
Update README.md
...
Clear history to start of 2023
2024-01-27 10:30:47 -08:00
Ross Wightman
d4386219c6
Improve type handling for arange & rel pos embeds, keep calculations in float32 until application (may change to apply in float32 in future). Prevent arange type hijacking by DeepSpeed Zero
2024-01-26 16:35:51 -08:00
Ross Wightman
3234daf783
Add missing deprecation mapping for a densenet and xcit model. Fix #2086 . Tweak xcit pos embed use of arange for better low prec safety.
2024-01-24 22:04:04 -08:00
Ross Wightman
809a9e14e2
Pass train-crop-mode to create_loader/transforms from train.py args
2024-01-24 16:19:02 -08:00
Ross Wightman
a48ab818f5
Improving device flexibility in train. Fix #2081
2024-01-20 15:10:20 -08:00
Li zhuoqun
53a4888328
Add droppath and type hint to Xception.
2024-01-19 11:15:47 -08:00
kalazus
7f19a4cce7
fix fast catavgmax selection
2024-01-16 10:30:08 -08:00
lorenzbaraldi
8c663c4b86
Fixed index out of range in case of resume
2024-01-12 23:33:32 -08:00
Ross Wightman
36449617ff
Update README.md
2024-01-09 15:04:34 -08:00
Ross Wightman
614f9d2080
Update README.md
2024-01-09 14:59:49 -08:00
Ross Wightman
2eac2f6955
Fiddling with iterator wrapping for HF ds streaming
2024-01-09 12:41:54 -08:00
Ross Wightman
992976f007
Update version.py
2024-01-08 09:39:22 -08:00
Ross Wightman
c50004db79
Allow training w/o validation split set
2024-01-08 09:38:42 -08:00
Ross Wightman
be0944edae
Significant transforms, dataset, dataloading enhancements.
2024-01-08 09:38:42 -08:00
Ross Wightman
b5a4fa9c3b
Add pos_weight and support for summing over classes to BCE impl in train scripts
2023-12-30 12:13:06 -08:00
方曦
9dbea3bef6
fix cls head in hgnet
2023-12-27 21:26:26 +08:00
SeeFun
56ae8b906d
fix reset head in hgnet
2023-12-27 20:11:29 +08:00
SeeFun
6862c9850a
fix backward in hgnet
2023-12-27 16:49:37 +08:00
SeeFun
6cd28bc5c2
Merge branch 'huggingface:main' into master
2023-12-27 16:43:37 +08:00
Ross Wightman
f2fdd97e9f
Add parsable json results output for train.py, tweak --pretrained-path to force head adaptation
2023-12-22 11:18:25 -08:00
LR
e0079c92da
Update eva.py ( #2058 )
...
* Update eva.py
When argument class token = False, self.cls_token = None.
Prevents error from attempting trunc_normal_ on None:
AttributeError: 'NoneType' object has no attribute 'uniform_'
* Update eva.py
fix
2023-12-16 15:10:45 -08:00
Li zhuoqun
7da34a999a
add type annotations in the code of swin_transformer_v2
2023-12-15 09:31:25 -08:00
Fredo Guan
bbe798317f
Update EdgeNeXt to use ClassifierHead as per ConvNeXt ( #2051 )
...
* Update edgenext.py
2023-12-11 12:17:19 -08:00
Ross Wightman
711c5dee6d
Update sgdw for older pytorch
2023-12-11 12:10:29 -08:00
Ross Wightman
60b170b200
Add --pretrained-path arg to train script to allow passing local checkpoint as pretrained. Add missing/unexpected keys log.
2023-12-11 12:10:29 -08:00
Ross Wightman
17a47c0e35
Add SGDW optimizer
2023-12-11 12:10:29 -08:00
Fredo Guan
2597ce2860
Update davit.py
2023-12-11 11:13:04 -08:00
Ross Wightman
c82598fc5e
Remove deprecated doc delete workflows
2023-12-05 12:04:38 -08:00
akiyuki ishikawa
2bd043ce5d
fix doc position
2023-12-05 12:00:51 -08:00
akiyuki ishikawa
4f2e1bf4cb
Add missing docs in SwinTransformerStage
2023-12-05 12:00:51 -08:00
Ross Wightman
df7ae11eb2
Add device arg for patch embed resize, fix #2024
2023-12-04 11:42:13 -08:00