833 Commits

Author SHA1 Message Date
Fredo Guan
edea013dd1
Davit std (#3)
Davit with all features working
2022-12-09 02:53:21 -08:00
Ross Wightman
7c4ed4d5a4 Add EVA-large models 2022-12-08 16:21:30 -08:00
Fredo Guan
434a03937d
Merge branch 'rwightman:main' into main 2022-12-08 08:05:16 -08:00
Ross Wightman
98047ef5e3 Add EVA FT results, hopefully fix BEiT test failures 2022-12-07 08:54:06 -08:00
Ross Wightman
3cc4d7a894 Fix missing register for 224 eva model 2022-12-07 08:54:06 -08:00
Ross Wightman
eba07b0de7 Add eva models to beit.py 2022-12-07 08:54:06 -08:00
Fredo Guan
3bd96609c8
Davit (#1)
Implement the davit model from https://arxiv.org/abs/2204.03645 and https://github.com/dingmyu/davit
2022-12-06 17:19:25 -08:00
Ross Wightman
927f031293 Major module / path restructure, timm.models.layers -> timm.layers, add _ prefix to all non-model modules in timm.models 2022-12-06 15:00:06 -08:00
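For context on the restructure above, a minimal sketch of the import change (DropPath / trunc_normal_ are just assumed example symbols from the layers package):

```python
# Hedged sketch of the timm.models.layers -> timm.layers move.

# Old location (pre-restructure):
# from timm.models.layers import DropPath, trunc_normal_

# New top-level location after the restructure:
from timm.layers import DropPath, trunc_normal_

drop_path = DropPath(drop_prob=0.1)  # usable exactly as before the move
```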
Ross Wightman
3785c234d7 Remove CLIP ViT models that won't be fine-tuned and comment out two that aren't uploaded yet 2022-12-05 10:21:34 -08:00
Ross Wightman
755570e2d6 Rename _pretrained.py -> pretrained.py, not feasible to change the other files to the same scheme without breaking uses 2022-12-05 10:21:34 -08:00
Ross Wightman
72cfa57761 Add ported TensorFlow MaxVit weights. Add a few more CLIP ViT fine-tunes. Tweak some model tag names. Improve model tag name sorting. Update HF hub push config layout. 2022-12-05 10:21:34 -08:00
Ross Wightman
4d5c395160 MaxVit, ViT, ConvNeXt, and EfficientNet-v2 updates
* Add support for TF weights and modelling specifics to MaxVit (testing ported weights)
* More fine-tuned CLIP ViT configs
* ConvNeXt and MaxVit updated to use the new pretrained cfgs
* EfficientNetV2, MaxVit and ConvNeXt high res models use squash crop/resize
2022-12-05 10:21:34 -08:00
Ross Wightman
9da7e3a799 Add crop_mode for pretrained config / image transforms. Add support for dynamo compilation to benchmark/train/validate 2022-12-05 10:21:34 -08:00
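A hedged illustration of the crop_mode addition mentioned above, assuming it is exposed through timm.data.create_transform ('squash' is the mode referenced for the high-res models in the MaxVit/ConvNeXt commit above; the exact argument name and values are assumptions, not confirmed API):

```python
from timm.data import create_transform

# Assumed usage of the new crop_mode option for eval transforms.
eval_tf = create_transform(
    input_size=384,
    is_training=False,
    crop_pct=1.0,
    crop_mode='squash',  # squash/resize to target size rather than center crop
)
print(eval_tf)
```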
Ross Wightman
b2b6285af7 Add two more FT clip weights 2022-12-05 10:21:34 -08:00
Ross Wightman
5895056dc4 Add openai b32 ft 2022-12-05 10:21:34 -08:00
Ross Wightman
9dea5143d5 Adding more clip ft variants 2022-12-05 10:21:34 -08:00
Ross Wightman
444dcba4ad CLIP B16 12k weights added 2022-12-05 10:21:34 -08:00
Ross Wightman
dff4717cbf Add clip b16 384x384 finetunes 2022-12-05 10:21:34 -08:00
Ross Wightman
883fa2eeaa Add fine-tuned B/16 224x224 in1k clip models 2022-12-05 10:21:34 -08:00
Ross Wightman
9a3d2ac2d5 Add latest CLIP ViT fine-tune pretrained configs / model entrypoint updates 2022-12-05 10:21:34 -08:00
Ross Wightman
42bbbddee9 Add missing model config 2022-12-05 10:21:34 -08:00
Ross Wightman
def68befa7 Updating vit model defs for multi-weight support trial (vit first). Prepping for CLIP (laion2b and openai) fine-tuned weights. 2022-12-05 10:21:34 -08:00
Ross Wightman
0dadb4a6e9 Initial multi-weight support, handled so the old pretrained config handling co-exists with new tags. 2022-12-05 10:21:34 -08:00
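A hedged sketch of how the new multi-weight tags are addressed (the '.tag' suffix shown is illustrative only; actual tag names may differ):

```python
import timm

# With multi-weight support, pretrained variants are selected by a
# 'model_name.tag' suffix; listing shows which names have weights.
print(timm.list_models('vit_base_patch16*', pretrained=True))

# Illustrative tag only (assumed, not a guaranteed weight name):
model = timm.create_model('vit_base_patch16_224.augreg_in21k_ft_in1k', pretrained=True)
```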
Wauplin
9b114754db refactor push_to_hub helper 2022-11-16 12:03:34 +01:00
Wauplin
ae0a0db7de Create repo before cloning with Repository.clone_from 2022-11-15 15:17:20 +01:00
Ross Wightman
803254bb40 Fix spacing misalignment for fast norm path in LayerNorm modules 2022-10-24 21:43:49 -07:00
Ross Wightman
6635bc3f7d
Merge pull request #1479 from rwightman/script_cleanup
Train / val script enhancements, non-GPU (ie CPU) device support, HF datasets support, TFDS/WDS dataloading improvements
2022-10-15 09:29:39 -07:00
Ross Wightman
0e6023f032
Merge pull request #1381 from ChristophReich1996/master
Fix typo in PositionalEncodingFourier
2022-10-14 18:34:33 -07:00
Ross Wightman
66f4af7090 Merge remote-tracking branch 'origin/master' into script_cleanup 2022-10-14 15:54:00 -07:00
Ross Wightman
9914f744dc Add more maxxvit weights including ConvNeXt conv-block-based experiments. 2022-10-10 21:49:18 -07:00
Mohamed Rashad
8fda68aff6
Fix repo id bug
This fixes issue #1482
2022-10-05 16:26:06 +02:00
Ross Wightman
1199c5a1a4 clip_laion2b models need 1e-5 eps for LayerNorm 2022-09-25 10:36:54 -07:00
Ross Wightman
e858912e0c Add brute-force checkpoint remapping option 2022-09-23 16:07:03 -07:00
Ross Wightman
b293dfa595 Add CL SE module 2022-09-23 16:06:09 -07:00
Ross Wightman
a383ef99f5 Make huggingface_hub necessary if it's the only source for a pretrained weight 2022-09-23 13:54:21 -07:00
Ross Wightman
e069249a2d Add hf hub entries for laion2b clip models, add huggingface_hub dependency, update some setup/reqs, torch >= 1.7 2022-09-16 21:39:05 -07:00
Ross Wightman
9d65557be3 Fix errant import 2022-09-15 17:47:23 -07:00
Ross Wightman
9709dbaaa9 Adding support for fine-tuned CLIP LAION-2B image tower weights for B/32, L/14, H/14 and g/14. Still WIP 2022-09-15 17:25:59 -07:00
Ross Wightman
a520da9b49 Update tresnet features_info for v2 2022-09-13 20:54:54 -07:00
Ross Wightman
c8ab747bf4 BEiT-V2 checkpoints didn't remove 'module' from weights, adapt checkpoint filter 2022-09-13 17:56:49 -07:00
Ross Wightman
73049dc2aa Fix type in dla weight update 2022-09-13 17:52:45 -07:00
Ross Wightman
e11efa872d Update a bunch of weights with external links to timm release assets. Fixes issue with *aliyuncs.com returning forbidden. Did pickle scan / verify and re-hash. Add TresNet-V2-L weights. 2022-09-13 16:35:26 -07:00
Ross Wightman
fa8c84eede Update maxvit_tiny_256 weight to a better iteration, add coatnet / maxvit / maxxvit model defs for future runs 2022-09-07 12:37:37 -07:00
Ross Wightman
c1b3cea19d Add maxvit_rmlp_tiny_rw_256 model def and weights w/ 84.2 top-1 @ 256, 84.8 @ 320 2022-09-07 10:27:11 -07:00
Ross Wightman
914544fc81 Add beitv2 224x224 checkpoints from https://github.com/microsoft/unilm/tree/master/beit2 2022-09-06 20:25:18 -07:00
Ross Wightman
dc90816f26 Add maxvit_tiny_rw_224 weights 83.5 @ 224 and maxvit_rmlp_pico_rw_256 relpos weights, 80.5 @ 256, 81.3 @ 320 2022-09-06 16:14:41 -07:00
Ross Wightman
f489f02ad1 Make gcvit window size ratio-based to improve support for changing resolution #1449. Change default init to original. 2022-09-06 16:14:00 -07:00
Ross Wightman
7f1b223c02 Add maxvit_rmlp_nano_rw_256 model def & weights, make window/grid size dynamic wrt img_size by default 2022-08-29 15:49:32 -07:00
Ross Wightman
e6a4361306 pretrained_cfg entry for mvitv2_small_cls 2022-08-28 15:27:01 -07:00
Ross Wightman
f66e5f0e35 Fix class token support in MViT-V2, add small_class variant to ensure it's tested. Fix #1443 2022-08-28 15:24:04 -07:00