Commit Graph

116 Commits (b0b4422736d069c6eaeb3e0e584ea8d42b9d4138)

Author SHA1 Message Date
John b0b4422736 fix a redundant 2023-09-05 22:22:43 +08:00
John 9b75ce0aa4 only keep one file to set swin transformer v2 model config 2023-09-05 22:16:07 +08:00
John f4d372ba7d only keep one file to set swin transformer model config 2023-09-05 21:26:43 +08:00
John ed3b7f8ae6 format all file names 2023-09-05 16:00:29 +08:00
John ddc6d0b121 Merge remote-tracking branch 'upstream/dev' into dev 2023-09-05 15:23:21 +08:00
ZhangYiqin da1da48eb6
[Enhance] Add iTPN Supports for Non-three channel image (#1735)
* Add channel argments to mae_head

When trying iTPN pretrain, it only supports images with 3 channels. One of the restrictions is from MAEHead.

* Transfer other argments from iTPNHiViT to HiViT

The HiViT supports specifying channels, but the iTPNHiViT class can't pass channel argments to it. This is one of the reasons that iTPNHiViT implementation only support images with 3 channels.

* Update itpn.py

Fix hint problem
2023-09-04 13:11:16 +08:00
John 634852ad61 [CodeCamp2023-338] New Version of config Adapting Swin Transformer Algorithm 2023-08-31 18:15:47 +08:00
mzr1996 6bb0c8a987 Bump version to v1.0.2 2023-08-15 11:51:12 +08:00
mzr1996 bf62497e02 Merge remote-tracking branch 'origin/main' into dev 2023-08-15 11:37:22 +08:00
mstwutao 6474d6befa
[CodeCamp2023-336] New Version of `config` Adapting MAE Algorithm (#1750)
* fix typo MIMHIVIT to MAEHiViT

* fix typo MIMHiViT to MAEHiViT

* [CodeCamp2023-336] New version of config adapting MAE algorithm

* pre-commit check

* Revert soft-link modification

---------

Co-authored-by: mzr1996 <mzr1996@163.com>
2023-08-14 17:20:39 +08:00
AzulaFire 1be28ea7c4
[CodeCamp2023-337] New Version of config Adapting ConvNeXt Algorithm (#1760)
* add configs\_base_\datasets\imagenet21k_bs128.py

* update convnext_base_32xb128_in1k_384px.py

* add  convnext-base_32xb128_in1k.py

* add convnext-base_32xb128_in21k.py

* add convnext-large_64xb64_in1k-384px.py

* add convnext-large_64xb64_in1k.py

* add convnext-large_64xb64_in21k.py

* add convnext-small_32xb128_in1k-384px.py

* add convnext-small_32xb128_in1k.py

* add convnext-tiny_32xb128_in1k-384px.py

* add convnext-tiny_32xb128_in1k.py

* add convnext-xlarge_64xb64_in1k-384px.py

* add convnext-xlarge_64xb64_in1k.py

* add convnext-xlarge_64xb64_in21k.py

* pre-commit check
2023-08-14 15:25:59 +08:00
Am_mu bff80d3c48
[CodeCamp2023-335]New version of config adapting BeitV2 Algorithm (#1755) 2023-08-14 15:04:42 +08:00
fanqiNO1 29d706248c
[Enhancement] Support training of BLIP2 (#1700)
* [Fix] Fix BEiT pre_norm

* [Enhancement] Support BLIP2 training

* [Fix] Fix quoted strings

* [Fix] Fix init_weights

* [Fix] Fix with_cls_token

* [Fix] Fix tokenizer

* [Fix] Fix quoted strings

* [Fix] Fix predict

* [Fix] Cancel changing BEiT

* [Fix] Add loading hook

* [Fix] Reformat with yapf

* [Fix] Fix prompt

* [Fix] Fix typo
2023-08-10 11:15:38 +08:00
Yuan Liu fa53174fd9
[Feature]: Add MFF (#1725)
* [Feature]: Add MFF

* [Feature]: Add mff linear prob

* [Feature]: Add ft

* [Fix]: Update docstring

* [Feature]: Update out_indices

* [Feature]: Add prefix to ft

* [Feature]: Add README

* [Feature]: Update readme

* [Feature]: Update README

* [Feature]: Add metafile

* [Feature]: Update README

* [Fix]: Fix lint

* [Feature]: Add UT

* [Feature]: Update paper link
2023-08-08 16:01:07 +08:00
No-518 1dda91bf24
[CodeCamp2023-343] Update dataset_prepare.md (#1732)
* Update dataset_prepare.md

* Enhanced docstring for RefCOCO and updated datasets.rst

* fix ln

* update

---------

Co-authored-by: No-518 <wybang@gmail.com>
Co-authored-by: fangyixiao18 <fangyx18@hotmail.com>
2023-08-03 19:24:23 +08:00
Zeyuan 2fb52eefdc
[CodeCamp2023-339] New Version of `config` Adapting Vision Transformer Algorithm (#1727)
* add old config

* add old config

* add old config

* renew vit-base-p16_64xb64_in1k.py

* rename

* finish vit_base_p16_64xb64_in1k_384px.py

* finish vit_base_p32_64xb64_in1k.py and 384px

* finish 4 vit_large*.py

* finish vit_base_p16_32xb128_mae_in1k.py

* add vit_base_p16_4xb544_ipu_in1k.py

* modify data_root

* using  to modify cfg

* pre-commit check

* ignore ipu

* keep other files no change

* remove redefinition

* only keep vit_base_p16.py

* move init_cfg into model.update
2023-08-02 10:06:08 +08:00
Yike Yuan 340d187765
Support Infographic VQA dataset and ANLS metric. (#1667) 2023-08-01 16:22:34 +08:00
Yike Yuan 4f2f3752d9
Support IconQA dataset. (#1670) 2023-08-01 16:14:40 +08:00
Yixiao Fang 5c71eba13d
Bump version to 1.0.1 (#1731)
* bump version to 1.0.1

* update changelog

* update readme

* Update changelog.md

* update requirements

---------

Co-authored-by: Ma Zerun <mzr1996@163.com>
2023-07-31 17:08:05 +08:00
fangyixiao18 58a2243d99 Merge branch 'main' into dev 2023-07-28 15:35:55 +08:00
Yixiao Fang 1f99279657
[Fix] Fix dict update in minigpt4. (#1709) 2023-07-28 15:30:30 +08:00
Yixiao Fang b1cd05caf2
[Enhance] Set 'is_init' in some multimodal methods (#1718)
* update is_init of multimodal

* Update minigpt4.py

---------

Co-authored-by: Ma Zerun <mzr1996@163.com>
2023-07-28 15:28:07 +08:00
marouane amzil e7fc25cf64
[Fix] Fix nested predict for multi-task prediction. (#1716)
* fix: multi task predict

* change the loop

---------

Co-authored-by: Pierre Colle <piercus@gmail.com>
2023-07-28 13:44:12 +08:00
Nripesh Niketan 4d1dbafaa2
[Enhance] Add GPU Acceleration Apple silicon mac (#1699)
* Add GPU Acceleration Apple silicon mac

* lint fix

* Update launch.py

* Use  to refactor the device selection.

* Update launch.py

---------

Co-authored-by: mzr1996 <mzr1996@163.com>
2023-07-26 17:51:00 +08:00
liyl 2b8d8eecb2
[Fix] Fix the issue #1711 "GaussianBlur doesn't work" (#1722)
* Fix issue 1711. GaussianBlur.

* Fix UT

---------

Co-authored-by: mzr1996 <mzr1996@163.com>
2023-07-25 11:25:32 +08:00
fanqiNO1 64c446d507
[Feature] Support LoRA. (#1687)
* [Feature] Support LoRA

* [Feature] Support LoRA

* [Fix] Fix bugs

* [Refactor] Add copyright

* [Fix] Fix bugs

* [Enhancement] Add

* [Fix] Fix bugs

* [Fix] Fix bugs

* [Fix] Fix bugs

* [Fix] Fix bugs

* [Fix] Fix bugs

* [Docs] Update docstring

* [Docs] Update docstring

* [Refactor] Reformat with yapf

* [Docs] Update docstring

* [Refactor] Docformat

* [Refactor] Fix double-quote-string

* [Fix] fix pytorch version

* [Fix] isort

* [Fix] isort

* [Enhancement] Extend forward

* [Enhancement] Extend test

* [Fix] Fix targets

* [Enhancement] Extend LoRA to frozen models

* [Fix] Fix spelling

* [Fix] Override __getattr__

* [Fix] Add init_cfg

* [Enhancement] Add example config

* [Fix] Fix init_cfg

* [Enhancement] Add merging script

* [Fix] Remove init_cfg

* [Fix] Change lora key

* [Fix] Fix merge scripts

* [Fix] Fix merge scripts

* [Docs] Add docs

* [Fix] fix
2023-07-24 11:30:57 +08:00
mzr1996 60d780f99e Fix docs 2023-07-20 10:21:15 +08:00
Fabien Merceron PRL db395d35b1
fix_freeze_without_cls_token_vit (#1693) 2023-07-14 15:43:19 +08:00
fanqiNO1 465b6bdeec
[Refactor] Fix spelling (#1689) 2023-07-13 15:38:58 +08:00
fanqiNO1 5c43d3ef42
[Refactor] BEiT refactor (#1705)
* [Refactor] BEiT refactor

* [Fix] Fix arch zoo

* [Fix] Fix arch zoo

* [Fix] Fix freeze stages

* [Fix] Fix freeze ln2

* [Fix] Fix freezing vit ln2
2023-07-11 15:49:41 +08:00
Ezra-Yu 78d0ddc852
[Fix] Fix RandomCrop bug (#1706) 2023-07-11 10:18:08 +08:00
Yixiao Fang ae7a7b7560
Bump version to 1.0.0 (#1686)
* bump version to 1.0.0

* update

* update

* fix lint

* update

* update

* update changelog

* update
2023-07-05 11:51:12 +08:00
fanqiNO1 7cbfb36c14
[Refactor] Fix spelling (#1681)
* [Refactor] Fix spelling

* [Refactor] Fix spelling

* [Refactor] Fix spelling

* [Refactor] Fix spelling
2023-07-05 11:07:43 +08:00
Wang Xiang feb0814b2f
[Feature] Transfer shape-bias tool from mmselfsup (#1658)
* Transfer shape-bias tool from mmselfsup

* run shape-bias successfully, add CN docs

* fix unit test bug

* add shape_bias to index.rst in docs

* modified mistakes in shape-bias docs
2023-07-03 11:39:23 +08:00
Peng Lu 00030e3f7d
[Fix] refactor _prepare_pos_embed in ViT to fix bug in loading old checkpoint (#1679) 2023-07-03 11:36:44 +08:00
Wangbo Zhao(黑色枷锁) 68758db7a8
[Fix] freeze pre norm in vision transformer. (#1672) 2023-06-28 17:00:27 +08:00
Yixiao Fang 10685fc81c
[Refactor] Replace if '_base_' with read_base(). (#1665) 2023-06-28 16:57:18 +08:00
Yixiao Fang 70ff2abbf7
[Refactor] Refactor _prepare_pos_embed in ViT (#1656)
* deal with cls_token

* Update implement

---------

Co-authored-by: mzr1996 <mzr1996@163.com>
2023-06-20 17:37:08 +08:00
Ma Zerun 7d850dfadd
[Improve] Update Otter and LLaVA docs and config. (#1653) 2023-06-19 20:16:13 +08:00
Mashiro d6056af2b8
[Fix][New_config] Fix demo bug (#1647)
* Fix demo

* Update implement

---------

Co-authored-by: mzr1996 <mzr1996@163.com>
2023-06-19 15:15:28 +08:00
Yiqin Wang 王逸钦 6d7fe91a98
[Feature] Support Flickr30k Retrieval dataset (#1625)
* format

* remove abs path

* init add flickr30k caption

* remove abs dir

* update blip readme

* add convert sscripts

* minor

* minor
2023-06-19 15:15:03 +08:00
Yixiao Fang a1cfe888e2
[Feature] Support SparK. (#1531)
* add spark configs

* fix configs

* remove repeat aug

* add module codes

* support lr layer decay of resnet

* update

* fix lint

* add metafile and readme

* fix lint

* add models and logs

* refactor codes

* fix lint

* update model rst

* update name

* add docstring

* add ut

* fix lint

---------

Co-authored-by: Ma Zerun <mzr1996@163.com>
2023-06-19 11:27:50 +08:00
Ma Zerun bfd49b0d52
[Feature] Support LLaVA (#1652) 2023-06-17 16:05:52 +08:00
Ma Zerun e69bace03f
[Feature] Support otter (#1651)
* [Feature] Support Otter

* Update docs
2023-06-17 16:03:21 +08:00
Yixiao Fang 9d3fc43073
[Feature] Support MiniGPT-4 (#1642)
* support inference of MiniGPT-4

* refine codes

* update metafile, readme and docs

* fix typo

* fix lint

* add ckpt load hook
2023-06-16 22:50:34 +08:00
Yike Yuan a673b048a5
[Feature] Add support for VizWiz dataset. (#1636)
* add vizwiz

* update dataset

* [Fix] Build img_path in data_sample.

* Fix isort.

---------

Co-authored-by: ZhangYuanhan-AI <yuanhan002@ntu.edu.sg>
2023-06-16 17:16:17 +08:00
Yixiao Fang aac398a83f
[Feature] Support new configs. (#1639)
* [Feature] Support new configs (#1638)

* add new config of mae and simclr

* update

* update setup.cfg

* update eva

* update

* update new config

* Add new config

* remove __init__.py

* 1. remove ; 2. remove mmpretrain/configs/_base_/models/convnext

* remove model_wrapper_cfg and add out type

* Add comment for setting default_scope to NOne

* update if '_base_' order

* update

* revert changes

---------

Co-authored-by: fangyixiao18 <fangyx18@hotmail.com>

* Add warn at the head of new config files

---------

Co-authored-by: Mashiro <57566630+HAOCHENYE@users.noreply.github.com>
Co-authored-by: mzr1996 <mzr1996@163.com>
2023-06-16 16:54:45 +08:00
Ezra-Yu 93e0f107c4
[Fix] Fix bug loading IN1k dataset. (#1641) 2023-06-16 15:35:27 +08:00
Yike Yuan 7581b76233
[Feature] Add support for vsr dataset (#1634)
* add VSR dataset

* [Fix] Modify example and load gt_answer as string.

---------

Co-authored-by: ZhangYuanhan-AI <yuanhan002@ntu.edu.sg>
2023-06-15 19:17:02 +08:00
zzc98 53648baca5
[Fix] fix sam bug (#1633) 2023-06-15 10:10:51 +08:00