Commit Graph

240 Commits (dbef2b41c6845e86b68370287537924d46fd41d2)

Author SHA1 Message Date
mzr1996 dbef2b41c6 [Fix] Align COCO dataset format. 2023-06-19 07:24:07 +00:00
Yiqin Wang 王逸钦 6d7fe91a98
[Feature] Support Flickr30k Retrieval dataset (#1625)
* format

* remove abs path

* init add flickr30k caption

* remove abs dir

* update blip readme

* add convert sscripts

* minor

* minor
2023-06-19 15:15:03 +08:00
Yixiao Fang a1cfe888e2
[Feature] Support SparK. (#1531)
* add spark configs

* fix configs

* remove repeat aug

* add module codes

* support lr layer decay of resnet

* update

* fix lint

* add metafile and readme

* fix lint

* add models and logs

* refactor codes

* fix lint

* update model rst

* update name

* add docstring

* add ut

* fix lint

---------

Co-authored-by: Ma Zerun <mzr1996@163.com>
2023-06-19 11:27:50 +08:00
Ma Zerun bfd49b0d52
[Feature] Support LLaVA (#1652) 2023-06-17 16:05:52 +08:00
Ma Zerun e69bace03f
[Feature] Support otter (#1651)
* [Feature] Support Otter

* Update docs
2023-06-17 16:03:21 +08:00
Yixiao Fang 9d3fc43073
[Feature] Support MiniGPT-4 (#1642)
* support inference of MiniGPT-4

* refine codes

* update metafile, readme and docs

* fix typo

* fix lint

* add ckpt load hook
2023-06-16 22:50:34 +08:00
Yike Yuan a673b048a5
[Feature] Add support for VizWiz dataset. (#1636)
* add vizwiz

* update dataset

* [Fix] Build img_path in data_sample.

* Fix isort.

---------

Co-authored-by: ZhangYuanhan-AI <yuanhan002@ntu.edu.sg>
2023-06-16 17:16:17 +08:00
Yike Yuan 7581b76233
[Feature] Add support for vsr dataset (#1634)
* add VSR dataset

* [Fix] Modify example and load gt_answer as string.

---------

Co-authored-by: ZhangYuanhan-AI <yuanhan002@ntu.edu.sg>
2023-06-15 19:17:02 +08:00
Hubert 8e9e880601
[Feat] Add download link for coco caption and retrieval annotations. (#1607)
* [Feat] Add download link for coco caption and retrieval annotations.

* minor fix
2023-06-13 10:29:54 +08:00
Yiqin Wang 王逸钦 bb415b91be
[Feature] Support OCR-VQA dataset (#1621)
* support ocrvqa dataset

* minor

* remove abs path

* refine README
2023-06-13 10:28:45 +08:00
Yiqin Wang 王逸钦 dbfb84ccbd
[Feature] Support OK-VQA dataset (#1615)
* add okvqa

* refine README
2023-06-08 16:57:18 +08:00
Wangbo Zhao(黑色枷锁) 3a277ee9e6
[Feature] support TextVQA dataset (#1596)
* [Support] Suport TextVQA dataset

* add folder structure

* fix readme
2023-06-02 11:50:38 +08:00
zzc98 bc3c4a35ee
[Refactor] Support to use "split" to specify training set/validation set in the ImageNet dataset (#1535)
* [Feature]: Add caption

* [Feature]: Update scienceqa

* [CI] Add test mim CI. (#879)

* refactor imagenet dataset

* refactor imagenet dataset

* refactor imagenet dataset

* update imagenet21k

* update configs

* update mnist

* update dataset_prepare.md

* fix sun397 url and update user_guides/dataset_prepare.md

* update dataset_prepare.md

* fix sun397 dataset

* fix sun397

* update chinese dataset_prepare.md

* update dataset_prepare.md

* [Refactor] update voc dataset

* [Refactor] update voc dataset

* refactor imagenet

* refactor imagenet

* use mmengine.fileio

---------

Co-authored-by: liuyuan <3463423099@qq.com>
Co-authored-by: Ma Zerun <mzr1996@163.com>
Co-authored-by: Ezra-Yu <18586273+Ezra-Yu@users.noreply.github.com>
2023-06-02 11:03:18 +08:00
Yixiao Fang e4c4a81b56
[Feature] Support iTPN and HiViT (#1584)
* hivit added

* Update hivit.py

* Update hivit.py

* Add files via upload

* Update __init__.py

* Add files via upload

* Update __init__.py

* Add files via upload

* Update hivit.py

* Add files via upload

* Add files via upload

* Add files via upload

* Add files via upload

* Update itpn.py

* Add files via upload

* Update __init__.py

* Update mae_hivit-base-p16.py

* Delete mim_itpn-base-p16.py

* Add files via upload

* Update itpn_hivit-base-p16.py

* Update itpn.py

* Update hivit.py

* Update __init__.py

* Update mae.py

* Delete hivit.py

* Update __init__.py

* Delete configs/itpn directory

* Add files via upload

* Add files via upload

* Delete configs/hivit directory

* Add files via upload

* refactor and add metafile and readme

* update clip

* add ut

* update ut

* update

* update docstring

* update model.rst

---------

Co-authored-by: 田运杰 <48153283+sunsmarterjie@users.noreply.github.com>
2023-05-26 12:08:34 +08:00
Ezra-Yu 1f07c92ed1
[Feature] Add retrieval mAP metric. (#1552)
* rebase

* fefine

* fix lint

* update readme

* rebase

* fix lint

* update docstring

* update docstring

* rebase

* rename corespanding names

* rebase
2023-05-26 10:40:08 +08:00
Wangbo Zhao(黑色枷锁) a779c8c5a7
[Feature] Support NoCap dataset based on BLIP. (#1582)
* [Feature] Support nocaps dataset

* precommit

* Use official coco format

* add nocp readme

* fix readme

---------

Co-authored-by: mzr1996 <mzr1996@163.com>
2023-05-23 18:06:43 +08:00
Yuan Liu 46a523ef63
[Feature] Add GQA dataset. (#1585)
* [Feature]: Add GQA dataset

* [Feature]: Add GQA

* [Feature]: Add GQA UT

* [Fix]: Fix hint

* [Feature]: Add BLIP2 GQA

* [Fix]: Fix lint

* [Feature]: Update anno link

* [Fix]: Update docstring

* [Feature]: Update all links
2023-05-23 11:25:42 +08:00
ZhangYiqin 023d6869bd
[Fix] Incorrect stage freeze on RIFormer Model (#1573)
* [Doc] RIFormer's README did not link to its paper properly

* Incorrect code for reproducing RIFormer 

the default value of frozen stage is set to 0, and the doc says that this will lead to no stage be frozen. But the actual case is the patch_embed will be freezed.

This may cause incorrect training, thus influencing the result.

I suggest a careful review.
2023-05-22 16:01:32 +08:00
Yixiao Fang 1e478462b8
[Feature] Support Chinese CLIP. (#1576)
* support cn-clip

* update README

* Update progress bar

* update order of category

* fix lint

* update

* update readme and metafile

* update

* update docstring

* refactor tokenizer

* fix lint

* Update README and progress bar

---------

Co-authored-by: mzr1996 <mzr1996@163.com>
2023-05-22 15:46:13 +08:00
Ma Zerun 6847d20d57
[Feature] Support multiple multi-modal algorithms and inferencers. (#1561)
* [Feat] Migrate blip caption to mmpretrain. (#50)

* Migrate blip caption to mmpretrain

* minor fix

* support train

* [Feature] Support OFA caption task. (#51)

* [Feature] Support OFA caption task.

* Remove duplicated files.

* [Feature] Support OFA vqa task. (#58)

* [Feature] Support OFA vqa task.

* Fix lint.

* [Feat] Add BLIP retrieval to mmpretrain. (#55)

* init

* minor fix for train

* fix according to comments

* refactor

* Update Blip retrieval. (#62)

* [Feature] Support OFA visual grounding task. (#59)

* [Feature] Support OFA visual grounding task.

* minor add TODO

---------

Co-authored-by: yingfhu <yingfhu@gmail.com>

* [Feat] Add flamingos coco caption and vqa. (#60)

* first init

* init flamingo coco

* add vqa

* minor fix

* remove unnecessary modules

* Update config

* Use `ApplyToList`.

---------

Co-authored-by: mzr1996 <mzr1996@163.com>

* [Feature]: BLIP2 coco retrieval  (#53)

* [Feature]: Add blip2 retriever

* [Feature]: Add blip2 all modules

* [Feature]: Refine model

* [Feature]: x1

* [Feature]: Runnable coco ret

* [Feature]: Runnable version

* [Feature]: Fix lint

* [Fix]: Fix lint

* [Feature]: Use 364 img size

* [Feature]: Refactor blip2

* [Fix]: Fix lint

* refactor files

* minor fix

* minor fix

---------

Co-authored-by: yingfhu <yingfhu@gmail.com>

* Remove

* fix blip caption inputs (#68)

* [Feat] Add BLIP NLVR support. (#67)

* first init

* init flamingo coco

* add vqa

* add nlvr

* refactor nlvr

* minor fix

* minor fix

* Update dataset

---------

Co-authored-by: mzr1996 <mzr1996@163.com>

* [Feature]: BLIP2 Caption (#70)

* [Feature]: Add language model

* [Feature]: blip2 caption forward

* [Feature]: Reproduce the results

* [Feature]: Refactor caption

* refine config

---------

Co-authored-by: yingfhu <yingfhu@gmail.com>

* [Feat] Migrate BLIP VQA to mmpretrain (#69)

* reformat

* change

* change

* change

* change

* change

* change

* change

* change

* change

* change

* change

* change

* change

* change

* change

* change

* change

* change

* change

* refactor code

---------

Co-authored-by: yingfhu <yingfhu@gmail.com>

* Update RefCOCO dataset

* [Fix] fix lint

* [Feature] Implement inference APIs for multi-modal tasks. (#65)

* [Feature] Implement inference APIs for multi-modal tasks.

* [Project] Add gradio demo.

* [Improve] Update requirements

* Update flamingo

* Update blip

* Add NLVR inferencer

* Update flamingo

* Update hugging face model register

* Update ofa vqa

* Update BLIP-vqa (#71)

* Update blip-vqa docstring (#72)

* Refine flamingo docstring (#73)

* [Feature]: BLIP2 VQA (#61)

* [Feature]: VQA forward

* [Feature]: Reproduce accuracy

* [Fix]: Fix lint

* [Fix]: Add blank line

* minor fix

---------

Co-authored-by: yingfhu <yingfhu@gmail.com>

* [Feature]: BLIP2 docstring (#74)

* [Feature]: Add caption docstring

* [Feature]: Add docstring to blip2 vqa

* [Feature]: Add docstring to retrieval

* Update BLIP-2 metafile and README (#75)

* [Feature]: Add readme and docstring

* Update blip2 results

---------

Co-authored-by: mzr1996 <mzr1996@163.com>

* [Feature] BLIP Visual Grounding on MMPretrain Branch (#66)

* blip grounding merge with mmpretrain

* remove commit

* blip grounding test and inference api

* refcoco dataset

* refcoco dataset refine config

* rebasing

* gitignore

* rebasing

* minor edit

* minor edit

* Update blip-vqa docstring (#72)

* rebasing

* Revert "minor edit"

This reverts commit 639cec757c215e654625ed0979319e60f0be9044.

* blip grounding final

* precommit

* refine config

* refine config

* Update blip visual grounding

---------

Co-authored-by: Yiqin Wang 王逸钦 <wyq1217@outlook.com>
Co-authored-by: mzr1996 <mzr1996@163.com>

* Update visual grounding metric

* Update OFA docstring, README and metafiles. (#76)

* [Docs] Update installation docs and gradio demo docs. (#77)

* Update OFA name

* Update Visual Grounding Visualizer

* Integrate accelerate support

* Fix imports.

* Fix timm backbone

* Update imports

* Update README

* Update circle ci

* Update flamingo config

* Add gradio demo README

* [Feature]: Add scienceqa (#1571)

* [Feature]: Add scienceqa

* [Feature]: Change param name

* Update docs

* Update video

---------

Co-authored-by: Hubert <42952108+yingfhu@users.noreply.github.com>
Co-authored-by: yingfhu <yingfhu@gmail.com>
Co-authored-by: Yuan Liu <30762564+YuanLiuuuuuu@users.noreply.github.com>
Co-authored-by: Yiqin Wang 王逸钦 <wyq1217@outlook.com>
Co-authored-by: Rongjie Li <limo97@163.com>
2023-05-19 16:50:04 +08:00
zzc98 034919d032
[Feature] add eva02 backbone (#1450)
* [CI] Add test mim CI. (#879)

* [CI] Add test mim CI. (#879)

* feat: add eva02 backbone

* feat: add eva02 backbone

* feat: add eva02 backbone

* feat: add eva02 backbone

* feat: add eva02 backbone

* feat: add eva02 backbone

* feat: add eva02 backbone

* feat: add eva02 backbone

* update

* update ci

* rebase

* feat: add eva02 backbone

* feat: add eva02 backbone

* feat: add eva02 backbone

* feat: add eva02 backbone

* feat: add eva02 backbone

* feat: add eva02 backbone

* feat: add eva02 backbone

* feat: add eva02 backbone

* update

* update readme and configs

* update readme and configs

* refactore eva02

* [CI] Add test mim CI. (#879)

* feat: add eva02 backbone

* feat: add eva02 backbone

* feat: add eva02 backbone

* feat: add eva02 backbone

* feat: add eva02 backbone

* feat: add eva02 backbone

* feat: add eva02 backbone

* feat: add eva02 backbone

* update

* update ci

* rebase

* feat: add eva02 backbone

* feat: add eva02 backbone

* feat: add eva02 backbone

* update

* update readme and configs

* refactore eva02

* update readme and metafile

* update readme and metafile

* update readme and metafile

* update

* rename eva02

* rename eva02

* fix uts

* rename configs

---------

Co-authored-by: Ma Zerun <mzr1996@163.com>
Co-authored-by: Ezra-Yu <18586273+Ezra-Yu@users.noreply.github.com>
2023-05-06 19:28:31 +08:00
Yixiao Fang d9e561a09d
[Feature] Support dinov2 backbone (#1522)
* support dinov2 backbone

* update metafile and readme

* compatible to use_layer_scale

* update SwiGLUFFN

* add deprecation warning

* update
2023-05-05 16:59:37 +08:00
zzc98 496e098b21
[Feature] Support some downstream classification datasets. (#1467)
* feat: support some downstream classification datasets

* update sun397

* sum

* update sun397

* [CI] Add test mim CI. (#879)

* feat: support some downstream classification datasets

* update sun397

* sum

* update sun397

* rebase

* feat: support some downstream classification datasets

* update sun397

* update sun397

* update sun397

* update sun397

* fix unittest

* update docstring

* rm

* update

* update

* refactor names of datasets

* refactor some implements of datasets

* refactor some implements of datasets

* fix datasets unittest

* refactor cub and stanford cars

* refactor cub and cifar

* refactor cub and cifar

* refactor cub and cifar

* update downstream datasets and docs

* update docstring

---------

Co-authored-by: Ma Zerun <mzr1996@163.com>
Co-authored-by: Ezra-Yu <18586273+Ezra-Yu@users.noreply.github.com>
2023-05-05 14:43:14 +08:00
Yixiao Fang a3fa328f09
Fix config of beit (#1528) 2023-04-28 16:33:14 +08:00
Weihao Yu 3cd4fd4d64
Update PoolFormer citation to CVPR version (#1505) 2023-04-20 20:22:22 +08:00
takuoko fec3da781f
[Feature] Support GLIP (#1308)
* rebase

* add glip

* update glip

* add links

* rename

* fix doc

---------

Co-authored-by: Ezra-Yu <18586273+Ezra-Yu@users.noreply.github.com>
2023-04-17 19:19:23 +08:00
Yixiao Fang 0826df8963
[Feature] Add ViT of SAM (#1476)
* add vit of sam

* update

* update

* add ut

* update ut

* remove num_classes

* support dynamic input

* add ut

* add comments

* update ut
2023-04-13 17:03:28 +08:00
Yixiao Fang 9cbeceabb5
Bump version to v1.0.0rc7 (#1465)
* update

* update info

* update changelog

* update

* update description

* change to v1.0.0rc7
2023-04-07 17:34:21 +08:00
Ezra-Yu b8cab5c9f7 update readme 2023-04-06 21:56:25 +08:00
Ezra-Yu 3932ddec10 update ckpt path 2023-04-06 21:56:25 +08:00
techmonsterwang 5c3abb2b2a update riformer mmpretrain 2023-04-06 21:56:25 +08:00
techmonsterwang e115ac89f4 update riformer mmpretrain 2023-04-06 21:56:25 +08:00
techmonsterwang 53a57c6dad update riformer mmpretrain 2023-04-06 21:56:25 +08:00
techmonsterwang e4d8511ddf update riformer mmpretrain 2023-04-06 21:56:25 +08:00
techmonsterwang c9c7d9cc0f update riformer mmpretrain 2023-04-06 21:56:25 +08:00
techmonsterwang a6c24d104e update riformer mmpretrain 2023-04-06 21:56:25 +08:00
techmonsterwang e7da3f29f4 update riformer mmpretrain 2023-04-06 21:56:25 +08:00
techmonsterwang 32c258ff19 update riformer mmpretrain 2023-04-06 21:56:25 +08:00
techmonsterwang 0b70c108b0 update riformer mmpretrain 2023-04-06 21:56:25 +08:00
Yixiao Fang 1ee9bbe050
[Docs] Update links (#1457)
* update links

* update readtherdocs

* update

* update

* fix lint

* update

* update

* update

* update cov branch

* update

* update

* update
2023-04-06 20:58:52 +08:00
Yixiao Fang 9fb4e9c911
[Fix] fix config of maskfeat (#1424) 2023-03-30 11:45:18 +08:00
Yixiao Fang 164f16e248
[Fix] Fix init bug of r50 in contrastive leanrning (#1445) 2023-03-29 15:49:51 +08:00
Ma Zerun c4ccae40db
[Docs] Update user guides docs and tools for MMPretrain. (#1429)
* [Docs] Update user guides docs and tools for MMPretrain.

* Fix UT

* Fix Chinese docs.

* Improve according to comments.

* Fix windows CI.
2023-03-27 14:32:26 +08:00
Ma Zerun 6cedce234e
[Refactor] Update dev scripts to be compatible with selfsup tasks. (#1412)
* [Refactor] Update dev scripts to be compatible with selfsup tasks.

* Fix some missing fields in config files.

* Set maximum number of gpus for local training.

* Update README files

* Update according to comments.
2023-03-20 14:30:57 +08:00
Ma Zerun 4f5b38f225
[Refactor] Update almost tools and add unit tests for these tools. (#1418)
* [Refactor] Update almost tools and add unit tests for these tools.

* Fix Windows UT.
2023-03-17 10:50:51 +08:00
Ma Zerun 3472ee5d2c
[Feature] Implememnt the universal visualizer for multiple task. (#1404)
* [Feature] Implememnt the universal visualizer for multiple task.

* Update tools

* Improve according to comments.

* Fix tools docs

* Add --test-cfg option and set default collate function.
2023-03-09 11:36:54 +08:00
Ma Zerun dbf3df21a3
[Refactor] Use `out_type` to specify ViT-like backbone output. (#1408)
* [Refactor] Use  to specify ViT-like backbone output.

* Fix ClsBatchNormNeck

* Update mmpretrain/models/necks/mae_neck.py

---------

Co-authored-by: Yixiao Fang <36138628+fangyixiao18@users.noreply.github.com>
2023-03-09 11:02:58 +08:00
Yixiao Fang 63e5b512cc
[Refactor] Move part of tool scripts from mmselfsup. (#1403)
* add dataset converters and benchmark .sh

* refine

* fix lint

* add tsne

* rename visualizaition

* update configs and script
2023-03-07 17:57:08 +08:00
Ma Zerun 274a67223e
[Feature] Implement layer-wise learning rate decay optimizer constructor. (#1399)
* [Feature] Implement layer-wise learning rate decay optimizer constructor.

* Use num_layers instead of max_depth to avoid misleading

* Add UT

* Update docstring

* Update log info

* update LearningRateDecay configs

---------

Co-authored-by: fangyixiao18 <fangyx18@hotmail.com>
2023-03-07 17:30:39 +08:00
Yixiao Fang 827be6e22d
[Fix] Fix value error while start training. (#1400)
* fix value error of randint

* add missing key in configs
2023-03-07 08:51:31 +08:00