Commit Graph

38 Commits (v1.0.2)

Author SHA1 Message Date
mzr1996 bf62497e02 Merge remote-tracking branch 'origin/main' into dev 2023-08-15 11:37:22 +08:00
Yuan Liu fa53174fd9
[Feature]: Add MFF (#1725)
* [Feature]: Add MFF

* [Feature]: Add mff linear prob

* [Feature]: Add ft

* [Fix]: Update docstring

* [Feature]: Update out_indices

* [Feature]: Add prefix to ft

* [Feature]: Add README

* [Feature]: Update readme

* [Feature]: Update README

* [Feature]: Add metafile

* [Feature]: Update README

* [Fix]: Fix lint

* [Feature]: Add UT

* [Feature]: Update paper link
2023-08-08 16:01:07 +08:00
No-518 1dda91bf24
[CodeCamp2023-343] Update dataset_prepare.md (#1732)
* Update dataset_prepare.md

* Enhanced docstring for RefCOCO and updated datasets.rst

* fix ln

* update

---------

Co-authored-by: No-518 <wybang@gmail.com>
Co-authored-by: fangyixiao18 <fangyx18@hotmail.com>
2023-08-03 19:24:23 +08:00
Yike Yuan 340d187765
Support Infographic VQA dataset and ANLS metric. (#1667) 2023-08-01 16:22:34 +08:00
Yike Yuan 4f2f3752d9
Support IconQA dataset. (#1670) 2023-08-01 16:14:40 +08:00
fangyixiao18 58a2243d99 Merge branch 'main' into dev 2023-07-28 15:35:55 +08:00
liyl 2b8d8eecb2
[Fix] Fix the issue #1711 "GaussianBlur doesn't work" (#1722)
* Fix issue 1711. GaussianBlur.

* Fix UT

---------

Co-authored-by: mzr1996 <mzr1996@163.com>
2023-07-25 11:25:32 +08:00
Ezra-Yu 78d0ddc852
[Fix] Fix RandomCrop bug (#1706) 2023-07-11 10:18:08 +08:00
Mashiro d6056af2b8
[Fix][New_config] Fix demo bug (#1647)
* Fix demo

* Update implement

---------

Co-authored-by: mzr1996 <mzr1996@163.com>
2023-06-19 15:15:28 +08:00
Yiqin Wang 王逸钦 6d7fe91a98
[Feature] Support Flickr30k Retrieval dataset (#1625)
* format

* remove abs path

* init add flickr30k caption

* remove abs dir

* update blip readme

* add convert sscripts

* minor

* minor
2023-06-19 15:15:03 +08:00
Yike Yuan a673b048a5
[Feature] Add support for VizWiz dataset. (#1636)
* add vizwiz

* update dataset

* [Fix] Build img_path in data_sample.

* Fix isort.

---------

Co-authored-by: ZhangYuanhan-AI <yuanhan002@ntu.edu.sg>
2023-06-16 17:16:17 +08:00
Yixiao Fang aac398a83f
[Feature] Support new configs. (#1639)
* [Feature] Support new configs (#1638)

* add new config of mae and simclr

* update

* update setup.cfg

* update eva

* update

* update new config

* Add new config

* remove __init__.py

* 1. remove ; 2. remove mmpretrain/configs/_base_/models/convnext

* remove model_wrapper_cfg and add out type

* Add comment for setting default_scope to NOne

* update if '_base_' order

* update

* revert changes

---------

Co-authored-by: fangyixiao18 <fangyx18@hotmail.com>

* Add warn at the head of new config files

---------

Co-authored-by: Mashiro <57566630+HAOCHENYE@users.noreply.github.com>
Co-authored-by: mzr1996 <mzr1996@163.com>
2023-06-16 16:54:45 +08:00
Ezra-Yu 93e0f107c4
[Fix] Fix bug loading IN1k dataset. (#1641) 2023-06-16 15:35:27 +08:00
Yike Yuan 7581b76233
[Feature] Add support for vsr dataset (#1634)
* add VSR dataset

* [Fix] Modify example and load gt_answer as string.

---------

Co-authored-by: ZhangYuanhan-AI <yuanhan002@ntu.edu.sg>
2023-06-15 19:17:02 +08:00
Yiqin Wang 王逸钦 bb415b91be
[Feature] Support OCR-VQA dataset (#1621)
* support ocrvqa dataset

* minor

* remove abs path

* refine README
2023-06-13 10:28:45 +08:00
Yuan Liu bddbc085fc
[Feature]: Add image_only param (#1613)
* [Feature]: Add image_only param

* [Feature]: Use image_only
2023-06-06 12:50:42 +08:00
Wangbo Zhao(黑色枷锁) 3a277ee9e6
[Feature] support TextVQA dataset (#1596)
* [Support] Suport TextVQA dataset

* add folder structure

* fix readme
2023-06-02 11:50:38 +08:00
zzc98 bc3c4a35ee
[Refactor] Support to use "split" to specify training set/validation set in the ImageNet dataset (#1535)
* [Feature]: Add caption

* [Feature]: Update scienceqa

* [CI] Add test mim CI. (#879)

* refactor imagenet dataset

* refactor imagenet dataset

* refactor imagenet dataset

* update imagenet21k

* update configs

* update mnist

* update dataset_prepare.md

* fix sun397 url and update user_guides/dataset_prepare.md

* update dataset_prepare.md

* fix sun397 dataset

* fix sun397

* update chinese dataset_prepare.md

* update dataset_prepare.md

* [Refactor] update voc dataset

* [Refactor] update voc dataset

* refactor imagenet

* refactor imagenet

* use mmengine.fileio

---------

Co-authored-by: liuyuan <3463423099@qq.com>
Co-authored-by: Ma Zerun <mzr1996@163.com>
Co-authored-by: Ezra-Yu <18586273+Ezra-Yu@users.noreply.github.com>
2023-06-02 11:03:18 +08:00
Ma Zerun 5bd088ef43
[Fix] Update torchvision transform wrapper (#1595)
* Update torchvision transform wrapper

* Update requirements

* fix unit tests

---------

Co-authored-by: Ezra-Yu <18586273+Ezra-Yu@users.noreply.github.com>
2023-05-26 17:56:09 +08:00
Wangbo Zhao(黑色枷锁) a779c8c5a7
[Feature] Support NoCap dataset based on BLIP. (#1582)
* [Feature] Support nocaps dataset

* precommit

* Use official coco format

* add nocp readme

* fix readme

---------

Co-authored-by: mzr1996 <mzr1996@163.com>
2023-05-23 18:06:43 +08:00
Yuan Liu 46a523ef63
[Feature] Add GQA dataset. (#1585)
* [Feature]: Add GQA dataset

* [Feature]: Add GQA

* [Feature]: Add GQA UT

* [Fix]: Fix hint

* [Feature]: Add BLIP2 GQA

* [Fix]: Fix lint

* [Feature]: Update anno link

* [Fix]: Update docstring

* [Feature]: Update all links
2023-05-23 11:25:42 +08:00
Yuan Liu be389eb846
[Fix] Fix scienceqa (#1581) 2023-05-22 16:10:17 +08:00
Yixiao Fang 1e478462b8
[Feature] Support Chinese CLIP. (#1576)
* support cn-clip

* update README

* Update progress bar

* update order of category

* fix lint

* update

* update readme and metafile

* update

* update docstring

* refactor tokenizer

* fix lint

* Update README and progress bar

---------

Co-authored-by: mzr1996 <mzr1996@163.com>
2023-05-22 15:46:13 +08:00
liuyuan 74f24658e7 [Fix]: Delete GQA 2023-05-22 11:57:18 +08:00
liuyuan 13e4d6c512 [Fix]: Fix UT 2023-05-22 11:55:08 +08:00
liuyuan 1537d46596 [Feature]: Update scienceqa 2023-05-22 11:31:07 +08:00
liuyuan 1b8e86dca6 [Feature]: Add caption 2023-05-22 11:31:07 +08:00
Ma Zerun 6847d20d57
[Feature] Support multiple multi-modal algorithms and inferencers. (#1561)
* [Feat] Migrate blip caption to mmpretrain. (#50)

* Migrate blip caption to mmpretrain

* minor fix

* support train

* [Feature] Support OFA caption task. (#51)

* [Feature] Support OFA caption task.

* Remove duplicated files.

* [Feature] Support OFA vqa task. (#58)

* [Feature] Support OFA vqa task.

* Fix lint.

* [Feat] Add BLIP retrieval to mmpretrain. (#55)

* init

* minor fix for train

* fix according to comments

* refactor

* Update Blip retrieval. (#62)

* [Feature] Support OFA visual grounding task. (#59)

* [Feature] Support OFA visual grounding task.

* minor add TODO

---------

Co-authored-by: yingfhu <yingfhu@gmail.com>

* [Feat] Add flamingos coco caption and vqa. (#60)

* first init

* init flamingo coco

* add vqa

* minor fix

* remove unnecessary modules

* Update config

* Use `ApplyToList`.

---------

Co-authored-by: mzr1996 <mzr1996@163.com>

* [Feature]: BLIP2 coco retrieval  (#53)

* [Feature]: Add blip2 retriever

* [Feature]: Add blip2 all modules

* [Feature]: Refine model

* [Feature]: x1

* [Feature]: Runnable coco ret

* [Feature]: Runnable version

* [Feature]: Fix lint

* [Fix]: Fix lint

* [Feature]: Use 364 img size

* [Feature]: Refactor blip2

* [Fix]: Fix lint

* refactor files

* minor fix

* minor fix

---------

Co-authored-by: yingfhu <yingfhu@gmail.com>

* Remove

* fix blip caption inputs (#68)

* [Feat] Add BLIP NLVR support. (#67)

* first init

* init flamingo coco

* add vqa

* add nlvr

* refactor nlvr

* minor fix

* minor fix

* Update dataset

---------

Co-authored-by: mzr1996 <mzr1996@163.com>

* [Feature]: BLIP2 Caption (#70)

* [Feature]: Add language model

* [Feature]: blip2 caption forward

* [Feature]: Reproduce the results

* [Feature]: Refactor caption

* refine config

---------

Co-authored-by: yingfhu <yingfhu@gmail.com>

* [Feat] Migrate BLIP VQA to mmpretrain (#69)

* reformat

* change

* change

* change

* change

* change

* change

* change

* change

* change

* change

* change

* change

* change

* change

* change

* change

* change

* change

* change

* refactor code

---------

Co-authored-by: yingfhu <yingfhu@gmail.com>

* Update RefCOCO dataset

* [Fix] fix lint

* [Feature] Implement inference APIs for multi-modal tasks. (#65)

* [Feature] Implement inference APIs for multi-modal tasks.

* [Project] Add gradio demo.

* [Improve] Update requirements

* Update flamingo

* Update blip

* Add NLVR inferencer

* Update flamingo

* Update hugging face model register

* Update ofa vqa

* Update BLIP-vqa (#71)

* Update blip-vqa docstring (#72)

* Refine flamingo docstring (#73)

* [Feature]: BLIP2 VQA (#61)

* [Feature]: VQA forward

* [Feature]: Reproduce accuracy

* [Fix]: Fix lint

* [Fix]: Add blank line

* minor fix

---------

Co-authored-by: yingfhu <yingfhu@gmail.com>

* [Feature]: BLIP2 docstring (#74)

* [Feature]: Add caption docstring

* [Feature]: Add docstring to blip2 vqa

* [Feature]: Add docstring to retrieval

* Update BLIP-2 metafile and README (#75)

* [Feature]: Add readme and docstring

* Update blip2 results

---------

Co-authored-by: mzr1996 <mzr1996@163.com>

* [Feature] BLIP Visual Grounding on MMPretrain Branch (#66)

* blip grounding merge with mmpretrain

* remove commit

* blip grounding test and inference api

* refcoco dataset

* refcoco dataset refine config

* rebasing

* gitignore

* rebasing

* minor edit

* minor edit

* Update blip-vqa docstring (#72)

* rebasing

* Revert "minor edit"

This reverts commit 639cec757c215e654625ed0979319e60f0be9044.

* blip grounding final

* precommit

* refine config

* refine config

* Update blip visual grounding

---------

Co-authored-by: Yiqin Wang 王逸钦 <wyq1217@outlook.com>
Co-authored-by: mzr1996 <mzr1996@163.com>

* Update visual grounding metric

* Update OFA docstring, README and metafiles. (#76)

* [Docs] Update installation docs and gradio demo docs. (#77)

* Update OFA name

* Update Visual Grounding Visualizer

* Integrate accelerate support

* Fix imports.

* Fix timm backbone

* Update imports

* Update README

* Update circle ci

* Update flamingo config

* Add gradio demo README

* [Feature]: Add scienceqa (#1571)

* [Feature]: Add scienceqa

* [Feature]: Change param name

* Update docs

* Update video

---------

Co-authored-by: Hubert <42952108+yingfhu@users.noreply.github.com>
Co-authored-by: yingfhu <yingfhu@gmail.com>
Co-authored-by: Yuan Liu <30762564+YuanLiuuuuuu@users.noreply.github.com>
Co-authored-by: Yiqin Wang 王逸钦 <wyq1217@outlook.com>
Co-authored-by: Rongjie Li <limo97@163.com>
2023-05-19 16:50:04 +08:00
zzc98 496e098b21
[Feature] Support some downstream classification datasets. (#1467)
* feat: support some downstream classification datasets

* update sun397

* sum

* update sun397

* [CI] Add test mim CI. (#879)

* feat: support some downstream classification datasets

* update sun397

* sum

* update sun397

* rebase

* feat: support some downstream classification datasets

* update sun397

* update sun397

* update sun397

* update sun397

* fix unittest

* update docstring

* rm

* update

* update

* refactor names of datasets

* refactor some implements of datasets

* refactor some implements of datasets

* fix datasets unittest

* refactor cub and stanford cars

* refactor cub and cifar

* refactor cub and cifar

* refactor cub and cifar

* update downstream datasets and docs

* update docstring

---------

Co-authored-by: Ma Zerun <mzr1996@163.com>
Co-authored-by: Ezra-Yu <18586273+Ezra-Yu@users.noreply.github.com>
2023-05-05 14:43:14 +08:00
Ezra-Yu 645e2b4ed4
[DOC] Fix typo in MultiLabelDataset docstring (#1483)
* fix doc

* fix ci

* fix ci

* fix ci

* fix ci
2023-04-14 13:57:54 +08:00
Ezra-Yu 99e48116aa
[Feature] Register torchvision transforms into mmcls (#1265)
* [Enhance] Add stochastic depth decay rule in resnet. (#1363)

* add stochastic depth decay rule to drop path rate

* add default value

* update

* pass ut

* update

* pass ut

* remove np

* rebase

* update ToPIL and ToNumpy

* rebase

* rebase

* rebase

* rebase

* add readme

* fix review suggestions

* rebase

* fix conflicts

* fix conflicts

* fix lint

* remove comments

* remove useless code

* update docstring

* update doc API

* update doc

---------

Co-authored-by: Yixiao Fang <36138628+fangyixiao18@users.noreply.github.com>
2023-04-13 18:05:57 +08:00
Ma Zerun c4ccae40db
[Docs] Update user guides docs and tools for MMPretrain. (#1429)
* [Docs] Update user guides docs and tools for MMPretrain.

* Fix UT

* Fix Chinese docs.

* Improve according to comments.

* Fix windows CI.
2023-03-27 14:32:26 +08:00
Yixiao Fang 827be6e22d
[Fix] Fix value error while start training. (#1400)
* fix value error of randint

* add missing key in configs
2023-03-07 08:51:31 +08:00
Yixiao Fang 08dc8c75d3
[Refactor] Add selfsup algorithms. (#1389)
* remove basehead

* add moco series

* add byol simclr simsiam

* add ut

* update configs

* add simsiam hook

* add and refactor beit

* update ut

* add cae

* update extract_feat

* refactor cae

* add mae

* refactor data preprocessor

* update heads

* add maskfeat

* add milan

* add simmim

* add mixmim

* fix lint

* fix ut

* fix lint

* add eva

* add densecl

* add barlowtwins

* add swav

* fix lint

* update readtherdocs rst

* update docs

* update

* Decrease UT memory usage

* Fix docstring

* update DALLEEncoder

* Update model docs

* refactor dalle encoder

* update docstring

* fix ut

* fix config error

* add val_cfg and test_cfg

* refactor clip generator

* fix lint

* pass check

* fix ut

* add lars

* update type of BEiT in configs

* Use MMEngine style momentum in EMA.

* apply mmpretrain solarize

---------

Co-authored-by: mzr1996 <mzr1996@163.com>
2023-03-06 16:53:15 +08:00
Ma Zerun a05c79e806
[Refactor] Move transforms in mmselfsup to mmpretrain. (#1396)
* [Refactor] Move transforms in mmselfsup to mmpretrain.

* Update transform docs and configs. And register some mmcv transforms in
mmpretrain.

* Fix missing transform wrapper.

* update selfsup transforms

* Fix UT

* Fix UT

* update gaussianblur inconfigs

---------

Co-authored-by: fangyixiao18 <fangyx18@hotmail.com>
2023-03-03 15:01:11 +08:00
Yixiao Fang 75c79311f4
[Refactor] Update datasets (#1375)
* add ut

* add places205

* support ann_file without labels

* temp test

* update custom

* update

* update ut

* Update CustomDataset.

* Update Places205.

---------

Co-authored-by: mzr1996 <mzr1996@163.com>
2023-02-27 15:42:22 +08:00
Ma Zerun 36bea13fca
[Refactor] Refactor ClsDatasample to a union DataSample. (#1371)
* [Refactor] Refactor ClsDatasample to a union DataSample.

* Add  method

* Fix docstring

* Update docstring.
2023-02-23 10:07:53 +08:00
mzr1996 0979e78573 Rename the package name to `mmpretrain`. 2023-02-17 15:20:55 +08:00