mzr1996
bf62497e02
Merge remote-tracking branch 'origin/main' into dev
2023-08-15 11:37:22 +08:00
Yuan Liu
fa53174fd9
[Feature]: Add MFF ( #1725 )
...
* [Feature]: Add MFF
* [Feature]: Add mff linear prob
* [Feature]: Add ft
* [Fix]: Update docstring
* [Feature]: Update out_indices
* [Feature]: Add prefix to ft
* [Feature]: Add README
* [Feature]: Update readme
* [Feature]: Update README
* [Feature]: Add metafile
* [Feature]: Update README
* [Fix]: Fix lint
* [Feature]: Add UT
* [Feature]: Update paper link
2023-08-08 16:01:07 +08:00
No-518
1dda91bf24
[CodeCamp2023-343] Update dataset_prepare.md ( #1732 )
...
* Update dataset_prepare.md
* Enhanced docstring for RefCOCO and updated datasets.rst
* fix ln
* update
---------
Co-authored-by: No-518 <wybang@gmail.com>
Co-authored-by: fangyixiao18 <fangyx18@hotmail.com>
2023-08-03 19:24:23 +08:00
Yike Yuan
340d187765
Support Infographic VQA dataset and ANLS metric. ( #1667 )
2023-08-01 16:22:34 +08:00
Yike Yuan
4f2f3752d9
Support IconQA dataset. ( #1670 )
2023-08-01 16:14:40 +08:00
fangyixiao18
58a2243d99
Merge branch 'main' into dev
2023-07-28 15:35:55 +08:00
liyl
2b8d8eecb2
[Fix] Fix the issue #1711 "GaussianBlur doesn't work" ( #1722 )
...
* Fix issue 1711. GaussianBlur.
* Fix UT
---------
Co-authored-by: mzr1996 <mzr1996@163.com>
2023-07-25 11:25:32 +08:00
Ezra-Yu
78d0ddc852
[Fix] Fix RandomCrop bug ( #1706 )
2023-07-11 10:18:08 +08:00
Mashiro
d6056af2b8
[Fix][New_config] Fix demo bug ( #1647 )
...
* Fix demo
* Update implement
---------
Co-authored-by: mzr1996 <mzr1996@163.com>
2023-06-19 15:15:28 +08:00
Yiqin Wang 王逸钦
6d7fe91a98
[Feature] Support Flickr30k Retrieval dataset ( #1625 )
...
* format
* remove abs path
* init add flickr30k caption
* remove abs dir
* update blip readme
* add convert sscripts
* minor
* minor
2023-06-19 15:15:03 +08:00
Yike Yuan
a673b048a5
[Feature] Add support for VizWiz dataset. ( #1636 )
...
* add vizwiz
* update dataset
* [Fix] Build img_path in data_sample.
* Fix isort.
---------
Co-authored-by: ZhangYuanhan-AI <yuanhan002@ntu.edu.sg>
2023-06-16 17:16:17 +08:00
Yixiao Fang
aac398a83f
[Feature] Support new configs. ( #1639 )
...
* [Feature] Support new configs (#1638 )
* add new config of mae and simclr
* update
* update setup.cfg
* update eva
* update
* update new config
* Add new config
* remove __init__.py
* 1. remove ; 2. remove mmpretrain/configs/_base_/models/convnext
* remove model_wrapper_cfg and add out type
* Add comment for setting default_scope to NOne
* update if '_base_' order
* update
* revert changes
---------
Co-authored-by: fangyixiao18 <fangyx18@hotmail.com>
* Add warn at the head of new config files
---------
Co-authored-by: Mashiro <57566630+HAOCHENYE@users.noreply.github.com>
Co-authored-by: mzr1996 <mzr1996@163.com>
2023-06-16 16:54:45 +08:00
Ezra-Yu
93e0f107c4
[Fix] Fix bug loading IN1k dataset. ( #1641 )
2023-06-16 15:35:27 +08:00
Yike Yuan
7581b76233
[Feature] Add support for vsr dataset ( #1634 )
...
* add VSR dataset
* [Fix] Modify example and load gt_answer as string.
---------
Co-authored-by: ZhangYuanhan-AI <yuanhan002@ntu.edu.sg>
2023-06-15 19:17:02 +08:00
Yiqin Wang 王逸钦
bb415b91be
[Feature] Support OCR-VQA dataset ( #1621 )
...
* support ocrvqa dataset
* minor
* remove abs path
* refine README
2023-06-13 10:28:45 +08:00
Yuan Liu
bddbc085fc
[Feature]: Add image_only param ( #1613 )
...
* [Feature]: Add image_only param
* [Feature]: Use image_only
2023-06-06 12:50:42 +08:00
Wangbo Zhao(黑色枷锁)
3a277ee9e6
[Feature] support TextVQA dataset ( #1596 )
...
* [Support] Suport TextVQA dataset
* add folder structure
* fix readme
2023-06-02 11:50:38 +08:00
zzc98
bc3c4a35ee
[Refactor] Support to use "split" to specify training set/validation set in the ImageNet dataset ( #1535 )
...
* [Feature]: Add caption
* [Feature]: Update scienceqa
* [CI] Add test mim CI. (#879 )
* refactor imagenet dataset
* refactor imagenet dataset
* refactor imagenet dataset
* update imagenet21k
* update configs
* update mnist
* update dataset_prepare.md
* fix sun397 url and update user_guides/dataset_prepare.md
* update dataset_prepare.md
* fix sun397 dataset
* fix sun397
* update chinese dataset_prepare.md
* update dataset_prepare.md
* [Refactor] update voc dataset
* [Refactor] update voc dataset
* refactor imagenet
* refactor imagenet
* use mmengine.fileio
---------
Co-authored-by: liuyuan <3463423099@qq.com>
Co-authored-by: Ma Zerun <mzr1996@163.com>
Co-authored-by: Ezra-Yu <18586273+Ezra-Yu@users.noreply.github.com>
2023-06-02 11:03:18 +08:00
Ma Zerun
5bd088ef43
[Fix] Update torchvision transform wrapper ( #1595 )
...
* Update torchvision transform wrapper
* Update requirements
* fix unit tests
---------
Co-authored-by: Ezra-Yu <18586273+Ezra-Yu@users.noreply.github.com>
2023-05-26 17:56:09 +08:00
Wangbo Zhao(黑色枷锁)
a779c8c5a7
[Feature] Support NoCap dataset based on BLIP. ( #1582 )
...
* [Feature] Support nocaps dataset
* precommit
* Use official coco format
* add nocp readme
* fix readme
---------
Co-authored-by: mzr1996 <mzr1996@163.com>
2023-05-23 18:06:43 +08:00
Yuan Liu
46a523ef63
[Feature] Add GQA dataset. ( #1585 )
...
* [Feature]: Add GQA dataset
* [Feature]: Add GQA
* [Feature]: Add GQA UT
* [Fix]: Fix hint
* [Feature]: Add BLIP2 GQA
* [Fix]: Fix lint
* [Feature]: Update anno link
* [Fix]: Update docstring
* [Feature]: Update all links
2023-05-23 11:25:42 +08:00
Yuan Liu
be389eb846
[Fix] Fix scienceqa ( #1581 )
2023-05-22 16:10:17 +08:00
Yixiao Fang
1e478462b8
[Feature] Support Chinese CLIP. ( #1576 )
...
* support cn-clip
* update README
* Update progress bar
* update order of category
* fix lint
* update
* update readme and metafile
* update
* update docstring
* refactor tokenizer
* fix lint
* Update README and progress bar
---------
Co-authored-by: mzr1996 <mzr1996@163.com>
2023-05-22 15:46:13 +08:00
liuyuan
74f24658e7
[Fix]: Delete GQA
2023-05-22 11:57:18 +08:00
liuyuan
13e4d6c512
[Fix]: Fix UT
2023-05-22 11:55:08 +08:00
liuyuan
1537d46596
[Feature]: Update scienceqa
2023-05-22 11:31:07 +08:00
liuyuan
1b8e86dca6
[Feature]: Add caption
2023-05-22 11:31:07 +08:00
Ma Zerun
6847d20d57
[Feature] Support multiple multi-modal algorithms and inferencers. ( #1561 )
...
* [Feat] Migrate blip caption to mmpretrain. (#50 )
* Migrate blip caption to mmpretrain
* minor fix
* support train
* [Feature] Support OFA caption task. (#51 )
* [Feature] Support OFA caption task.
* Remove duplicated files.
* [Feature] Support OFA vqa task. (#58 )
* [Feature] Support OFA vqa task.
* Fix lint.
* [Feat] Add BLIP retrieval to mmpretrain. (#55 )
* init
* minor fix for train
* fix according to comments
* refactor
* Update Blip retrieval. (#62 )
* [Feature] Support OFA visual grounding task. (#59 )
* [Feature] Support OFA visual grounding task.
* minor add TODO
---------
Co-authored-by: yingfhu <yingfhu@gmail.com>
* [Feat] Add flamingos coco caption and vqa. (#60 )
* first init
* init flamingo coco
* add vqa
* minor fix
* remove unnecessary modules
* Update config
* Use `ApplyToList`.
---------
Co-authored-by: mzr1996 <mzr1996@163.com>
* [Feature]: BLIP2 coco retrieval (#53 )
* [Feature]: Add blip2 retriever
* [Feature]: Add blip2 all modules
* [Feature]: Refine model
* [Feature]: x1
* [Feature]: Runnable coco ret
* [Feature]: Runnable version
* [Feature]: Fix lint
* [Fix]: Fix lint
* [Feature]: Use 364 img size
* [Feature]: Refactor blip2
* [Fix]: Fix lint
* refactor files
* minor fix
* minor fix
---------
Co-authored-by: yingfhu <yingfhu@gmail.com>
* Remove
* fix blip caption inputs (#68 )
* [Feat] Add BLIP NLVR support. (#67 )
* first init
* init flamingo coco
* add vqa
* add nlvr
* refactor nlvr
* minor fix
* minor fix
* Update dataset
---------
Co-authored-by: mzr1996 <mzr1996@163.com>
* [Feature]: BLIP2 Caption (#70 )
* [Feature]: Add language model
* [Feature]: blip2 caption forward
* [Feature]: Reproduce the results
* [Feature]: Refactor caption
* refine config
---------
Co-authored-by: yingfhu <yingfhu@gmail.com>
* [Feat] Migrate BLIP VQA to mmpretrain (#69 )
* reformat
* change
* change
* change
* change
* change
* change
* change
* change
* change
* change
* change
* change
* change
* change
* change
* change
* change
* change
* change
* refactor code
---------
Co-authored-by: yingfhu <yingfhu@gmail.com>
* Update RefCOCO dataset
* [Fix] fix lint
* [Feature] Implement inference APIs for multi-modal tasks. (#65 )
* [Feature] Implement inference APIs for multi-modal tasks.
* [Project] Add gradio demo.
* [Improve] Update requirements
* Update flamingo
* Update blip
* Add NLVR inferencer
* Update flamingo
* Update hugging face model register
* Update ofa vqa
* Update BLIP-vqa (#71 )
* Update blip-vqa docstring (#72 )
* Refine flamingo docstring (#73 )
* [Feature]: BLIP2 VQA (#61 )
* [Feature]: VQA forward
* [Feature]: Reproduce accuracy
* [Fix]: Fix lint
* [Fix]: Add blank line
* minor fix
---------
Co-authored-by: yingfhu <yingfhu@gmail.com>
* [Feature]: BLIP2 docstring (#74 )
* [Feature]: Add caption docstring
* [Feature]: Add docstring to blip2 vqa
* [Feature]: Add docstring to retrieval
* Update BLIP-2 metafile and README (#75 )
* [Feature]: Add readme and docstring
* Update blip2 results
---------
Co-authored-by: mzr1996 <mzr1996@163.com>
* [Feature] BLIP Visual Grounding on MMPretrain Branch (#66 )
* blip grounding merge with mmpretrain
* remove commit
* blip grounding test and inference api
* refcoco dataset
* refcoco dataset refine config
* rebasing
* gitignore
* rebasing
* minor edit
* minor edit
* Update blip-vqa docstring (#72 )
* rebasing
* Revert "minor edit"
This reverts commit 639cec757c215e654625ed0979319e60f0be9044.
* blip grounding final
* precommit
* refine config
* refine config
* Update blip visual grounding
---------
Co-authored-by: Yiqin Wang 王逸钦 <wyq1217@outlook.com>
Co-authored-by: mzr1996 <mzr1996@163.com>
* Update visual grounding metric
* Update OFA docstring, README and metafiles. (#76 )
* [Docs] Update installation docs and gradio demo docs. (#77 )
* Update OFA name
* Update Visual Grounding Visualizer
* Integrate accelerate support
* Fix imports.
* Fix timm backbone
* Update imports
* Update README
* Update circle ci
* Update flamingo config
* Add gradio demo README
* [Feature]: Add scienceqa (#1571 )
* [Feature]: Add scienceqa
* [Feature]: Change param name
* Update docs
* Update video
---------
Co-authored-by: Hubert <42952108+yingfhu@users.noreply.github.com>
Co-authored-by: yingfhu <yingfhu@gmail.com>
Co-authored-by: Yuan Liu <30762564+YuanLiuuuuuu@users.noreply.github.com>
Co-authored-by: Yiqin Wang 王逸钦 <wyq1217@outlook.com>
Co-authored-by: Rongjie Li <limo97@163.com>
2023-05-19 16:50:04 +08:00
zzc98
496e098b21
[Feature] Support some downstream classification datasets. ( #1467 )
...
* feat: support some downstream classification datasets
* update sun397
* sum
* update sun397
* [CI] Add test mim CI. (#879 )
* feat: support some downstream classification datasets
* update sun397
* sum
* update sun397
* rebase
* feat: support some downstream classification datasets
* update sun397
* update sun397
* update sun397
* update sun397
* fix unittest
* update docstring
* rm
* update
* update
* refactor names of datasets
* refactor some implements of datasets
* refactor some implements of datasets
* fix datasets unittest
* refactor cub and stanford cars
* refactor cub and cifar
* refactor cub and cifar
* refactor cub and cifar
* update downstream datasets and docs
* update docstring
---------
Co-authored-by: Ma Zerun <mzr1996@163.com>
Co-authored-by: Ezra-Yu <18586273+Ezra-Yu@users.noreply.github.com>
2023-05-05 14:43:14 +08:00
Ezra-Yu
645e2b4ed4
[DOC] Fix typo in MultiLabelDataset docstring ( #1483 )
...
* fix doc
* fix ci
* fix ci
* fix ci
* fix ci
2023-04-14 13:57:54 +08:00
Ezra-Yu
99e48116aa
[Feature] Register torchvision transforms into mmcls ( #1265 )
...
* [Enhance] Add stochastic depth decay rule in resnet. (#1363 )
* add stochastic depth decay rule to drop path rate
* add default value
* update
* pass ut
* update
* pass ut
* remove np
* rebase
* update ToPIL and ToNumpy
* rebase
* rebase
* rebase
* rebase
* add readme
* fix review suggestions
* rebase
* fix conflicts
* fix conflicts
* fix lint
* remove comments
* remove useless code
* update docstring
* update doc API
* update doc
---------
Co-authored-by: Yixiao Fang <36138628+fangyixiao18@users.noreply.github.com>
2023-04-13 18:05:57 +08:00
Ma Zerun
c4ccae40db
[Docs] Update user guides docs and tools for MMPretrain. ( #1429 )
...
* [Docs] Update user guides docs and tools for MMPretrain.
* Fix UT
* Fix Chinese docs.
* Improve according to comments.
* Fix windows CI.
2023-03-27 14:32:26 +08:00
Yixiao Fang
827be6e22d
[Fix] Fix value error while start training. ( #1400 )
...
* fix value error of randint
* add missing key in configs
2023-03-07 08:51:31 +08:00
Yixiao Fang
08dc8c75d3
[Refactor] Add selfsup algorithms. ( #1389 )
...
* remove basehead
* add moco series
* add byol simclr simsiam
* add ut
* update configs
* add simsiam hook
* add and refactor beit
* update ut
* add cae
* update extract_feat
* refactor cae
* add mae
* refactor data preprocessor
* update heads
* add maskfeat
* add milan
* add simmim
* add mixmim
* fix lint
* fix ut
* fix lint
* add eva
* add densecl
* add barlowtwins
* add swav
* fix lint
* update readtherdocs rst
* update docs
* update
* Decrease UT memory usage
* Fix docstring
* update DALLEEncoder
* Update model docs
* refactor dalle encoder
* update docstring
* fix ut
* fix config error
* add val_cfg and test_cfg
* refactor clip generator
* fix lint
* pass check
* fix ut
* add lars
* update type of BEiT in configs
* Use MMEngine style momentum in EMA.
* apply mmpretrain solarize
---------
Co-authored-by: mzr1996 <mzr1996@163.com>
2023-03-06 16:53:15 +08:00
Ma Zerun
a05c79e806
[Refactor] Move transforms in mmselfsup to mmpretrain. ( #1396 )
...
* [Refactor] Move transforms in mmselfsup to mmpretrain.
* Update transform docs and configs. And register some mmcv transforms in
mmpretrain.
* Fix missing transform wrapper.
* update selfsup transforms
* Fix UT
* Fix UT
* update gaussianblur inconfigs
---------
Co-authored-by: fangyixiao18 <fangyx18@hotmail.com>
2023-03-03 15:01:11 +08:00
Yixiao Fang
75c79311f4
[Refactor] Update datasets ( #1375 )
...
* add ut
* add places205
* support ann_file without labels
* temp test
* update custom
* update
* update ut
* Update CustomDataset.
* Update Places205.
---------
Co-authored-by: mzr1996 <mzr1996@163.com>
2023-02-27 15:42:22 +08:00
Ma Zerun
36bea13fca
[Refactor] Refactor ClsDatasample to a union DataSample. ( #1371 )
...
* [Refactor] Refactor ClsDatasample to a union DataSample.
* Add method
* Fix docstring
* Update docstring.
2023-02-23 10:07:53 +08:00
mzr1996
0979e78573
Rename the package name to `mmpretrain`.
2023-02-17 15:20:55 +08:00