Commit Graph

53 Commits (b058912c0cc79312cf656ca98b4068c30a9f9cdd)

Author SHA1 Message Date
Yixiao Fang 1e478462b8
[Feature] Support Chinese CLIP. (#1576)
* support cn-clip

* update README

* Update progress bar

* update order of category

* fix lint

* update

* update readme and metafile

* update

* update docstring

* refactor tokenizer

* fix lint

* Update README and progress bar

---------

Co-authored-by: mzr1996 <mzr1996@163.com>
2023-05-22 15:46:13 +08:00
liuyuan 74f24658e7 [Fix]: Delete GQA 2023-05-22 11:57:18 +08:00
liuyuan 13e4d6c512 [Fix]: Fix UT 2023-05-22 11:55:08 +08:00
liuyuan b0ad99afb9 [Fix]: Fix bug 2023-05-22 11:38:34 +08:00
liuyuan 1537d46596 [Feature]: Update scienceqa 2023-05-22 11:31:07 +08:00
liuyuan 87f849cbb6 [Feature]: Add scienceqa metric 2023-05-22 11:31:07 +08:00
liuyuan 1b8e86dca6 [Feature]: Add caption 2023-05-22 11:31:07 +08:00
Ma Zerun 6847d20d57
[Feature] Support multiple multi-modal algorithms and inferencers. (#1561)
* [Feat] Migrate blip caption to mmpretrain. (#50)

* Migrate blip caption to mmpretrain

* minor fix

* support train

* [Feature] Support OFA caption task. (#51)

* [Feature] Support OFA caption task.

* Remove duplicated files.

* [Feature] Support OFA vqa task. (#58)

* [Feature] Support OFA vqa task.

* Fix lint.

* [Feat] Add BLIP retrieval to mmpretrain. (#55)

* init

* minor fix for train

* fix according to comments

* refactor

* Update Blip retrieval. (#62)

* [Feature] Support OFA visual grounding task. (#59)

* [Feature] Support OFA visual grounding task.

* minor add TODO

---------

Co-authored-by: yingfhu <yingfhu@gmail.com>

* [Feat] Add flamingos coco caption and vqa. (#60)

* first init

* init flamingo coco

* add vqa

* minor fix

* remove unnecessary modules

* Update config

* Use `ApplyToList`.

---------

Co-authored-by: mzr1996 <mzr1996@163.com>

* [Feature]: BLIP2 coco retrieval  (#53)

* [Feature]: Add blip2 retriever

* [Feature]: Add blip2 all modules

* [Feature]: Refine model

* [Feature]: x1

* [Feature]: Runnable coco ret

* [Feature]: Runnable version

* [Feature]: Fix lint

* [Fix]: Fix lint

* [Feature]: Use 364 img size

* [Feature]: Refactor blip2

* [Fix]: Fix lint

* refactor files

* minor fix

* minor fix

---------

Co-authored-by: yingfhu <yingfhu@gmail.com>

* Remove

* fix blip caption inputs (#68)

* [Feat] Add BLIP NLVR support. (#67)

* first init

* init flamingo coco

* add vqa

* add nlvr

* refactor nlvr

* minor fix

* minor fix

* Update dataset

---------

Co-authored-by: mzr1996 <mzr1996@163.com>

* [Feature]: BLIP2 Caption (#70)

* [Feature]: Add language model

* [Feature]: blip2 caption forward

* [Feature]: Reproduce the results

* [Feature]: Refactor caption

* refine config

---------

Co-authored-by: yingfhu <yingfhu@gmail.com>

* [Feat] Migrate BLIP VQA to mmpretrain (#69)

* reformat

* change

* change

* change

* change

* change

* change

* change

* change

* change

* change

* change

* change

* change

* change

* change

* change

* change

* change

* change

* refactor code

---------

Co-authored-by: yingfhu <yingfhu@gmail.com>

* Update RefCOCO dataset

* [Fix] fix lint

* [Feature] Implement inference APIs for multi-modal tasks. (#65)

* [Feature] Implement inference APIs for multi-modal tasks.

* [Project] Add gradio demo.

* [Improve] Update requirements

* Update flamingo

* Update blip

* Add NLVR inferencer

* Update flamingo

* Update hugging face model register

* Update ofa vqa

* Update BLIP-vqa (#71)

* Update blip-vqa docstring (#72)

* Refine flamingo docstring (#73)

* [Feature]: BLIP2 VQA (#61)

* [Feature]: VQA forward

* [Feature]: Reproduce accuracy

* [Fix]: Fix lint

* [Fix]: Add blank line

* minor fix

---------

Co-authored-by: yingfhu <yingfhu@gmail.com>

* [Feature]: BLIP2 docstring (#74)

* [Feature]: Add caption docstring

* [Feature]: Add docstring to blip2 vqa

* [Feature]: Add docstring to retrieval

* Update BLIP-2 metafile and README (#75)

* [Feature]: Add readme and docstring

* Update blip2 results

---------

Co-authored-by: mzr1996 <mzr1996@163.com>

* [Feature] BLIP Visual Grounding on MMPretrain Branch (#66)

* blip grounding merge with mmpretrain

* remove commit

* blip grounding test and inference api

* refcoco dataset

* refcoco dataset refine config

* rebasing

* gitignore

* rebasing

* minor edit

* minor edit

* Update blip-vqa docstring (#72)

* rebasing

* Revert "minor edit"

This reverts commit 639cec757c215e654625ed0979319e60f0be9044.

* blip grounding final

* precommit

* refine config

* refine config

* Update blip visual grounding

---------

Co-authored-by: Yiqin Wang 王逸钦 <wyq1217@outlook.com>
Co-authored-by: mzr1996 <mzr1996@163.com>

* Update visual grounding metric

* Update OFA docstring, README and metafiles. (#76)

* [Docs] Update installation docs and gradio demo docs. (#77)

* Update OFA name

* Update Visual Grounding Visualizer

* Integrate accelerate support

* Fix imports.

* Fix timm backbone

* Update imports

* Update README

* Update circle ci

* Update flamingo config

* Add gradio demo README

* [Feature]: Add scienceqa (#1571)

* [Feature]: Add scienceqa

* [Feature]: Change param name

* Update docs

* Update video

---------

Co-authored-by: Hubert <42952108+yingfhu@users.noreply.github.com>
Co-authored-by: yingfhu <yingfhu@gmail.com>
Co-authored-by: Yuan Liu <30762564+YuanLiuuuuuu@users.noreply.github.com>
Co-authored-by: Yiqin Wang 王逸钦 <wyq1217@outlook.com>
Co-authored-by: Rongjie Li <limo97@163.com>
2023-05-19 16:50:04 +08:00
Yixiao Fang 770eb8e24a
[Fix] Fix ddp bugs caused by `out_type`. (#1570)
* set out_type to be 'raw'

* update test
2023-05-17 17:32:10 +08:00
zzc98 034919d032
[Feature] add eva02 backbone (#1450)
* [CI] Add test mim CI. (#879)

* [CI] Add test mim CI. (#879)

* feat: add eva02 backbone

* feat: add eva02 backbone

* feat: add eva02 backbone

* feat: add eva02 backbone

* feat: add eva02 backbone

* feat: add eva02 backbone

* feat: add eva02 backbone

* feat: add eva02 backbone

* update

* update ci

* rebase

* feat: add eva02 backbone

* feat: add eva02 backbone

* feat: add eva02 backbone

* feat: add eva02 backbone

* feat: add eva02 backbone

* feat: add eva02 backbone

* feat: add eva02 backbone

* feat: add eva02 backbone

* update

* update readme and configs

* update readme and configs

* refactore eva02

* [CI] Add test mim CI. (#879)

* feat: add eva02 backbone

* feat: add eva02 backbone

* feat: add eva02 backbone

* feat: add eva02 backbone

* feat: add eva02 backbone

* feat: add eva02 backbone

* feat: add eva02 backbone

* feat: add eva02 backbone

* update

* update ci

* rebase

* feat: add eva02 backbone

* feat: add eva02 backbone

* feat: add eva02 backbone

* update

* update readme and configs

* refactore eva02

* update readme and metafile

* update readme and metafile

* update readme and metafile

* update

* rename eva02

* rename eva02

* fix uts

* rename configs

---------

Co-authored-by: Ma Zerun <mzr1996@163.com>
Co-authored-by: Ezra-Yu <18586273+Ezra-Yu@users.noreply.github.com>
2023-05-06 19:28:31 +08:00
Ezra-Yu 7f4eccbecf
[Fix] Fix multi-task-head loss potential bug (#1530)
* fix bug

* add comments
2023-05-06 18:04:34 +08:00
Ezra-Yu 9cf37b315c
[DOC] Refine Inference Doc (#1489)
* update en doc

* update

* update zh doc

* refine

* refine
2023-05-06 17:54:13 +08:00
Kei-Chi Tse afa60c73bb
[Fix] Support bce loss without batch augmentations (#1525)
* Support bce loss without batch augmentations

---------

Co-authored-by: Ezra-Yu <18586273+Ezra-Yu@users.noreply.github.com>
2023-05-05 17:19:42 +08:00
Yixiao Fang d9e561a09d
[Feature] Support dinov2 backbone (#1522)
* support dinov2 backbone

* update metafile and readme

* compatible to use_layer_scale

* update SwiGLUFFN

* add deprecation warning

* update
2023-05-05 16:59:37 +08:00
zzc98 496e098b21
[Feature] Support some downstream classification datasets. (#1467)
* feat: support some downstream classification datasets

* update sun397

* sum

* update sun397

* [CI] Add test mim CI. (#879)

* feat: support some downstream classification datasets

* update sun397

* sum

* update sun397

* rebase

* feat: support some downstream classification datasets

* update sun397

* update sun397

* update sun397

* update sun397

* fix unittest

* update docstring

* rm

* update

* update

* refactor names of datasets

* refactor some implements of datasets

* refactor some implements of datasets

* fix datasets unittest

* refactor cub and stanford cars

* refactor cub and cifar

* refactor cub and cifar

* refactor cub and cifar

* update downstream datasets and docs

* update docstring

---------

Co-authored-by: Ma Zerun <mzr1996@163.com>
Co-authored-by: Ezra-Yu <18586273+Ezra-Yu@users.noreply.github.com>
2023-05-05 14:43:14 +08:00
Yixiao Fang 15cc2a5193
[Fix] Fix clip generator init bug (#1518) 2023-04-25 19:35:09 +08:00
Wangbo Zhao(黑色枷锁) e954cf0aaf
[Fix] Fix the bug in binary cross entropy loss (#1499)
* [Fix] Fix the bug in binary cross entropy loss

 Fix the bug in binary cross entropy loss when using multi-label datasets e.g.VOC2007

* update ci

---------

Co-authored-by: Ezra-Yu <18586273+Ezra-Yu@users.noreply.github.com>
2023-04-19 13:53:31 +08:00
Yixiao Fang 2c913020b9
[Refactor] Support to freeze channel reduction and add layer decay function (#1490)
* support to freeze channel reduction module

* add layer decay setting function
2023-04-17 13:36:47 +08:00
Yixiao Fang e93d124ad4
[Refactor] Support resizing pos_embed while loading ckpt and format output (#1488)
* support resize pos_embed while loading ckpt

* update
2023-04-14 19:08:35 +08:00
Ezra-Yu 645e2b4ed4
[DOC] Fix typo in MultiLabelDataset docstring (#1483)
* fix doc

* fix ci

* fix ci

* fix ci

* fix ci
2023-04-14 13:57:54 +08:00
Ezra-Yu 99e48116aa
[Feature] Register torchvision transforms into mmcls (#1265)
* [Enhance] Add stochastic depth decay rule in resnet. (#1363)

* add stochastic depth decay rule to drop path rate

* add default value

* update

* pass ut

* update

* pass ut

* remove np

* rebase

* update ToPIL and ToNumpy

* rebase

* rebase

* rebase

* rebase

* add readme

* fix review suggestions

* rebase

* fix conflicts

* fix conflicts

* fix lint

* remove comments

* remove useless code

* update docstring

* update doc API

* update doc

---------

Co-authored-by: Yixiao Fang <36138628+fangyixiao18@users.noreply.github.com>
2023-04-13 18:05:57 +08:00
Yixiao Fang 0826df8963
[Feature] Add ViT of SAM (#1476)
* add vit of sam

* update

* update

* add ut

* update ut

* remove num_classes

* support dynamic input

* add ut

* add comments

* update ut
2023-04-13 17:03:28 +08:00
Yixiao Fang 9cbeceabb5
Bump version to v1.0.0rc7 (#1465)
* update

* update info

* update changelog

* update

* update description

* change to v1.0.0rc7
2023-04-07 17:34:21 +08:00
Ezra-Yu 47e033c466
[Fix] fix pbn bug (#1466) 2023-04-07 15:38:04 +08:00
Yixiao Fang 79ddc0f874
[Refactor] Update CI and issue template (#1463)
* Update CI

* update issue template

* update

* update collect_env function
2023-04-07 14:29:41 +08:00
Ezra-Yu 05124dbb71 fix lint 2023-04-06 22:01:11 +08:00
Ezra-Yu b8cab5c9f7 update readme 2023-04-06 21:56:25 +08:00
Ezra-Yu 3932ddec10 update ckpt path 2023-04-06 21:56:25 +08:00
techmonsterwang a6c24d104e update riformer mmpretrain 2023-04-06 21:56:25 +08:00
techmonsterwang 32c258ff19 update riformer mmpretrain 2023-04-06 21:56:25 +08:00
techmonsterwang 0b70c108b0 update riformer mmpretrain 2023-04-06 21:56:25 +08:00
Yixiao Fang 1ee9bbe050
[Docs] Update links (#1457)
* update links

* update readtherdocs

* update

* update

* fix lint

* update

* update

* update

* update cov branch

* update

* update

* update
2023-04-06 20:58:52 +08:00
Yixiao Fang 75dceaa78f
[Refactor] Add ln to vit avg_featmap output (#1447) 2023-04-06 11:59:39 +08:00
Ezra-Yu 3a25b13eb3
[Fix] Update CI (#1454)
* update ci

* update window ci

* update window ci

* update window ci

* update window ci

* update window ci

* update window ci

* update window ci

* update window ci

* update window ci

* update window ci

* update window ci

* ignore tests/test_tools.py
2023-04-06 10:52:08 +08:00
Ma Zerun b017670e1b
[Improve] Use PyTorch official `scaled_dot_product_attention` to accelerate `MultiheadAttention`. (#1434)
* [Improve] Use PyTorch official `scaled_dot_product_attention` to accelerate `MultiheadAttention`.

* Support `--local-rank` and `--amp` option for new version PyTorch.

* Fix imports and UT.
2023-03-29 15:50:44 +08:00
Ma Zerun c4ccae40db
[Docs] Update user guides docs and tools for MMPretrain. (#1429)
* [Docs] Update user guides docs and tools for MMPretrain.

* Fix UT

* Fix Chinese docs.

* Improve according to comments.

* Fix windows CI.
2023-03-27 14:32:26 +08:00
Ma Zerun 6cedce234e
[Refactor] Update dev scripts to be compatible with selfsup tasks. (#1412)
* [Refactor] Update dev scripts to be compatible with selfsup tasks.

* Fix some missing fields in config files.

* Set maximum number of gpus for local training.

* Update README files

* Update according to comments.
2023-03-20 14:30:57 +08:00
Yixiao Fang 8875e9da92
[Docs] Update migration.md (#1417)
* update migration

* refine table

* update zh_cn

* fix lint

* Polish the documentation by ChatGPT.

* Update sphinx version and fix some warning.

---------

Co-authored-by: mzr1996 <mzr1996@163.com>
2023-03-17 10:30:09 +08:00
Yixiao Fang 76a1f3f735
[Refactor] Refactor the `browse_dataset.py` to support selfsup pipeline. (#1414)
* refactor browsedataset to support selfsup pipeline

* update make_grid to support list input

* mode 'transformed' supports list

* Beautify the visualization image.

* Fix compatitibly bug with matplotlib=3.5

* remove print

* fix bug of resize

* Apply mask only on the first image.

* Remove master only for some API in visualizer.

---------

Co-authored-by: mzr1996 <mzr1996@163.com>
2023-03-15 14:18:36 +08:00
Ma Zerun 3472ee5d2c
[Feature] Implememnt the universal visualizer for multiple task. (#1404)
* [Feature] Implememnt the universal visualizer for multiple task.

* Update tools

* Improve according to comments.

* Fix tools docs

* Add --test-cfg option and set default collate function.
2023-03-09 11:36:54 +08:00
Ma Zerun dbf3df21a3
[Refactor] Use `out_type` to specify ViT-like backbone output. (#1408)
* [Refactor] Use  to specify ViT-like backbone output.

* Fix ClsBatchNormNeck

* Update mmpretrain/models/necks/mae_neck.py

---------

Co-authored-by: Yixiao Fang <36138628+fangyixiao18@users.noreply.github.com>
2023-03-09 11:02:58 +08:00
Ma Zerun 274a67223e
[Feature] Implement layer-wise learning rate decay optimizer constructor. (#1399)
* [Feature] Implement layer-wise learning rate decay optimizer constructor.

* Use num_layers instead of max_depth to avoid misleading

* Add UT

* Update docstring

* Update log info

* update LearningRateDecay configs

---------

Co-authored-by: fangyixiao18 <fangyx18@hotmail.com>
2023-03-07 17:30:39 +08:00
Yixiao Fang 827be6e22d
[Fix] Fix value error while start training. (#1400)
* fix value error of randint

* add missing key in configs
2023-03-07 08:51:31 +08:00
Yixiao Fang 08dc8c75d3
[Refactor] Add selfsup algorithms. (#1389)
* remove basehead

* add moco series

* add byol simclr simsiam

* add ut

* update configs

* add simsiam hook

* add and refactor beit

* update ut

* add cae

* update extract_feat

* refactor cae

* add mae

* refactor data preprocessor

* update heads

* add maskfeat

* add milan

* add simmim

* add mixmim

* fix lint

* fix ut

* fix lint

* add eva

* add densecl

* add barlowtwins

* add swav

* fix lint

* update readtherdocs rst

* update docs

* update

* Decrease UT memory usage

* Fix docstring

* update DALLEEncoder

* Update model docs

* refactor dalle encoder

* update docstring

* fix ut

* fix config error

* add val_cfg and test_cfg

* refactor clip generator

* fix lint

* pass check

* fix ut

* add lars

* update type of BEiT in configs

* Use MMEngine style momentum in EMA.

* apply mmpretrain solarize

---------

Co-authored-by: mzr1996 <mzr1996@163.com>
2023-03-06 16:53:15 +08:00
Ma Zerun a05c79e806
[Refactor] Move transforms in mmselfsup to mmpretrain. (#1396)
* [Refactor] Move transforms in mmselfsup to mmpretrain.

* Update transform docs and configs. And register some mmcv transforms in
mmpretrain.

* Fix missing transform wrapper.

* update selfsup transforms

* Fix UT

* Fix UT

* update gaussianblur inconfigs

---------

Co-authored-by: fangyixiao18 <fangyx18@hotmail.com>
2023-03-03 15:01:11 +08:00
Yixiao Fang c9670173aa
[Refactor] Move and refactor utils from mmselfsup. (#1385)
* add heads

* add losses

* fix

* remove mim head

* add modified backbones and target generators

* fix lint

* fix lint

* add heads

* add losses

* fix

* add data preprocessor from mmselfsup

* add ut for data prepocessor

* add GatherLayer

* add ema

* add batch shuffle

* add misc

* fix lint

* update

* update docstring
2023-02-28 17:04:40 +08:00
Ma Zerun 414ba80274
[Refactor] Refactor APIs, add `ImageRetrievalInferencer` and `FeatureExtractor`. (#1380)
* [Refactor] Refactor APIs, add `ImageRetrievalInferencer` and `FeatureExtractor'.

* Update image retrieval

* Update FeatureExtractor

* Fix UT
2023-02-28 16:31:42 +08:00
Yixiao Fang e453a45d31
[Refactor] Add self-supervised backbones and target generators. (#1379)
* add heads

* add losses

* fix

* remove mim head

* add modified backbones and target generators

* add unittest

* refactor caevit

* add window_size check

* fix lint

* apply new DataSample

* fix ut error

* update ut

* fix ut

* fix lint

* Update base modules.

---------

Co-authored-by: mzr1996 <mzr1996@163.com>
2023-02-28 15:59:17 +08:00
Yixiao Fang 63d9f27fde
[Refactor] Add necks, heads and losses for the self-supervised task. (#1376)
* add necks

* refactor linear neck

* rename simmim neck

* add heads

* add losses

* fix

* add unittest

* update

* update cae

* remove mim head

* update config
2023-02-28 10:05:00 +08:00
Yixiao Fang 75c79311f4
[Refactor] Update datasets (#1375)
* add ut

* add places205

* support ann_file without labels

* temp test

* update custom

* update

* update ut

* Update CustomDataset.

* Update Places205.

---------

Co-authored-by: mzr1996 <mzr1996@163.com>
2023-02-27 15:42:22 +08:00