Commit Graph

871 Commits (b058912c0cc79312cf656ca98b4068c30a9f9cdd)

Author SHA1 Message Date
zzc98 b058912c0c
[Docs] Fix example_project README (#1575) 2023-05-22 15:47:03 +08:00
Yixiao Fang 1e478462b8
[Feature] Support Chinese CLIP. (#1576)
* support cn-clip

* update README

* Update progress bar

* update order of category

* fix lint

* update

* update readme and metafile

* update

* update docstring

* refactor tokenizer

* fix lint

* Update README and progress bar

---------

Co-authored-by: mzr1996 <mzr1996@163.com>
2023-05-22 15:46:13 +08:00
Yuan Liu d04ef8a29e
Merge pull request #1577 from YuanLiuuuuuu/scienceqa_metrics
[Feature]: Add ScienceQA Metrics
2023-05-22 13:08:06 +08:00
liuyuan 74f24658e7 [Fix]: Delete GQA 2023-05-22 11:57:18 +08:00
liuyuan 13e4d6c512 [Fix]: Fix UT 2023-05-22 11:55:08 +08:00
liuyuan b0ad99afb9 [Fix]: Fix bug 2023-05-22 11:38:34 +08:00
liuyuan 1537d46596 [Feature]: Update scienceqa 2023-05-22 11:31:07 +08:00
liuyuan 87f849cbb6 [Feature]: Add scienceqa metric 2023-05-22 11:31:07 +08:00
liuyuan 1b8e86dca6 [Feature]: Add caption 2023-05-22 11:31:07 +08:00
Ma Zerun 6847d20d57
[Feature] Support multiple multi-modal algorithms and inferencers. (#1561)
* [Feat] Migrate blip caption to mmpretrain. (#50)

* Migrate blip caption to mmpretrain

* minor fix

* support train

* [Feature] Support OFA caption task. (#51)

* [Feature] Support OFA caption task.

* Remove duplicated files.

* [Feature] Support OFA vqa task. (#58)

* [Feature] Support OFA vqa task.

* Fix lint.

* [Feat] Add BLIP retrieval to mmpretrain. (#55)

* init

* minor fix for train

* fix according to comments

* refactor

* Update Blip retrieval. (#62)

* [Feature] Support OFA visual grounding task. (#59)

* [Feature] Support OFA visual grounding task.

* minor add TODO

---------

Co-authored-by: yingfhu <yingfhu@gmail.com>

* [Feat] Add flamingos coco caption and vqa. (#60)

* first init

* init flamingo coco

* add vqa

* minor fix

* remove unnecessary modules

* Update config

* Use `ApplyToList`.

---------

Co-authored-by: mzr1996 <mzr1996@163.com>

* [Feature]: BLIP2 coco retrieval  (#53)

* [Feature]: Add blip2 retriever

* [Feature]: Add blip2 all modules

* [Feature]: Refine model

* [Feature]: x1

* [Feature]: Runnable coco ret

* [Feature]: Runnable version

* [Feature]: Fix lint

* [Fix]: Fix lint

* [Feature]: Use 364 img size

* [Feature]: Refactor blip2

* [Fix]: Fix lint

* refactor files

* minor fix

* minor fix

---------

Co-authored-by: yingfhu <yingfhu@gmail.com>

* Remove

* fix blip caption inputs (#68)

* [Feat] Add BLIP NLVR support. (#67)

* first init

* init flamingo coco

* add vqa

* add nlvr

* refactor nlvr

* minor fix

* minor fix

* Update dataset

---------

Co-authored-by: mzr1996 <mzr1996@163.com>

* [Feature]: BLIP2 Caption (#70)

* [Feature]: Add language model

* [Feature]: blip2 caption forward

* [Feature]: Reproduce the results

* [Feature]: Refactor caption

* refine config

---------

Co-authored-by: yingfhu <yingfhu@gmail.com>

* [Feat] Migrate BLIP VQA to mmpretrain (#69)

* reformat

* change

* change

* change

* change

* change

* change

* change

* change

* change

* change

* change

* change

* change

* change

* change

* change

* change

* change

* change

* refactor code

---------

Co-authored-by: yingfhu <yingfhu@gmail.com>

* Update RefCOCO dataset

* [Fix] fix lint

* [Feature] Implement inference APIs for multi-modal tasks. (#65)

* [Feature] Implement inference APIs for multi-modal tasks.

* [Project] Add gradio demo.

* [Improve] Update requirements

* Update flamingo

* Update blip

* Add NLVR inferencer

* Update flamingo

* Update hugging face model register

* Update ofa vqa

* Update BLIP-vqa (#71)

* Update blip-vqa docstring (#72)

* Refine flamingo docstring (#73)

* [Feature]: BLIP2 VQA (#61)

* [Feature]: VQA forward

* [Feature]: Reproduce accuracy

* [Fix]: Fix lint

* [Fix]: Add blank line

* minor fix

---------

Co-authored-by: yingfhu <yingfhu@gmail.com>

* [Feature]: BLIP2 docstring (#74)

* [Feature]: Add caption docstring

* [Feature]: Add docstring to blip2 vqa

* [Feature]: Add docstring to retrieval

* Update BLIP-2 metafile and README (#75)

* [Feature]: Add readme and docstring

* Update blip2 results

---------

Co-authored-by: mzr1996 <mzr1996@163.com>

* [Feature] BLIP Visual Grounding on MMPretrain Branch (#66)

* blip grounding merge with mmpretrain

* remove commit

* blip grounding test and inference api

* refcoco dataset

* refcoco dataset refine config

* rebasing

* gitignore

* rebasing

* minor edit

* minor edit

* Update blip-vqa docstring (#72)

* rebasing

* Revert "minor edit"

This reverts commit 639cec757c215e654625ed0979319e60f0be9044.

* blip grounding final

* precommit

* refine config

* refine config

* Update blip visual grounding

---------

Co-authored-by: Yiqin Wang 王逸钦 <wyq1217@outlook.com>
Co-authored-by: mzr1996 <mzr1996@163.com>

* Update visual grounding metric

* Update OFA docstring, README and metafiles. (#76)

* [Docs] Update installation docs and gradio demo docs. (#77)

* Update OFA name

* Update Visual Grounding Visualizer

* Integrate accelerate support

* Fix imports.

* Fix timm backbone

* Update imports

* Update README

* Update circle ci

* Update flamingo config

* Add gradio demo README

* [Feature]: Add scienceqa (#1571)

* [Feature]: Add scienceqa

* [Feature]: Change param name

* Update docs

* Update video

---------

Co-authored-by: Hubert <42952108+yingfhu@users.noreply.github.com>
Co-authored-by: yingfhu <yingfhu@gmail.com>
Co-authored-by: Yuan Liu <30762564+YuanLiuuuuuu@users.noreply.github.com>
Co-authored-by: Yiqin Wang 王逸钦 <wyq1217@outlook.com>
Co-authored-by: Rongjie Li <limo97@163.com>
2023-05-19 16:50:04 +08:00
Yixiao Fang 770eb8e24a
[Fix] Fix ddp bugs caused by `out_type`. (#1570)
* set out_type to be 'raw'

* update test
2023-05-17 17:32:10 +08:00
zzc98 034919d032
[Feature] add eva02 backbone (#1450)
* [CI] Add test mim CI. (#879)

* [CI] Add test mim CI. (#879)

* feat: add eva02 backbone

* feat: add eva02 backbone

* feat: add eva02 backbone

* feat: add eva02 backbone

* feat: add eva02 backbone

* feat: add eva02 backbone

* feat: add eva02 backbone

* feat: add eva02 backbone

* update

* update ci

* rebase

* feat: add eva02 backbone

* feat: add eva02 backbone

* feat: add eva02 backbone

* feat: add eva02 backbone

* feat: add eva02 backbone

* feat: add eva02 backbone

* feat: add eva02 backbone

* feat: add eva02 backbone

* update

* update readme and configs

* update readme and configs

* refactore eva02

* [CI] Add test mim CI. (#879)

* feat: add eva02 backbone

* feat: add eva02 backbone

* feat: add eva02 backbone

* feat: add eva02 backbone

* feat: add eva02 backbone

* feat: add eva02 backbone

* feat: add eva02 backbone

* feat: add eva02 backbone

* update

* update ci

* rebase

* feat: add eva02 backbone

* feat: add eva02 backbone

* feat: add eva02 backbone

* update

* update readme and configs

* refactore eva02

* update readme and metafile

* update readme and metafile

* update readme and metafile

* update

* rename eva02

* rename eva02

* fix uts

* rename configs

---------

Co-authored-by: Ma Zerun <mzr1996@163.com>
Co-authored-by: Ezra-Yu <18586273+Ezra-Yu@users.noreply.github.com>
2023-05-06 19:28:31 +08:00
Ezra-Yu 7f4eccbecf
[Fix] Fix multi-task-head loss potential bug (#1530)
* fix bug

* add comments
2023-05-06 18:04:34 +08:00
Ezra-Yu 9cf37b315c
[DOC] Refine Inference Doc (#1489)
* update en doc

* update

* update zh doc

* refine

* refine
2023-05-06 17:54:13 +08:00
Kei-Chi Tse afa60c73bb
[Fix] Support bce loss without batch augmentations (#1525)
* Support bce loss without batch augmentations

---------

Co-authored-by: Ezra-Yu <18586273+Ezra-Yu@users.noreply.github.com>
2023-05-05 17:19:42 +08:00
Yixiao Fang d9e561a09d
[Feature] Support dinov2 backbone (#1522)
* support dinov2 backbone

* update metafile and readme

* compatible to use_layer_scale

* update SwiGLUFFN

* add deprecation warning

* update
2023-05-05 16:59:37 +08:00
zzc98 496e098b21
[Feature] Support some downstream classification datasets. (#1467)
* feat: support some downstream classification datasets

* update sun397

* sum

* update sun397

* [CI] Add test mim CI. (#879)

* feat: support some downstream classification datasets

* update sun397

* sum

* update sun397

* rebase

* feat: support some downstream classification datasets

* update sun397

* update sun397

* update sun397

* update sun397

* fix unittest

* update docstring

* rm

* update

* update

* refactor names of datasets

* refactor some implements of datasets

* refactor some implements of datasets

* fix datasets unittest

* refactor cub and stanford cars

* refactor cub and cifar

* refactor cub and cifar

* refactor cub and cifar

* update downstream datasets and docs

* update docstring

---------

Co-authored-by: Ma Zerun <mzr1996@163.com>
Co-authored-by: Ezra-Yu <18586273+Ezra-Yu@users.noreply.github.com>
2023-05-05 14:43:14 +08:00
Yixiao Fang a3fa328f09
Fix config of beit (#1528) 2023-04-28 16:33:14 +08:00
Choi Sau Deng b51d7d21de
[DOC] Add doc for usage of confusion matrix (#1513)
* add_doc_for_confusion_matrix

* add_doc_for_confusion_matrix_fix_mmcls

* add_doc_for_confusion_matrix_fix_shell

* add_doc_for_confusion_matrix_fix_shell

* fix

* update

---------

Co-authored-by: fangyixiao18 <fangyx18@hotmail.com>
2023-04-27 14:56:44 +08:00
Yixiao Fang 15cc2a5193
[Fix] Fix clip generator init bug (#1518) 2023-04-25 19:35:09 +08:00
Ezra-Yu 6ceba070a8
[DOC] Update MMagic link (#1517)
* update repo links

* update mmengine links

* update mmengine links
2023-04-25 19:12:34 +08:00
Weihao Yu 3cd4fd4d64
Update PoolFormer citation to CVPR version (#1505) 2023-04-20 20:22:22 +08:00
Wangbo Zhao(黑色枷锁) e954cf0aaf
[Fix] Fix the bug in binary cross entropy loss (#1499)
* [Fix] Fix the bug in binary cross entropy loss

Fix the bug in binary cross entropy loss when using multi-label datasets, e.g. VOC2007

* update ci

---------

Co-authored-by: Ezra-Yu <18586273+Ezra-Yu@users.noreply.github.com>
2023-04-19 13:53:31 +08:00
takuoko fec3da781f
[Feature] Support GLIP (#1308)
* rebase

* add glip

* update glip

* add links

* rename

* fix doc

---------

Co-authored-by: Ezra-Yu <18586273+Ezra-Yu@users.noreply.github.com>
2023-04-17 19:19:23 +08:00
Yixiao Fang 2c913020b9
[Refactor] Support to freeze channel reduction and add layer decay function (#1490)
* support to freeze channel reduction module

* add layer decay setting function
2023-04-17 13:36:47 +08:00
Yixiao Fang e93d124ad4
[Refactor] Support resizing pos_embed while loading ckpt and format output (#1488)
* support resize pos_embed while loading ckpt

* update
2023-04-14 19:08:35 +08:00
Yixiao Fang 02571fe4b8
[Docs] Add NPU support page (#1481)
* add npu docs

* fix lint
2023-04-14 13:58:10 +08:00
Ezra-Yu 645e2b4ed4
[DOC] Fix typo in MultiLabelDataset docstring (#1483)
* fix doc

* fix ci

* fix ci

* fix ci

* fix ci
2023-04-14 13:57:54 +08:00
Ezra-Yu 99e48116aa
[Feature] Register torchvision transforms into mmcls (#1265)
* [Enhance] Add stochastic depth decay rule in resnet. (#1363)

* add stochastic depth decay rule to drop path rate

* add default value

* update

* pass ut

* update

* pass ut

* remove np

* rebase

* update ToPIL and ToNumpy

* rebase

* rebase

* rebase

* rebase

* add readme

* fix review suggestions

* rebase

* fix conflicts

* fix conflicts

* fix lint

* remove comments

* remove useless code

* update docstring

* update doc API

* update doc

---------

Co-authored-by: Yixiao Fang <36138628+fangyixiao18@users.noreply.github.com>
2023-04-13 18:05:57 +08:00
Yixiao Fang 0826df8963
[Feature] Add ViT of SAM (#1476)
* add vit of sam

* update

* update

* add ut

* update ut

* remove num_classes

* support dynamic input

* add ut

* add comments

* update ut
2023-04-13 17:03:28 +08:00
Mr.Li e80418a424
[Docs] train cfg: Removed old description (#1473) 2023-04-10 15:37:14 +08:00
Yixiao Fang 9cbeceabb5
Bump version to v1.0.0rc7 (#1465)
* update

* update info

* update changelog

* update

* update description

* change to v1.0.0rc7
2023-04-07 17:34:21 +08:00
Ezra-Yu 47e033c466
[Fix] fix pbn bug (#1466) 2023-04-07 15:38:04 +08:00
Yixiao Fang 5ea46fbbbc
[Docs] Fix docs link (#1464) 2023-04-07 15:06:18 +08:00
mzr1996 1e78f09d87 Update docs style. 2023-04-07 14:52:34 +08:00
Yixiao Fang 79ddc0f874
[Refactor] Update CI and issue template (#1463)
* Update CI

* update issue template

* update

* update collect_env function
2023-04-07 14:29:41 +08:00
Ezra-Yu 411e05a705
Merge pull request #1453 from techmonsterwang/riformer_mmpt
[Feature] Add RIFormer Backbone
2023-04-07 10:32:23 +08:00
Ezra-Yu 05124dbb71 fix lint 2023-04-06 22:01:11 +08:00
Ezra-Yu b8cab5c9f7 update readme 2023-04-06 21:56:25 +08:00
Ezra-Yu 3932ddec10 update ckpt path 2023-04-06 21:56:25 +08:00
techmonsterwang 5c3abb2b2a update riformer mmpretrain 2023-04-06 21:56:25 +08:00
techmonsterwang e115ac89f4 update riformer mmpretrain 2023-04-06 21:56:25 +08:00
techmonsterwang 53a57c6dad update riformer mmpretrain 2023-04-06 21:56:25 +08:00
techmonsterwang e4d8511ddf update riformer mmpretrain 2023-04-06 21:56:25 +08:00
techmonsterwang c9c7d9cc0f update riformer mmpretrain 2023-04-06 21:56:25 +08:00
techmonsterwang a6c24d104e update riformer mmpretrain 2023-04-06 21:56:25 +08:00
techmonsterwang e7da3f29f4 update riformer mmpretrain 2023-04-06 21:56:25 +08:00
techmonsterwang 61b795f21f update riformer mmpretrain 2023-04-06 21:56:25 +08:00
techmonsterwang 0ef0b5ce08 update riformer mmpretrain 2023-04-06 21:56:25 +08:00
techmonsterwang 32c258ff19 update riformer mmpretrain 2023-04-06 21:56:25 +08:00