Thanks for your contribution and we appreciate it a lot. The following
instructions would make your pull request more healthy and more easily
get feedback. If you do not understand some items, don't worry, just
make the pull request and seek help from maintainers.
## Motivation
Support depth estimation algorithm [VPD](https://github.com/wl-zhao/VPD)
## Modification
1. add VPD backbone
2. add VPD decoder head for depth estimation
3. add a new segmentor `DepthEstimator` based on `EncoderDecoder` for
depth estimation
4. add an integrated metric that calculate common metrics in depth
estimation
5. add SiLog loss for depth estimation
6. add config for VPD
## BC-breaking (Optional)
Does the modification introduce changes that break the
backward-compatibility of the downstream repos?
If so, please describe how it breaks the compatibility and how the
downstream projects should modify their code to keep compatibility with
this PR.
## Use cases (Optional)
If this PR introduces a new feature, it is better to list some use cases
here, and update the documentation.
## Checklist
1. Pre-commit or other linting tools are used to fix the potential lint
issues.
7. The modification is covered by complete unit tests. If not, please
add more unit test to ensure the correctness.
8. If the modification has potential influence on downstream projects,
this PR should be tested with downstream projects, like MMDet or
MMDet3D.
9. The documentation has been modified accordingly, like docstring or
example tutorials.
* knet first commit
* fix import error in knet
* remove kernel update head from decoder head
* [Feature] Add kenerl updation for some decoder heads.
* [Feature] Add kenerl updation for some decoder heads.
* directly use forward_feature && modify other 3 decoder heads
* remover kernel_update attr
* delete unnecessary variables in forward function
* delete kernel update function
* delete kernel update function
* delete kernel_generate_head
* add unit test & comments in knet.py
* add copyright to fix lint error
* modify config names of knet
* rename swin-l 640
* upload models&logs and refactor knet_head.py
* modify docstrings and add some ut
* add url, modify docstring and add loss ut
* modify docstrings
* [Feature]Segformer re-implementation
* Using act_cfg and norm_cfg to control activation and normalization
* Split this PR into several little PRs
* Fix lint error
* Remove SegFormerHead
* [Feature] Add segformer decode head and related train config
* Add ade20K trainval support for segformer
1. Add related train and val configs;
2. Add AlignedResize;
* Set arg: find_unused_parameters = True
* parameters init refactor
* 1. Refactor segformer backbone parameters init;
2. Remove rebundant functions and unit tests;
* Remove rebundant codes
* Replace Linear Layer to 1X1 Conv
* Use nn.ModuleList to refactor segformer head.
* Remove local to_xtuple
* 1. Remove rebundant codes;
2. Modify module name;
* Refactor the backbone of segformer using mmcv.cnn.bricks.transformer.py
* Fix some code logic bugs.
* Add mit_convert.py to match pretrain keys of segformer.
* Resolve some comments.
* 1. Add some assert to ensure right params;
2. Support flexible peconv position;
* Add pe_index assert and fix unit test.
* 1. Add doc string for MixVisionTransformer;
2. Add some unit tests for MixVisionTransformer;
* Use hw_shape to pass shape of feature map.
* 1. Fix doc string of MixVisionTransformer;
2. Simplify MixFFN;
3. Modify H, W to hw_shape;
* Add more unit tests.
* Add doc string for shape convertion functions.
* Add some unit tests to improve code coverage.
* Fix Segformer backbone pretrain weights match bug.
* Modify configs of segformer.
* resolve the shape convertion functions doc string.
* Add pad_to_patch_size arg.
* Support progressive test with fewer memory cost.
* Modify default value of pad_to_patch_size arg.
* Temp code
* Using processor to refactor evaluation workflow.
* refactor eval hook.
* Fix process bar.
* Fix middle save argument.
* Modify some variable name of dataset evaluate api.
* Modify some viriable name of eval hook.
* Fix some priority bugs of eval hook.
* Fix some bugs about model loading and eval hook.
* Add ade20k 640x640 dataset.
* Fix related segformer configs.
* Depreciated efficient_test.
* Fix training progress blocked by eval hook.
* Depreciated old test api.
* Modify error patch size.
* Fix pretrain of mit_b0
* Fix the test api error.
* Modify dataset base config.
* Fix test api error.
* Modify outer api.
* Build a sampler test api.
* TODO: Refactor format_results.
* Modify variable names.
* Fix num_classes bug.
* Fix sampler index bug.
* Fix grammaly bug.
* Add part of benchmark results.
* Support batch sampler.
* More readable test api.
* Remove some command arg and fix eval hook bug.
* Support format-only arg.
* Modify format_results of datasets.
* Modify tool which use test apis.
* Update readme.
* Update readme of segformer.
* Updata readme of segformer.
* Update segformer readme and fix segformer mit_b4.
* Update readme of segformer.
* Clean AlignedResize related config.
* Clean code from pr #709
* Clean code from pr #709
* Add 512x512 segformer_mit-b5.
* Fix lint.
* Fix some segformer head bugs.
* Add segformer unit tests.
* Replace AlignedResize to ResizeToMultiple.
* Modify readme of segformer.
* Fix bug of ResizeToMultiple.
* Add ResizeToMultiple unit tests.
* Resolve conflict.
* Simplify the implementation of ResizeToMultiple.
* Update test results.
* Fix multi-scale test error when resize_ratio=1.75 and input size=640x640.
* Update segformer results.
* Update Segformer results.
* Fix some url bugs and pipelines bug.
* Move ckpt convertion to tools.
* Add segformer official pretrain weights usage.
* Clean redundant codes.
* Remove redundant codes.
* Unfied format.
* Add description for segformer converter.
* Update workers.
* Adjust vision transformer backbone architectures;
* Add DropPath, trunc_normal_ for VisionTransformer implementation;
* Add class token buring intermediate period and remove it during final period;
* Fix some parameters loss bug;
* * Store intermediate token features and impose no processes on them;
* Remove class token and reshape entire token feature from NLC to NCHW;
* Fix some doc error
* Add a arg for VisionTransformer backbone to control if input class token into transformer;
* Add stochastic depth decay rule for DropPath;
* * Fix output bug when input_cls_token=False;
* Add related unit test;
* Re-implement of SETR
* Add two head -- SETRUPHead (Naive, PUP) & SETRMLAHead (MLA);
* * Modify some docs of heads of SETR;
* Add MLA auxiliary head of SETR;
* * Modify some arg of setr heads;
* Add unit test for setr heads;
* * Add 768x768 cityscapes dataset config;
* Add Backbone: SETR -- Backbone: MLA, PUP, Naive;
* Add SETR cityscapes training & testing config;
* * Fix the low code coverage of unit test about heads of setr;
* Remove some rebundant error capture;
* * Add pascal context dataset & ade20k dataset config;
* Modify auxiliary head relative config;
* Modify folder structure.
* add setr
* modify vit
* Fix the test_cfg arg position;
* Fix some learning schedule bug;
* optimize setr code
* Add arg: final_reshape to control if converting output feature information from NLC to NCHW;
* Fix the default value of final_reshape;
* Modify arg: final_reshape to arg: out_shape;
* Fix some unit test bug;
* Add MLA neck;
* Modify setr configs to add MLA neck;
* Modify MLA decode head to remove rebundant structure;
* Remove some rebundant files.
* * Fix the code style bug;
* Remove some rebundant files;
* Modify some unit tests of SETR;
* Ignoring CityscapesCoarseDataset and MapillaryDataset.
* Fix the activation function loss bug;
* Fix the img_size bug of SETR_PUP_ADE20K
* * Fix the lint bug of transformers.py;
* Add mla neck unit test;
* Convert vit of setr out shape from NLC to NCHW.
* * Modify Resize action of data pipeline;
* Fix deit related bug;
* Set find_unused_parameters=False for pascal context dataset;
* Remove arg: find_unused_parameters which is False by default.
* Error auxiliary head of PUP deit
* Remove the minimal restrict of slide inference.
* Modify doc string of Resize
* Seperate this part of code to a new PR #544
* * Remove some rebundant codes;
* Modify unit tests of SETR heads;
* Fix the tuple in_channels of mla_deit.
* Modify code style
* Move detailed definition of auxiliary head into model config dict;
* Add some setr config for default cityscapes.py;
* Fix the doc string of SETR head;
* Modify implementation of SETR Heads
* Remove setr aux head and use fcn head to replace it;
* Remove arg: img_size and remove last interpolate op of heads;
* Rename arg: conv3x3_conv1x1 to kernel_size of SETRUPHead;
* non-square input support for setr heads
* Modify config argument for above commits
* Remove norm_layer argument of SETRMLAHead
* Add mla_align_corners for MLAModule interpolate
* [Refactor]Refactor of SETRMLAHead
* Modify Head implementation;
* Modify Head unit test;
* Modify related config file;
* [Refactor]MLA Neck
* Fix config bug
* [Refactor]SETR Naive Head and SETR PUP Head
* [Fix]Fix the lack of arg: act_cfg and arg: norm_cfg
* Fix config error
* Refactor of SETR MLA, Naive, PUP heads.
* Modify some attribute name of SETR Heads.
* Modify setr configs to adapt new vit code.
* Fix trunc_normal_ bug
* Parameters init adjustment.
* Remove redundant doc string of SETRUPHead
* Fix pretrained bug
* [Fix] Fix vit init bug
* Add some vit unit tests
* Modify module import
* Remove norm from PatchEmbed
* Fix pretrain weights bug
* Modify pretrained judge
* Fix some gradient backward bugs.
* Add some unit tests to improve code cov
* Fix init_weights of setr up head
* Add DropPath in FFN
* Finish benchmark of SETR
1. Add benchmark information into README.MD of SETR;
2. Fix some name bugs of vit;
* Remove DropPath implementation and use DropPath from mmcv.
* Modify out_indices arg
* Fix out_indices bug.
* Remove cityscapes base dataset config.
Co-authored-by: sennnnn <201730271412@mail.scut.edu.cn>
Co-authored-by: CuttlefishXuan <zhaoxinxuan1997@gmail.com>
* init commit: fast_scnn
* 247917iters
* 4x8_80k
* configs placed in configs_unify. 4x8_80k exp.running.
* mmseg/utils/collect_env.py modified to support Windows
* study on lr
* bug in configs_unify/***/cityscapes.py fixed.
* lr0.08_100k
* lr_power changed to 1.2
* log_config by_epoch set to False.
* lr1.2
* doc strings added
* add fast_scnn backbone test
* 80k 0.08,0.12
* add 450k
* fast_scnn test: fix BN bug.
* Add different config files into configs/
* .gitignore recovered.
* configs_unify del
* .gitignore recovered.
* delete sub-optimal config files of fast-scnn
* Code style improved.
* add docstrings to component modules of fast-scnn
* relevant files modified according to Jerry's instructions
* relevant files modified according to Jerry's instructions
* lint problems fixed.
* fast_scnn config extremely simplified.
* InvertedResidual
* fixed padding problems
* add unit test for inverted_residual
* add unit test for inverted_residual: debug 0
* add unit test for inverted_residual: debug 1
* add unit test for inverted_residual: debug 2
* add unit test for inverted_residual: debug 3
* add unit test for sep_fcn_head: debug 0
* add unit test for sep_fcn_head: debug 1
* add unit test for sep_fcn_head: debug 2
* add unit test for sep_fcn_head: debug 3
* add unit test for sep_fcn_head: debug 4
* add unit test for sep_fcn_head: debug 5
* FastSCNN type(dwchannels) changed to tuple.
* t changed to expand_ratio.
* Spaces fixed.
* Update mmseg/models/backbones/fast_scnn.py
Co-authored-by: Jerry Jiarui XU <xvjiarui0826@gmail.com>
* Update mmseg/models/decode_heads/sep_fcn_head.py
Co-authored-by: Jerry Jiarui XU <xvjiarui0826@gmail.com>
* Update mmseg/models/decode_heads/sep_fcn_head.py
Co-authored-by: Jerry Jiarui XU <xvjiarui0826@gmail.com>
* Docstrings fixed.
* Docstrings fixed.
* Inverted Residual kept coherent with mmcl.
* Inverted Residual kept coherent with mmcl. Debug 0
* _make_layer parameters renamed.
* final commit
* Arg scale_factor deleted.
* Expand_ratio docstrings updated.
* final commit
* Readme for Fast-SCNN added.
* model-zoo.md modified.
* fast_scnn README updated.
* Move InvertedResidual module into mmseg/utils.
* test_inverted_residual module corrected.
* test_inverted_residual.py moved.
* encoder_decoder modified to avoid bugs when running PSPNet.
getting_started.md bug fixed.
* Revert "encoder_decoder modified to avoid bugs when running PSPNet. "
This reverts commit dd0aadfb
Co-authored-by: Jerry Jiarui XU <xvjiarui0826@gmail.com>