Thanks for your contribution; we appreciate it a lot. The following
instructions will make your pull request healthier and easier to review.
If you do not understand some items, don't worry: just open the pull
request and ask the maintainers for help.
## Motivation
Support the depth estimation algorithm [VPD](https://github.com/wl-zhao/VPD)
## Modification
1. add VPD backbone
2. add VPD decoder head for depth estimation
3. add a new segmentor `DepthEstimator` based on `EncoderDecoder` for
depth estimation
4. add an integrated metric that calculates common depth estimation
metrics
5. add SiLog loss for depth estimation (a minimal sketch follows this list)
6. add config for VPD
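
Item 5 above refers to the scale-invariant logarithmic (SiLog) loss commonly used for monocular depth estimation. Below is a minimal PyTorch sketch of that formulation; the class name, the `lambd` weighting and the `eps` clamp are illustrative assumptions, not necessarily the exact implementation added by this PR.

```python
import torch
import torch.nn as nn


class SiLogLoss(nn.Module):
    """Scale-invariant log loss (sketch only, not the exact PR code)."""

    def __init__(self, lambd: float = 0.5, eps: float = 1e-6):
        super().__init__()
        self.lambd = lambd  # weight of the scale-invariance term (assumed value)
        self.eps = eps      # clamp so log() stays well-defined

    def forward(self, pred: torch.Tensor, target: torch.Tensor) -> torch.Tensor:
        # Supervise only pixels that have a valid ground-truth depth.
        valid = target > self.eps
        diff = torch.log(pred[valid].clamp(min=self.eps)) - torch.log(target[valid])
        return torch.sqrt((diff ** 2).mean() - self.lambd * diff.mean() ** 2)
```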
## BC-breaking (Optional)
Does the modification introduce changes that break backward
compatibility for downstream repos?
If so, please describe how it breaks compatibility and how downstream
projects should modify their code to stay compatible with this PR.
## Use cases (Optional)
If this PR introduces a new feature, it is better to list some use cases
here, and update the documentation.
## Checklist
1. Pre-commit or other linting tools are used to fix potential lint
issues.
2. The modification is covered by complete unit tests. If not, please
add more unit tests to ensure correctness.
3. If the modification has potential influence on downstream projects,
this PR should be tested with downstream projects, like MMDet or
MMDet3D.
4. The documentation has been modified accordingly, e.g. docstrings or
example tutorials.
* [WIP] Refactor data flow
* model return
* [WIP] Refactor data flow
* support data_samples being optional
* fix benchmark
* fix base
* minors
* rebase
* fix api
* ut
* fix api inference
* comments
* docstring
* fix bug of slide inference
* add assert c > 1
* assert original HardSwish when PyTorch > 1.6 in unit test
* [Fix] Fix the bug that ViT cannot load pretrained weights properly when using init_cfg to specify the pretraining scheme
* [Fix] fix the coverage problem
* Update mmseg/models/backbones/vit.py
Co-authored-by: Junjun2016 <hejunjun@sjtu.edu.cn>
* [Fix] make the predicate more concise and clearer
* [Fix] Modified the judgement logic
* Update tests/test_models/test_backbones/test_vit.py
Co-authored-by: Junjun2016 <hejunjun@sjtu.edu.cn>
* add comments
Co-authored-by: Junjun2016 <hejunjun@sjtu.edu.cn>
* add TIMMBackbone and unittests
* add timm to tests requirements
* deprecate pt1.3.1
* reduce the unittests input of timm backbone
* fix ci
* remove unittests of large models of timm backbone
* generate coverage report for all unittests env
* reduce the unittests input of timm backbone
* reduce the unittests input of timm backbone
* [Feature] Segformer re-implementation
* Using act_cfg and norm_cfg to control activation and normalization
* Split this PR into several little PRs
* Fix lint error
* Remove SegFormerHead
* parameters init refactor
* 1. Refactor segformer backbone parameters init;
2. Remove redundant functions and unit tests;
* Remove redundant code
* 1. Remove redundant code;
2. Modify module name;
* Refactor the backbone of segformer using mmcv.cnn.bricks.transformer.py
* Fix some code logic bugs.
* Add mit_convert.py to match pretrain keys of segformer.
* Resolve some comments.
* 1. Add some asserts to ensure correct params;
2. Support flexible peconv position;
* Add pe_index assert and fix unit test.
* 1. Add doc string for MixVisionTransformer;
2. Add some unit tests for MixVisionTransformer;
* Use hw_shape to pass shape of feature map.
* 1. Fix doc string of MixVisionTransformer;
2. Simplify MixFFN;
3. Modify H, W to hw_shape;
* Add more unit tests.
* Add doc string for shape conversion functions.
* Add some unit tests to improve code coverage.
* Fix Segformer backbone pretrain weights match bug.
* Resolve the shape conversion functions' doc string.
* Add pad_to_patch_size arg.
* Modify default value of pad_to_patch_size arg.
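
Several of the Segformer commits above rely on shape conversion helpers that move features between the transformer token layout (N, L, C) and the convolutional layout (N, C, H, W), passing hw_shape alongside the tokens. A minimal sketch of that conversion is shown below; the helper names mirror the usual mmseg/mmcv utilities but are written here from scratch for illustration.

```python
import torch


def nlc_to_nchw(x: torch.Tensor, hw_shape) -> torch.Tensor:
    """Convert token features (N, L, C) into a feature map (N, C, H, W)."""
    H, W = hw_shape
    assert x.dim() == 3 and x.shape[1] == H * W, 'L must equal H * W'
    return x.transpose(1, 2).reshape(x.shape[0], -1, H, W).contiguous()


def nchw_to_nlc(x: torch.Tensor) -> torch.Tensor:
    """Convert a feature map (N, C, H, W) back into token features (N, L, C)."""
    assert x.dim() == 4
    return x.flatten(2).transpose(1, 2).contiguous()
```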
* [Fix] Fix vit init bug
* Add some vit unit tests
* Modify module import
* Fix pretrain weights bug
* Modify pretrained judge
* Add some unit tests to improve code cov
* Optimize code
* Fix vit unit test
* [Refactor] Using mmcv bricks to refactor vit
* Follow the vit code structure from mmclassification
* Add MMCV install into CI system.
* Add to 'Install MMCV' CI item
* Add 'Install MMCV_CPU' and 'Install MMCV_GPU CI' items
* Fix & Add
1. Fix low code coverage of vit.py;
2. Remove HybridEmbed;
3. Fix doc string of VisionTransformer;
* Add helpers unit test.
* Add converter to convert vit pretrain weights from timm style to mmcls style.
* Clean some redundant code and refactor init
1. Use timm style init_weights;
2. Remove to_xtuple and trunc_norm_;
* Add comments for VisionTransformer.init_weights()
* Add arg: pretrain_style to choose timm or mmcls vit pretrain weights.
* Add arg: final_reshape to control whether to convert output features from NLC to NCHW;
* Fix the default value of final_reshape;
* Modify arg: final_reshape to arg: out_shape;
* Fix some unit test bugs;
* Adjust vision transformer backbone architectures;
* Add DropPath, trunc_normal_ for VisionTransformer implementation;
* Add class token during the intermediate period and remove it during the final period;
* Fix a bug that caused some parameters to be lost;
* Store intermediate token features and apply no extra processing to them;
* Remove class token and reshape entire token feature from NLC to NCHW;
* Fix some doc errors
* Add an arg for VisionTransformer backbone to control whether the class token is fed into the transformer;
* Add stochastic depth decay rule for DropPath;
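
The stochastic depth decay rule mentioned here is usually a linear ramp of the drop-path probability over the transformer layers, so deeper blocks are dropped more often. A short sketch of that rule, assuming each layer wraps its residual branches in a DropPath module built from the per-layer rate:

```python
import torch

num_layers = 12        # assumed depth of the ViT backbone
drop_path_rate = 0.1   # maximum drop probability, reached at the last layer

# Linearly increasing drop-path rates, one per transformer layer.
dpr = [r.item() for r in torch.linspace(0, drop_path_rate, num_layers)]
# Layer i would then construct DropPath(dpr[i]): early layers are almost never
# dropped, while the final layer uses the full drop_path_rate.
```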
* Fix output bug when input_cls_token=False;
* Add related unit test;
* Add arg: out_indices to control model output;
* Add unit test for DropPath;
* Apply suggestions from code review
Co-authored-by: Jerry Jiarui XU <xvjiarui0826@gmail.com>
* vit backbone
* fix lint
* add docstrings and fix the problem that pretrained pos_embed dims do not match
* add unittest for vit
* fix lint
* add vit based fcn configs
* fix import error
* support multiple resolution input images
* upsample pos_embed at init_weights
* support resize pos_embed at evaluation
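
Resizing pos_embed for multiple input resolutions typically means interpolating the pretrained patch-position grid to the new feature-map size while leaving the class-token embedding untouched. A rough sketch of that procedure follows; the function name and the bicubic interpolation mode are assumptions, not necessarily what this PR uses.

```python
import torch
import torch.nn.functional as F


def resize_pos_embed(pos_embed: torch.Tensor, src_hw, dst_hw) -> torch.Tensor:
    """Interpolate ViT position embeddings of shape (1, 1 + H*W, C) to a new grid."""
    cls_pos = pos_embed[:, :1]    # class-token embedding stays unchanged
    patch_pos = pos_embed[:, 1:]  # (1, H*W, C) patch-position embeddings
    src_h, src_w = src_hw
    dst_h, dst_w = dst_hw
    patch_pos = patch_pos.reshape(1, src_h, src_w, -1).permute(0, 3, 1, 2)
    patch_pos = F.interpolate(
        patch_pos, size=(dst_h, dst_w), mode='bicubic', align_corners=False)
    patch_pos = patch_pos.permute(0, 2, 3, 1).reshape(1, dst_h * dst_w, -1)
    return torch.cat([cls_pos, patch_pos], dim=1)
```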
* fix training errors
* add more unit test code for vit backbone
* unit test for uncovered code
* add norm_eval unittest
* refactor _pos_embeding
* minor change
* change var name
* refactor init_weight
* load weights after resize
* ignore 'module' in pretrain checkpoint
* add with_cp
Co-authored-by: Jiarui XU <xvjiarui0826@gmail.com>