关于GCU、沐曦GPGPU、MLU、0卡V100资源4月7日恢复上架的公告>>> 关于共建具身智能开源数据集的倡议>>> 关于云脑任务中统一路径访问方式的公告>>> 关于将启智集群GPU资源迁移至智算集群的公告>>>

History

谢昕辰 c448646a92 [Doc] Refine doc and fix links (#2821 ) ## Motivation - Create the `main` branch ## Modification Modify links from `dev-1.x` to `main`		1 year ago
..
README.md	[Doc] Refine doc and fix links (#2821)	1 year ago

maskformer_r50-d32_8xb2-160k_ade20k-512x512.py	[Fix] Fix MaskFormer and Mask2Former of MMSegmentation (#2532)	1 year ago

maskformer_r101-d32_8xb2-160k_ade20k-512x512.py	[Feature] Support MaskFormer(NeurIPS'2021) in MMSeg 1.x (#2215)	1 year ago

maskformer_swin-s_upernet_8xb2-160k_ade20k-512x512.py	[Feature] Support MaskFormer(NeurIPS'2021) in MMSeg 1.x (#2215)	1 year ago

maskformer_swin-t_upernet_8xb2-160k_ade20k-512x512.py	[Feature] Support MaskFormer(NeurIPS'2021) in MMSeg 1.x (#2215)	1 year ago

metafile.yaml	[Dev] update update-model-index pre-commit hook (#2667)	1 year ago

README.md

MaskFormer

MaskFormer

MaskFormer: Per-Pixel Classification is Not All You Need for Semantic Segmentation

Introduction

Official Repo

Code Snippet

Abstract

Modern approaches typically formulate semantic segmentation as a per-pixel classification task, while instance-level segmentation is handled with an alternative mask classification. Our key insight: mask classification is sufficiently general to solve both semantic- and instance-level segmentation tasks in a unified manner using the exact same model, loss, and training procedure. Following this observation, we propose MaskFormer, a simple mask classification model which predicts a set of binary masks, each associated with a single global class label prediction. Overall, the proposed mask classification-based method simplifies the landscape of effective approaches to semantic and panoptic segmentation tasks and shows excellent empirical results. In particular, we observe that MaskFormer outperforms per-pixel classification baselines when the number of classes is large. Our mask classification-based method outperforms both current state-of-the-art semantic (55.6 mIoU on ADE20K) and panoptic segmentation (52.7 PQ on COCO) models.

Usage

MaskFormer model needs to install MMDetection first.

pip install "mmdet>=3.0.0rc4"

Results and models

ADE20K

Method	Backbone	Crop Size	Lr schd	Mem (GB)	Inf time (fps)	Device	mIoU	mIoU(ms+flip)	config	download
MaskFormer	R-50-D32	512x512	160000	3.29	A100	42.20	44.29	-	config	model \| log
MaskFormer	R-101-D32	512x512	160000	4.12	A100	34.90	45.11	-	config	model \| log
MaskFormer	Swin-T	512x512	160000	3.73	A100	40.53	46.69	-	config	model \| log
MaskFormer	Swin-S	512x512	160000	5.33	A100	26.98	49.36	-	config	model \| log

Note:

All experiments of MaskFormer are implemented with 8 V100 (32G) GPUs with 2 samplers per GPU.
The results of MaskFormer are relatively not stable. The accuracy (mIoU) of model with R-101-D32 is from 44.7 to 46.0, and with Swin-S is from 49.0 to 49.8.
The ResNet backbones utilized in MaskFormer models are standard ResNet rather than ResNetV1c.
Test time augmentation is not supported in MMSegmentation 1.x version yet, we would add "ms+flip" results as soon as possible.

Citation

@article{cheng2021per,
  title={Per-pixel classification is not all you need for semantic segmentation},
  author={Cheng, Bowen and Schwing, Alex and Kirillov, Alexander},
  journal={Advances in Neural Information Processing Systems},
  volume={34},
  pages={17864--17875},
  year={2021}
}

No Description

Python Markdown Shell Dockerfile other

mcmong@pku.edu.cn 76149310+MeowZheng@users.noreply.github.com xiexinch@outlook.com xvjiarui0826@gmail.com zhengmiao@sensetime.com hejunjun@sjtu.edu.cn 41846794+RockeyCoss@users.noreply.github.com xinchen.xie@qq.com 58427300+sennnnn@users.noreply.github.com 40987381+csatsurnh@users.noreply.github.com meowzheng@outlook.com 49829199+yamengxi@users.noreply.github.com limengzhang@pjlab.org.cn 50650583+AI-Tianlong@users.noreply.github.com masaaki75@qq.com linfangjian@pjlab.org.cn 45811724+matrixgame2018@users.noreply.github.com sshuair@gmail.com 46390195+tianbinli@users.noreply.github.com xiexinchen@pjlab.org.cn daviddelaiglesiacastro@gmail.com 93248678+linfangjian01@users.noreply.github.com penglu2097@gmail.com

31381602+johnzja@users.noreply.github.com

How to access data resources in code