Image Segmentation - Zhuanzhi Topic

Image Segmentation

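Image segmentation is the technique and process of dividing an image into a number of specific regions with distinctive properties and extracting the objects of interest. It is the key step that leads from image processing to image analysis. More precisely, image segmentation partitions an image into non-overlapping regions according to features such as gray level, color, texture, and shape, so that these features are similar within each region and clearly different between regions.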
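
The definition above can be illustrated with a minimal sketch, assuming the simplest possible feature (gray level) and a single fixed threshold. The function name `segment_by_gray_level`, the threshold value, and the toy image are illustrative assumptions, not taken from any of the resources listed on this page. The sketch uses classical thresholding plus 4-connected flood fill, not the deep learning methods collected below.

```python
from collections import deque

import numpy as np


def segment_by_gray_level(image, threshold):
    """Label 4-connected regions whose pixels lie on the same side of `threshold`.

    Returns an integer label map with the same shape as `image`. Every pixel
    receives exactly one label, so the resulting regions never overlap.
    """
    image = np.asarray(image)
    bright = image > threshold                      # similarity criterion: gray level
    labels = np.zeros(image.shape, dtype=np.int32)  # 0 means "not yet assigned"
    height, width = image.shape
    next_label = 0

    for row in range(height):
        for col in range(width):
            if labels[row, col] != 0:
                continue
            # Start a new region here and grow it breadth-first.
            next_label += 1
            labels[row, col] = next_label
            queue = deque([(row, col)])
            while queue:
                r, c = queue.popleft()
                for nr, nc in ((r - 1, c), (r + 1, c), (r, c - 1), (r, c + 1)):
                    if (0 <= nr < height and 0 <= nc < width
                            and labels[nr, nc] == 0
                            and bright[nr, nc] == bright[r, c]):
                        labels[nr, nc] = next_label
                        queue.append((nr, nc))
    return labels


# Hypothetical 4x4 gray-level image: a dark background with two bright objects.
toy_image = np.array([
    [10, 10, 200, 200],
    [10, 10, 200, 200],
    [10, 10, 10, 10],
    [220, 10, 10, 10],
])
label_map = segment_by_gray_level(toy_image, threshold=128)
print(label_map)
```

On this toy image the label map separates the dark background from the two bright objects, and each pixel belongs to exactly one region. Real images rarely separate this cleanly on gray level alone, which is part of why the learned approaches in the sections below are used instead.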

Knowledge Collection

Image Segmentation: Zhuanzhi Collection

Getting Started

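A 2017 Guide to Semantic Segmentation with Deep Learning (an overview of semantic segmentation with deep learning)
- [http://blog.qure.ai/notes/semantic-segmentation-deep-learning-review]
- Chinese translation: [http://simonduan.site/2017/07/23/notes-semantic-segmentation-deep-learning-review/]
From Fully Convolutional Networks to Large Kernels: A Complete Guide to Semantic Segmentation with Deep Learning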
- [https://www.jiqizhixin.com/articles/2017-07-14-10]
Fully Convolutional Networks
- [http://simtalk.cn/2016/11/01/Fully-Convolutional-Networks/]
A Full Walkthrough of Deep Learning Methods for Semantic Segmentation: From FCN and SegNet to Each Generation of DeepLab
- [https://zhuanlan.zhihu.com/p/27794982]
Image Semantic Segmentation with FCN and CRF
- [https://zhuanlan.zhihu.com/p/22308032]
From Tesla to Computer Vision: Image Semantic Segmentation
- [http://www.52cs.org/?p=1089]
Semantic Segmentation in Computer Vision
- [http://blog.geohey.com/ji-suan-ji-shi-jue-zhi-yu-yi-fen-ge/]
Segmentation Results: VOC2012 (PASCAL semantic segmentation competition leaderboard)
- [http://host.robots.ox.ac.uk:8080/leaderboard/displaylb.php?challengeid=11&compid=6]

Surveys

A Review on Deep Learning Techniques Applied to Semantic Segmentation. Alberto Garcia-Garcia, Sergio Orts-Escolano, Sergiu Oprea, Victor Villena-Martinez, Jose Garcia-Rodriguez. 2017
- [https://arxiv.org/abs/1704.06857]
Computer Vision for Autonomous Vehicles: Problems, Datasets and State-of-the-Art
- [https://arxiv.org/abs/1704.05519]
Survey on Content-Based Image Segmentation Methods. Jiang Feng, Gu Qing, Hao Huizhen, Li Na, Guo Yanwen, Chen Daoxu. 2017
- [http://www.jos.org.cn/ch/reader/create_pdf.aspx?file_no=5136&journal_id=jos]

Advanced Papers

U-Net [https://arxiv.org/pdf/1505.04597.pdf]
SegNet [https://arxiv.org/pdf/1511.00561.pdf]
DeepLab [https://arxiv.org/pdf/1606.00915.pdf]
FCN [https://arxiv.org/pdf/1605.06211.pdf]
ENet [https://arxiv.org/pdf/1606.02147.pdf]
LinkNet [https://arxiv.org/pdf/1707.03718.pdf]
DenseNet [https://arxiv.org/pdf/1608.06993.pdf]
Tiramisu [https://arxiv.org/pdf/1611.09326.pdf]
DilatedNet [https://arxiv.org/pdf/1511.07122.pdf]
PixelNet [https://arxiv.org/pdf/1609.06694.pdf]
ICNet [https://arxiv.org/pdf/1704.08545.pdf]
ERFNet [http://www.robesafe.uah.es/personal/eduardo.romera/pdfs/Romera17iv.pdf]
RefineNet [https://arxiv.org/pdf/1611.06612.pdf]
PSPNet [https://arxiv.org/pdf/1612.01105.pdf]
CRFasRNN [http://www.robots.ox.ac.uk/%7Eszheng/papers/CRFasRNN.pdf]
Dilated convolution [https://arxiv.org/pdf/1511.07122.pdf]
DeconvNet [https://arxiv.org/pdf/1505.04366.pdf]
FRRN [https://arxiv.org/pdf/1611.08323.pdf]
GCN [https://arxiv.org/pdf/1703.02719.pdf]
DUC, HDC [https://arxiv.org/pdf/1702.08502.pdf]
Segaware [https://arxiv.org/pdf/1708.04607.pdf]
Semantic Segmentation using Adversarial Networks [https://arxiv.org/pdf/1611.08408.pdf]

Tutorial

Semantic Image Segmentation with Deep Learning
- [http://www.robots.ox.ac.uk/~sadeep/files/crfasrnn_presentation.pdf]
A 2017 Guide to Semantic Segmentation with Deep Learning
- [http://blog.qure.ai/notes/semantic-segmentation-deep-learning-review]
Image Segmentation with Tensorflow using CNNs and Conditional Random Fields
- [http://warmspringwinds.github.io/tensorflow/tf-slim/2016/12/18/image-segmentation-with-tensorflow-using-cnns-and-conditional-random-fields/]

Video Tutorials

CS231n: Convolutional Neural Networks for Visual Recognition Lecture 11 Detection and Segmentation
- [http://cs231n.stanford.edu/syllabus.html]
Machine Learning for Semantic Segmentation - Basics of Modern Image Analysis
- [https://www.youtube.com/watch?v=psLChcm8aiU]

Code

Semantic segmentation

U-Net (https://arxiv.org/pdf/1505.04597.pdf)
- https://lmb.informatik.uni-freiburg.de/people/ronneber/u-net/ (Caffe - Matlab)
- https://github.com/jocicmarko/ultrasound-nerve-segmentation (Keras)
- https://github.com/EdwardTyantov/ultrasound-nerve-segmentation (Keras)
- https://github.com/ZFTurbo/ZF_UNET_224_Pretrained_Model (Keras)
- https://github.com/yihui-he/u-net (Keras)
- https://github.com/jakeret/tf_unet (Tensorflow)
- https://github.com/DLTK/DLTK/blob/master/examples/Toy_segmentation/simple_dltk_unet.ipynb (Tensorflow)
- https://github.com/divamgupta/image-segmentation-keras (Keras)
- https://github.com/ZijunDeng/pytorch-semantic-segmentation (PyTorch)
- https://github.com/akirasosa/mobile-semantic-segmentation (Keras)
- https://github.com/orobix/retina-unet (Keras)
SegNet (https://arxiv.org/pdf/1511.00561.pdf)
- https://github.com/alexgkendall/caffe-segnet (Caffe)
- https://github.com/developmentseed/caffe/tree/segnet-multi-gpu (Caffe)
- https://github.com/preddy5/segnet (Keras)
- https://github.com/imlab-uiip/keras-segnet (Keras)
- https://github.com/andreaazzini/segnet (Tensorflow)
- https://github.com/fedor-chervinskii/segnet-torch (Torch)
- https://github.com/0bserver07/Keras-SegNet-Basic (Keras)
- https://github.com/tkuanlun350/Tensorflow-SegNet (Tensorflow)
- https://github.com/divamgupta/image-segmentation-keras (Keras)
- https://github.com/ZijunDeng/pytorch-semantic-segmentation (PyTorch)
- https://github.com/chainer/chainercv/tree/master/examples/segnet (Chainer)
- https://github.com/ykamikawa/keras-SegNet (Keras)
DeepLab (https://arxiv.org/pdf/1606.00915.pdf)
- https://bitbucket.org/deeplab/deeplab-public/ (Caffe)
- https://github.com/cdmh/deeplab-public (Caffe)
- https://bitbucket.org/aquariusjay/deeplab-public-ver2 (Caffe)
- https://github.com/TheLegendAli/DeepLab-Context (Caffe)
- https://github.com/msracver/Deformable-ConvNets/tree/master/deeplab (MXNet)
- https://github.com/DrSleep/tensorflow-deeplab-resnet (Tensorflow)
- https://github.com/muyang0320/tensorflow-deeplab-resnet-crf (TensorFlow)
- https://github.com/isht7/pytorch-deeplab-resnet (PyTorch)
- https://github.com/bermanmaxim/jaccardSegment (PyTorch)
- https://github.com/martinkersner/train-DeepLab (Caffe)
- https://github.com/chenxi116/TF-deeplab (Tensorflow)
FCN (https://arxiv.org/pdf/1605.06211.pdf)
- https://github.com/vlfeat/matconvnet-fcn (MatConvNet)
- https://github.com/shelhamer/fcn.berkeleyvision.org (Caffe)
- https://github.com/MarvinTeichmann/tensorflow-fcn (Tensorflow)
- https://github.com/aurora95/Keras-FCN (Keras)
- https://github.com/mzaradzki/neuralnets/tree/master/vgg_segmentation_keras (Keras)
- https://github.com/k3nt0w/FCN_via_keras (Keras)
- https://github.com/shekkizh/FCN.tensorflow (Tensorflow)
- https://github.com/seewalker/tf-pixelwise (Tensorflow)
- https://github.com/divamgupta/image-segmentation-keras (Keras)
- https://github.com/ZijunDeng/pytorch-semantic-segmentation (PyTorch)
- https://github.com/wkentaro/pytorch-fcn (PyTorch)
- https://github.com/wkentaro/fcn (Chainer)
- https://github.com/apache/incubator-mxnet/tree/master/example/fcn-xs (MxNet)
- https://github.com/muyang0320/tf-fcn (Tensorflow)
- https://github.com/ycszen/pytorch-seg (PyTorch)
- https://github.com/Kaixhin/FCN-semantic-segmentation (PyTorch)
ENet (https://arxiv.org/pdf/1606.02147.pdf)
- https://github.com/TimoSaemann/ENet (Caffe)
- https://github.com/e-lab/ENet-training (Torch)
- https://github.com/PavlosMelissinos/enet-keras (Keras)
LinkNet (https://arxiv.org/pdf/1707.03718.pdf)
- https://github.com/e-lab/LinkNet (Torch)
DenseNet (https://arxiv.org/pdf/1608.06993.pdf)
- https://github.com/flyyufelix/DenseNet-Keras (Keras)
Tiramisu (https://arxiv.org/pdf/1611.09326.pdf)
- https://github.com/0bserver07/One-Hundred-Layers-Tiramisu (Keras)
- https://github.com/SimJeg/FC-DenseNet (Lasagne)
DilatedNet (https://arxiv.org/pdf/1511.07122.pdf)
- https://github.com/nicolov/segmentation_keras (Keras)
PixelNet (https://arxiv.org/pdf/1609.06694.pdf)
- https://github.com/aayushbansal/PixelNet (Caffe)
ICNet (https://arxiv.org/pdf/1704.08545.pdf)
- https://github.com/hszhao/ICNet (Caffe)
ERFNet (http://www.robesafe.uah.es/personal/eduardo.romera/pdfs/Romera17iv.pdf)
- https://github.com/Eromera/erfnet (Torch)
RefineNet (https://arxiv.org/pdf/1611.06612.pdf)
- https://github.com/guosheng/refinenet (MatConvNet)
PSPNet (https://arxiv.org/pdf/1612.01105.pdf)
- https://github.com/hszhao/PSPNet (Caffe)
- https://github.com/ZijunDeng/pytorch-semantic-segmentation (PyTorch)
- https://github.com/mitmul/chainer-pspnet (Chainer)
- https://github.com/Vladkryvoruchko/PSPNet-Keras-tensorflow (Keras/Tensorflow)
- https://github.com/pudae/tensorflow-pspnet (Tensorflow)
CRFasRNN (http://www.robots.ox.ac.uk/%7Eszheng/papers/CRFasRNN.pdf)
- https://github.com/torrvision/crfasrnn (Caffe)
- https://github.com/sadeepj/crfasrnn_keras (Keras)
Dilated convolution (https://arxiv.org/pdf/1511.07122.pdf)
- https://github.com/fyu/dilation (Caffe)
- https://github.com/fyu/drn#semantic-image-segmentataion (PyTorch)
- https://github.com/hangzhaomit/semantic-segmentation-pytorch (PyTorch)
DeconvNet (https://arxiv.org/pdf/1505.04366.pdf)
- http://cvlab.postech.ac.kr/research/deconvnet/ (Caffe)
- https://github.com/HyeonwooNoh/DeconvNet (Caffe)
- https://github.com/fabianbormann/Tensorflow-DeconvNet-Segmentation (Tensorflow)
FRRN (https://arxiv.org/pdf/1611.08323.pdf)
- https://github.com/TobyPDE/FRRN (Lasagne)
GCN (https://arxiv.org/pdf/1703.02719.pdf)
- https://github.com/ZijunDeng/pytorch-semantic-segmentation (PyTorch)
- https://github.com/ycszen/pytorch-seg (PyTorch)
DUC, HDC (https://arxiv.org/pdf/1702.08502.pdf)
- https://github.com/ZijunDeng/pytorch-semantic-segmentation (PyTorch)
- https://github.com/ycszen/pytorch-seg (PyTorch)
Segaware (https://arxiv.org/pdf/1708.04607.pdf)
- https://github.com/aharley/segaware (Caffe)
Semantic Segmentation using Adversarial Networks (https://arxiv.org/pdf/1611.08408.pdf)
- https://github.com/oyam/Semantic-Segmentation-using-Adversarial-Networks (Chainer)

Instance aware segmentation

FCIS [https://arxiv.org/pdf/1611.07709.pdf]
- https://github.com/msracver/FCIS [MxNet]
MNC [https://arxiv.org/pdf/1512.04412.pdf]
- https://github.com/daijifeng001/MNC [Caffe]
DeepMask [https://arxiv.org/pdf/1506.06204.pdf]
- https://github.com/facebookresearch/deepmask [Torch]
SharpMask [https://arxiv.org/pdf/1603.08695.pdf]
- https://github.com/facebookresearch/deepmask [Torch]
Mask-RCNN [https://arxiv.org/pdf/1703.06870.pdf]
- https://github.com/CharlesShang/FastMaskRCNN [Tensorflow]
- https://github.com/jasjeetIM/Mask-RCNN [Caffe]
- https://github.com/TuSimple/mx-maskrcnn [MxNet]
- https://github.com/matterport/Mask_RCNN [Keras]
RIS [https://arxiv.org/pdf/1511.08250.pdf]
- https://github.com/bernard24/RIS [Torch]
FastMask [https://arxiv.org/pdf/1612.08843.pdf]
- https://github.com/voidrank/FastMask [Caffe]

Satellite images segmentation

Video segmentation

Autonomous driving

Annotation Tools:

Datasets

Stanford Background Dataset [http://dags.stanford.edu/projects/scenedataset.html]
Sift Flow Dataset [http://people.csail.mit.edu/celiu/SIFTflow/]
Barcelona Dataset [http://www.cs.unc.edu/~jtighe/Papers/ECCV10/]
Microsoft COCO dataset [http://mscoco.org/]
MSRC Dataset [http://research.microsoft.com/en-us/projects/objectclassrecognition/]
LITS Liver Tumor Segmentation Dataset [https://competitions.codalab.org/competitions/15595]
KITTI [http://www.cvlibs.net/datasets/kitti/eval_road.php]
Data from Games dataset [https://download.visinf.tu-darmstadt.de/data/from_games/]
Human parsing dataset [https://github.com/lemondan/HumanParsing-Dataset]
Silenko person database [https://github.com/Maxfashko/CamVid]
Mapillary Vistas Dataset [https://www.mapillary.com/dataset/vistas]
Microsoft AirSim [https://github.com/Microsoft/AirSim]
MIT Scene Parsing Benchmark [http://sceneparsing.csail.mit.edu/]
COCO 2017 Stuff Segmentation Challenge [http://cocodataset.org/#stuff-challenge2017]
ADE20K Dataset [http://groups.csail.mit.edu/vision/datasets/ADE20K/]
INRIA Annotations for Graz-02 [http://lear.inrialpes.fr/people/marszalek/data/ig02/]

Competitions

Domain Experts

Jonathan Long
- [http://people.eecs.berkeley.edu/~jonlong/]
Liang-Chieh Chen
- [http://liangchiehchen.com/]
Hyeonwoo Noh
- [http://cvlab.postech.ac.kr/~hyeonwoonoh/]
Bharath Hariharan
- [http://home.bharathh.info/]
Fisher Yu
- [http://www.yf.io/]
Vijay Badrinarayanan
- [https://sites.google.com/site/vijaybacademichomepage/home/papers]
Guosheng Lin
- [https://sites.google.com/site/guoshenglin/]

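This is a preliminary version compiled with limited expertise. Corrections and additions to anything wrong or incomplete are welcome, and the list will be kept up to date. This article is original content from the Zhuanzhi content team and may not be reproduced without permission. To request permission, email fangquanyi@gmail.com or contact the Zhuanzhi assistant on WeChat (Rancho_Fang).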
