How to annotate the ground truth for image segmentation?

Question

I'm trying to train a CNN model that perform image segmentation, but I'm confused how to create the ground truth if I have several image samples?

Image segmentation can classify each pixel in input image to a pre-defined class, such as cars, buildings, people, or any else.

Is there any tools or some good idea to create the ground truth for image segmentation?

Thanks!

score 3 · Answer 1 · answered Nov 15 '16 at 07:11

3

One tool that pops to mind is MIT's LabelMe toolbox: this toolbox is mainly for browsing the existing labeled images of the dataset, but it has an option to annotated new images as well.

There's alos this github repository for COCO UI you might find useful.

answered Nov 15 '16 at 07:11

Shai

111,146
38
238
371

How to label each pixel in to object classes ? Do we have an array after that , the class of the each pixel . – Shamane Siriwardhana Aug 18 '17 at 06:38
@ShamaneSiriwardhana you usually ends up with an indexed image the same size as the original one. – Shai Aug 18 '17 at 06:55
Let's say I have 3 objects in one image. So how can I annotate ? Can you please help me – Shamane Siriwardhana Aug 18 '17 at 06:57
1

A note for those reading now, LabelMe seems quite old and the message board that hosts the FAQ/discussion is shutting down as of Dec 11, 2021. https://www.quicktopic.com/37/H/E4xRZ7fZZhh – ComputerScientist Nov 23 '21 at 23:35

score 3 · Accepted Answer · answered Jan 10 '19 at 14:19

For semantic segmentation every pixel of an image should be labeled. There are three following ways to address the task:

Vector based - polygons, polylines
Pixel based - brush, eraser
AI-powered tools

In Supervisely, tools to perform 1,2,3 are available.

Below are two videos that compare polygon vs AI-powered tools: cars segmentation and food segmentation.

More details about annotation features of Supervisely can be found here.

score 2 · Answer 3 · answered Feb 04 '18 at 01:32

2

Try out https://www.labelbox.io/. Here is what their image segmentation template looks like...

A lot of the code is open source and they have a hosted service to manage labeling end to end.

answered Feb 04 '18 at 01:32

Dan Rasmuson

5,705
5
29
45

score 1 · Answer 4 · answered Jan 20 '19 at 04:24

I created an open source tool called COCO Annotator: It provide any features where other tools fall short:

Directly export to COCO format
Segmentation of objects
Useful API endpoints to analyze data
Import datasets already annotated in COCO format
Annotated disconnected objects as a single instance
Labeling image segments with any number of labels simultaneously
Allow custom metadata for each instance or object
Magic wand/select tool
Generate datasets using google images
User authentication system

score 1 · Answer 5 · answered Dec 30 '19 at 22:26

1

SageMaker Ground Truth has dataset management and UIs to enable semantic segmentation. https://aws.amazon.com/sagemaker/groundtruth/

We recently released an enhancement to the UI which speeds up annotations considerably, by automatically finding region boundaries. Read about it here: https://aws.amazon.com/blogs/machine-learning/auto-segmenting-objects-when-performing-semantic-segmentation-labeling-with-amazon-sagemaker-ground-truth/

answered Dec 30 '19 at 22:26

Trenton

11,678
10
56
60

Once done with annotating the images for segmentation can I export the annotations in coco format? Right now I can only see .png files created for the masks and not the json files – Rohit Gavval Jul 12 '22 at 09:44

score 0 · Answer 6 · edited Jul 25 '22 at 15:50

The overall process is:

Load the data into a tool
Draw a shape. E.G. Using a polygon tool or other shape tools
Export the points and use them for training directly, or convert them into a dense pixel mask.

Annotation Process Detail

Generally speaking, vector based tools are preferred because they are more accurate. For example on the left is 4 polygon points and on the right is a brush.

Counter intuitively it can even be faster to trace out the points. For example here, I simply moved my mouse over the outline which records this:

But it's actually stored as points:

Other shapes are used

For example, you can use an ellipse or a circle to create segmentation masks. There's no requirement to actually manually create a polygon.

Autobordering

One of the main concepts behind wanting to get to 100% coverage of pixels is a concept called autobordering. Basically it just means that the edges of objects intersect well. In Diffgram for example you can select two points and it will create a matching border.

Here's an example:

Finally there are many other concepts like pre-labeling, copying to next, tracking etc.

Tooling

Diffgram's Segmentation Labeling Software is a great tool option for Image and Video Annotation. It's open-core, you can run it on your kubernetes cluster.

score 0 · Answer 7 · answered Sep 21 '22 at 14:27

0

ITK-SNAP is designed for medical image segmentation. It is easy to learn and powerful still. You can try it.

answered Sep 21 '22 at 14:27

Jingnan Jia

1,108
2
12
28

score -1 · Answer 8 · answered Oct 24 '18 at 16:10

you can use averaging and normalization techniques all over your dataset so create your ground truth and then label different structures. for this purpose, you can think about creating a so called |template|. make sure all your data are co registered initially.

How to annotate the ground truth for image segmentation?

8 Answers8

Linked