View on GitHub

PhDthesis

PhD thesis in Applied Physics

Image Segmentation

In the previous section we have discussed about the object classification and object detection problems (ref. Object Detection). Now we want to go deeper on this topic and extract the exact pixels which belong to an object into a given image. This kind of problem is called Image Segmentation, i.e give a label to each pixel of the input image.

Image segmentation is a typical task in many research fields and could be used for different purposes. information about pixel-wise position of objects inside an image could be used for extract object shapes from the image or to simplify and/or change the representation of an image into something more meaningful and easier to understand. This is an hot topic especially for self-driving car applications in which we have to find the exact shapes of object to better estimate their perspective position. Moreover, all these applications require fast algorithm as much as possible closed to real-time.

This kind of task can be performed using a pipeline of image processing functions or by training a neural network model. In the first case we have to stack a series of function to process the input image: it has to filters and extracts the useful information about the searched object but most of all it has to be as most general as possible to face the common heterogeneity of samples. In the second case we leave to the neural network model parameters the searching of optimal combination of function but we have to provide a supervised input pattern, i.e a combination of input and annotated pixel-wise mask of each image. The image annotation is one of the most hardest and boring step of image segmentation and for these reasons is very hard to find public dataset usable.

In this chapter we introduce a particular neural network model commonly used in image segmentation problems and we will describe its characteristics and performances. We applied this model to a novel dataset of CT images. The dataset annotation was performed by a custom semi-supervised pipeline of image processing and the neural network model was trained and tested on this dataset. The original data are taken from here [doi:10.1002/jor.22040] and the corresponding annotations are released on here.

next »