Ozan Oktay 2017 (24 Aug 2018 Reading Notes)

Title: Anatomically Constrained Neural Networks (ACNN) Application to Cardiac Image Enhancement and Segmentation Paper Link

Authors: Ozan Oktay

Association: Biomedical Image Analysis Group, Imperial College London

Submission: 2017

Motivation

CNN based segmentation it is not obvious how to incorporate such prior knowledge. State-of-the-art methods operate as pixel-wise classifiers where the training objectives do not incorporate the structure and inter-dependencies of the output.

we propose a generic training strategy that incorporates anatomical prior knowledge into CNNs through a new regularisation model, which is trained end-to-end. The new framework encourages models to follow the global anatomical properties of the underlying anatomy (e.g. shape, label structure) via learnt non-linear representations of the shape

Contributions

(0) learn anatomical shape variations from medical images we propose a generic and novel technique to incorporate priors on shape and label structure into NNs for medical image analysis tasks. In this way, we can constrain the NN training process and guide the NN to make anatomically more meaningful predictions

(1)

(2)

(3)

(4)

Problem 1

The cross-entropy loss function operates on individual pixel level class predictions, which does not guarantee global consistency and plausible anatomical shapes even though the segmentation network has a receptive field larger than the size of structures to be segmented. This is due to the fact that back-propagated gradients are parametrised only by pixel-wise individual probability divergence terms and thus provide little global context.

Solutions

class prediction label maps are passed through the AE to obtain a lower dimensional (e.g. 64 dimensions) parametrisation of the segmentation and its underlying structure [40]. By performing AE-based non-linear lower dimensional projections on both predictions and groundtruth labels, as shown in Fig. 4, we can build our ACNN

Methods

Loss function

consisting of two components:

penalizes differences in appearance L_sim

And set L_sim to the negative local cross-correlation of M(φ) and F.

penalizes local spatial variations in φ L_smooth