I have a V-Net to segment an organ from a bunch of CT images. However, the training data varies in depth; some example shapes ([Batch, Channels, D, H, W]):
[32, 1, 34, 512, 512]
[32, 1, 125, 512, 512]
[32, 1, 80, 512, 512]
The training dataloader requires every batch to have the same dimensions before being passed into the network. I have tried implementing a random spatial crop of [128, 128, 64] as a dataloader transform, partly due to memory issues, but the results look wrong (I suspect most of the crops don't contain the organ of interest, so the outputs are all zeros). Any suggestions for a workaround?
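For context, my crop logic is roughly the following (a minimal NumPy sketch of what I'm doing, assuming (D, H, W) arrays and volumes already padded to at least the crop size; `random_crop` is just my own helper, not from a library):

```python
import numpy as np

def random_crop(image, label, size=(64, 128, 128)):
    """Randomly crop a (D, H, W) volume and its label to `size`.

    Purely random: there is no guarantee the crop contains any
    foreground, which is probably why most of my patches end up
    all-background. Assumes the volume is at least `size` in every
    axis (pad first otherwise).
    """
    d, h, w = image.shape
    cd, ch, cw = size
    z = np.random.randint(0, d - cd + 1)
    y = np.random.randint(0, h - ch + 1)
    x = np.random.randint(0, w - cw + 1)
    return (image[z:z + cd, y:y + ch, x:x + cw],
            label[z:z + cd, y:y + ch, x:x + cw])
```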
I was thinking of preprocessing the data first and cropping depth-wise, using the labels to find the depth range that actually contains the organ.
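Something like this is what I had in mind for that preprocessing step (just a sketch; `crop_depth_to_label` and the margin value are placeholders I made up):

```python
import numpy as np

def crop_depth_to_label(image, label, margin=8):
    """Crop a (D, H, W) volume depth-wise to the axial slices that
    contain foreground in the label, plus a small margin.
    """
    # Indices of axial slices with at least one labelled voxel.
    fg_slices = np.where(label.reshape(label.shape[0], -1).any(axis=1))[0]
    if fg_slices.size == 0:
        return image, label  # no foreground: leave the volume alone
    lo = max(fg_slices[0] - margin, 0)
    hi = min(fg_slices[-1] + margin + 1, label.shape[0])
    return image[lo:hi], label[lo:hi]
```

Would this be a sensible approach, or is there a more standard way to bias the crops toward the foreground, like MONAI's RandCropByPosNegLabeld?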