Questions tagged [semantic-segmentation]

Semantic segmentation is the task of classifying every pixel in the image to a category or class label.

Semantic segmentation is the task of classifying every pixel in the image to a category. It refers to the process of linking each pixel in an image to a class label. Some of its primary applications are in autonomous vehicles, human-computer interaction & robotics.

532 questions
-1
votes
1 answer

I want to deploy a pytorch segmentation model in a C++ application .. C++ equivalent preprocessing

I want to deploy a pytorch segmentation model in a C++ application. I knew that I have to convert the model to a Torch Script and use libtorch. However, what is C++ equivalent to the following pre-preprocessing (It's Ok to convert opencv, but I…
-1
votes
1 answer

6d pose estimation of an object using Deep learning

I have trained Deeplabv3+ to detect a car charging socket and now I am able to get 3D information of the object in the real world environment using Realsense depth camera. But I need the orientation of the object as well. After searching I found out…
-1
votes
1 answer

Are raw video better than mp4 encoded video as an input for a Neural Network?

I am working on a project for a university exam in computer vision where the objective is to analyse different road scenarios from videos. For example, it is to do instance segmentation of street images to recognise various objects and catalogue…
-1
votes
1 answer

In deep learning what is the difference between Weights Size and Model Size?

In deep learning, what is the difference between Weights Size and Model Size? (often expressed in MB)
-1
votes
1 answer

Swapping face parts relative to facial landmarks

I got an image of an individual with a beard: Using a mask, I was able to extract the beard: I want to move the beard on another person's face, such as this one: I want to do this by getting the nose location of the first person, the nose…
-1
votes
1 answer

How to crop an image by its mask

I have this image: And for the beard, I have this mask: I want to cut the beard out using the mask with a transparent background like this: I followed this SO post's attempt. Here it is: for img in input_images: gaberiel = Image.open(path +…
-1
votes
1 answer

Using Tensorflow random rotation and cropping sequentially throws errors

I am trying to edit this code on Keras page: https://keras.io/examples/vision/deeplabv3_plus/. One thing I want to do is to add an augmentation function for the dataset. This is what I wrote: def image_augmentation(img): img =…
-1
votes
1 answer

Image segmentation with U-net

I am trying to understand if Semantic segmentation with U-NET. Are we training kernels to extract features or are we training a fully connected layer at the end? Or both? Because based on this image: I don't think there is a need for a fully…
-1
votes
1 answer

is there any other good metrics for segmentation task in deep learning apart from IOU?

I would like to ask if there are other good metrics for segmentation tasks in deep learning except for IOU (intersection over union)? Because some times i got NaN results from IOU, Just wondering maybe there are some other metrics that can help to…
-1
votes
2 answers

I want to train the CSRNet model on UCF_CC_50 dataset which has total number of images 50. and face a problem like

I want to train the CSRNet model on UCF_CC_50 dataset but occurring this problem KeyError Traceback (most recent call last) in 4 img= plt.imread(img_path) 5 k =…
Akbar
  • 13
  • 6
-1
votes
1 answer

need guidance on using pre-trained weights in segmentation_models API

I want to use a pre-trained Unet model using segmentation_models API for the Cityscapes dataset, but I need the pre-trained weights for the same. Where can I find the pre-trained weights for a Unet model trained on the Cityscapes dataset? Please…
-1
votes
1 answer

Image to image DataSetIterator using dl4j

I would like to use DeepLearning4j to build and train a U-Net network. To do this I need a dataset iterator that feed the network with an image in input and an other image in output. I am new to DL4j and all the examples I found explain how build…
-1
votes
1 answer

Semantic segmentation dataset organization

I am trying to segment 4 lesions with semantic segmentation. I follow this this great post My training folder has only 2 subfolders with patches: masks and images. Inside the folder with masks, ALL the classes are mixed. The other folder has the…
-1
votes
1 answer

binary_corssentropy loss is negative even I normalize the inputs

I try to use U-net to do the semantic segmentation problem. The mask image is binary. But when training, I find out that my loss is negative. Loss here I use binary_crossentropy. Here is my code: X_train = X_train /255 y_train = y_train /255 X_val =…
-1
votes
1 answer

Nan in summary histogram for: deconv2/biases

The original size of my images is 3900 x 6000 x 3. I make overlapping patches of shape (232024, 28, 28, 3) and then make batches of size 1000. I have a CNN model for semantic segmentation as follows: def conv_layer(inputs, filters, kernel_size,…
Fawad Khalil
  • 357
  • 3
  • 20
1 2 3
35
36