Questions tagged [multimodal]

32 questions
0
votes
3 answers

Neural Network parameters are not being updated

I tried to train a multi modal model on the 2d heat equation. CONTEXT : The best I have is a CNN with a 5*5 kernel that is optimised to output temperature maps with a given diffusion coefficient. Now I try to give the model other coefficient and…
0
votes
0 answers

Is there a way to tokenize sentences with Longformer?

I have forked the Multimodal Transformers package and created a new version with Longformer support here --> https://github.com/jtfields/Multimodal-Toolkit-Longformer/tree/master. Georgian.io maintains the Multimodal Transformers package and here…
0
votes
0 answers

Change Keras Distiller() class so that the student and teacher model can have two different inputs

I am doing my Master's thesis in Multimodal Emotion Recognition, more precisely, I want to apply knowledge distillation from a multimodal teacher model to an unimodal student model. I am using Keras Distiller() class…
0
votes
0 answers

Is it possible to parallelise a call in the training loop of a multimodal deep learning model

In the forward pass of my network, I encode two modalities, concatenate them, and use a decoder to produce a final output. Here is a sample code: out_a = self.encoder_a(input_a) out_v = self.encoder_v(input_v) out = torch.cat((out_a, out_v),…
0
votes
0 answers

How to load multiple, varying references for BLEU metrics with DataLoader?

I have an Image Captioning dataset, where each sample is composed by an image and a list of captions. Each sample has one or more captions The number of captions can be different for each sample. Here's a visual example: I am using PyTorch and I…
Ciodar
  • 103
  • 1
  • 5
0
votes
0 answers

Keras concatenated model: ValueError: Data cardinality is ambiguous

I am creating a concatenated model using keras. For now, I am keeping it simple, using only Dense layers and without any kind of hyperparameters optimization. My model should be able to get data from two different datasets, with a different number…
0
votes
0 answers

Why is this error in tensorflow js when model predict?

If I execute my code, I get the below error error: Uncaught (in promise) TypeError: model.predict is not a function My code const model = tf.loadLayersModel('./tf_js/model.json'); const video1 =…
yeon
  • 5
  • 2
0
votes
0 answers

Downloading Fakeddit dataset and storing in drive through colab

I am trying to download the Fakeddit dataset (multimodal) through colab notebook, but getting the following error when running !python image_downloader.py: usage: image_downloader.py [-h] type image_downloader.py: error: the following arguments are…
0
votes
1 answer

Combining text and tabular data in PyTorch for classification model

I have a dataset that consists of customers and their product purchases for an ecommerce company that sells clothes.  Along with this data, I have application logs that show the customer’s interactions on the site. The data looks something like…
Negative Correlation
  • 813
  • 1
  • 11
  • 26
0
votes
1 answer

Modelling and fitting bi-modal lognormal distributions in a loop using lmfit

I have been spending FAR too much time trying to figure this out - so time to seek help. I am attempting to use lmfit to fit two lognormals (a and c) as well as the sum of these two lognormals (a+c) to a size distribution. Mode a centers around…
0
votes
0 answers

Image text retrieval evaluation metric

I am currently working on text image retrieval where basically, given a test query, I should be able to retrieve all the related images. For example- 'children playing in park', should retrieve all relevant images. I do know the evaluation metric to…
0
votes
1 answer

Multi-start in OpenMDAO

Is there any straightforward way to use a multi-start option for solving optimization problems in OpenMDAO? I am currently using the SLSQP driver of ScipyOptimize, and would like to start the search from multiple initial points (either in a sequence…
Kasia
  • 105
  • 6
0
votes
1 answer

Deep Learning - generate patches for 3D multimodal data

I have chosen the problem of Semantic Segmentation of Brain Tumors using Deep Learning. I am using the BRATS2015 dataset. It has 274 patient MRI scans each of size 240x240x155. There are four modalities for each patient (T1, T2, T1c, FLAIR). So I am…
Pranjal
  • 8,083
  • 3
  • 8
  • 13
0
votes
0 answers

How to create multimodal data set?

I want to make music recommendation system based on multimodal dataset. Ive been looking for publicly available dataset but I couldn't find it anywhere . I want to know where will I get multimodal dataset for music or book ? I watched many YOUTUBE…
user12613694
0
votes
1 answer

ValueError: Input 0 of layer conv1_pad is incompatible with the layer: expected ndim=4, found ndim=2. Full shape received: [None, 260]

I'm getting an error for Multi-modal problem. Input shape: img input:- (3740, 150, 150, 3), word input:- (3740, 260) where 3740 is the number of samples. Here have attached model as a function where build_img_encoder describes IMG encoder model and…
Bhuvan S
  • 213
  • 1
  • 4
  • 10