Highest Voted 'multimodal' Questions

0

votes

3 answers

Neural Network parameters are not being updated

I tried to train a multimodal model on the 2D heat equation. Context: the best I have is a CNN with a 5*5 kernel that is optimised to output temperature maps for a given diffusion coefficient. Now I try to give the model other coefficient and…

asked May 17 '23 at 16:41

Antoine Grand

11
2

0

votes

0 answers

Is there a way to tokenize sentences with Longformer?

I have forked the Multimodal Transformers package and created a new version with Longformer support here --> https://github.com/jtfields/Multimodal-Toolkit-Longformer/tree/master. Georgian.io maintains the Multimodal Transformers package and here…

nlp huggingface-transformers text-classification huggingface-tokenizers multimodal

asked May 04 '23 at 22:59

John Fields

21
4

0

votes

0 answers

Change Keras Distiller() class so that the student and teacher model can have two different inputs

I am doing my Master's thesis on multimodal emotion recognition. More precisely, I want to apply knowledge distillation from a multimodal teacher model to a unimodal student model. I am using the Keras Distiller() class…

python keras artificial-intelligence emotion multimodal

asked Apr 10 '23 at 22:54

rodrigo.cgcdl

1
1

0

votes

0 answers

Is it possible to parallelise a call in the training loop of a multimodal deep learning model?

In the forward pass of my network, I encode two modalities, concatenate them, and use a decoder to produce a final output. Here is some sample code: out_a = self.encoder_a(input_a) out_v = self.encoder_v(input_v) out = torch.cat((out_a, out_v),…

machine-learning deep-learning parallel-processing multimodal

asked Mar 09 '23 at 21:24

Theo Patron

23
6

0

votes

0 answers

How to load multiple, varying references for BLEU metrics with DataLoader?

I have an image captioning dataset, where each sample is composed of an image and a list of captions. Each sample has one or more captions, and the number of captions can differ between samples. Here's a visual example: I am using PyTorch and I…

deep-learning pytorch dataloader bleu multimodal

asked Mar 01 '23 at 12:45

Ciodar

103
1
5
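For the varying-references question above, one possible approach is a custom collate function that leaves each sample's captions as its own list instead of trying to stack them. This is only a sketch: the sample layout `(image, captions)` and the name `collate_keep_references` are assumptions made for illustration, and plain strings stand in for image tensors.

```python
from typing import Any, Dict, List, Tuple

# Assumed (hypothetical) sample layout: (image, list_of_reference_captions).
Sample = Tuple[Any, List[str]]


def collate_keep_references(batch: List[Sample]) -> Dict[str, list]:
    """Group images and keep each sample's captions as a separate list.

    Nothing is padded or stacked, so samples may carry different
    numbers of reference captions.
    """
    images = [image for image, _ in batch]
    references = [list(captions) for _, captions in batch]
    return {"images": images, "references": references}


# Stand-in data; strings replace real image tensors for illustration.
batch = [
    ("img_0", ["a dog runs", "a dog is running on grass"]),
    ("img_1", ["two children play in a park"]),
]
out = collate_keep_references(batch)
```

A function like this could be passed to `torch.utils.data.DataLoader` through its `collate_fn` argument. With real tensors, the images would typically be stacked into one batch tensor while the references stay a list of lists.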

0

votes

0 answers

Keras concatenated model: ValueError: Data cardinality is ambiguous

I am creating a concatenated model using Keras. For now, I am keeping it simple, using only Dense layers and without any kind of hyperparameter optimization. My model should be able to get data from two different datasets, with a different number…

tensorflow keras multimodal

asked Dec 05 '22 at 14:52

Foxtrot_Romeo

16
3
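Keras typically raises "Data cardinality is ambiguous" when the arrays passed together for training do not all have the same number of samples (first dimension). A minimal pre-flight check, assuming the inputs and labels are plain sequences, might look like the sketch below. The helper name `check_same_cardinality` and the example arrays are hypothetical.

```python
from typing import Dict, Sequence


def check_same_cardinality(arrays: Dict[str, Sequence]) -> int:
    """Return the shared sample count, or raise if the counts differ."""
    if not arrays:
        raise ValueError("No arrays given")
    counts = {name: len(values) for name, values in arrays.items()}
    if len(set(counts.values())) > 1:
        raise ValueError(f"Sample counts differ: {counts}")
    return next(iter(counts.values()))


# Hypothetical inputs: two branches with different feature widths
# but the same number of samples, plus one label per sample.
input_a = [[0.1, 0.2, 0.3], [0.4, 0.5, 0.6]]
input_b = [[1.0], [2.0]]
labels = [0, 1]

n_samples = check_same_cardinality(
    {"input_a": input_a, "input_b": input_b, "labels": labels}
)
```

The number of features per branch may differ freely. Only the number of rows has to match across every input and the labels.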

0

votes

0 answers

Why does this error occur in TensorFlow.js when calling model.predict?

If I execute my code, I get the error below. Error: Uncaught (in promise) TypeError: model.predict is not a function. My code: const model = tf.loadLayersModel('./tf_js/model.json'); const video1 =…

tensorflow.js multimodal

asked Sep 13 '22 at 07:07

yeon

5
2

0

votes

0 answers

Downloading the Fakeddit dataset and storing it in Drive through Colab

I am trying to download the Fakeddit dataset (multimodal) through a Colab notebook, but I get the following error when running !python image_downloader.py: usage: image_downloader.py [-h] type image_downloader.py: error: the following arguments are…

python google-colaboratory multimodal

asked Sep 07 '22 at 07:55

Kamonashish Saha

1

0

votes

1 answer

Combining text and tabular data in PyTorch for classification model

I have a dataset that consists of customers and their product purchases for an ecommerce company that sells clothes. Along with this data, I have application logs that show the customer’s interactions on the site. The data looks something like…

python deep-learning nlp pytorch multimodal

asked Sep 02 '22 at 02:19

Negative Correlation

813
1
11
26

0

votes

1 answer

Modelling and fitting bi-modal lognormal distributions in a loop using lmfit

I have been spending FAR too much time trying to figure this out - so time to seek help. I am attempting to use lmfit to fit two lognormals (a and c) as well as the sum of these two lognormals (a+c) to a size distribution. Mode a centers around…

loops distribution lmfit multimodal

asked Mar 09 '22 at 11:01

GingerMonster

33
5
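The model described in the lmfit question above is the sum of two lognormal modes (a and c). As a sketch of that functional form only, the standard lognormal density and a weighted sum of two of them can be written in plain Python. The parameter names (`amp_a`, `mu_a`, `sigma_a`, and so on) and the numeric values are illustrative assumptions, not the asker's values.

```python
import math


def lognormal_pdf(x: float, mu: float, sigma: float) -> float:
    """Lognormal density; mu and sigma describe ln(x)."""
    if x <= 0.0:
        return 0.0
    z = (math.log(x) - mu) / sigma
    return math.exp(-0.5 * z * z) / (x * sigma * math.sqrt(2.0 * math.pi))


def bimodal_lognormal(x: float,
                      amp_a: float, mu_a: float, sigma_a: float,
                      amp_c: float, mu_c: float, sigma_c: float) -> float:
    """Sum of two amplitude-weighted lognormal modes, a + c."""
    return (amp_a * lognormal_pdf(x, mu_a, sigma_a)
            + amp_c * lognormal_pdf(x, mu_c, sigma_c))


# Illustrative parameter values only.
params = dict(amp_a=2.0, mu_a=0.0, sigma_a=0.5,
              amp_c=1.0, mu_c=2.0, sigma_c=0.4)
y = bimodal_lognormal(1.5, **params)
```

In lmfit itself, built-in models can be added together with `+` and kept apart with the `prefix` argument (for instance two `LognormalModel` instances), so a hand-written function like this is not strictly required.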

0

votes

0 answers

Image text retrieval evaluation metric

I am currently working on text-image retrieval where, given a test query, I should be able to retrieve all the related images. For example, 'children playing in park' should retrieve all relevant images. I do know the evaluation metric to…

deep-learning information-retrieval precision-recall multimodal image-text

asked Mar 01 '22 at 02:12

Nidutt

185
2
17
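The excerpt above is cut off before it names a metric, so the following is only one common choice for text-to-image retrieval: Recall@K, computed here as the fraction of a query's relevant images that appear in the top K results. The function names and the image IDs are hypothetical.

```python
from typing import Dict, List, Set


def recall_at_k(ranked_ids: List[str], relevant_ids: Set[str], k: int) -> float:
    """Fraction of relevant items found in the first k ranked results."""
    if not relevant_ids:
        return 0.0
    top_k = set(ranked_ids[:k])
    return len(top_k & relevant_ids) / len(relevant_ids)


def mean_recall_at_k(rankings: Dict[str, List[str]],
                     relevant: Dict[str, Set[str]], k: int) -> float:
    """Average Recall@K over all queries in `rankings`."""
    scores = [recall_at_k(rankings[q], relevant[q], k) for q in rankings]
    return sum(scores) / len(scores)


# Hypothetical ranking for the query from the excerpt.
rankings = {"children playing in park": ["img3", "img7", "img1", "img9"]}
relevant = {"children playing in park": {"img1", "img2"}}

r1 = recall_at_k(rankings["children playing in park"],
                 relevant["children playing in park"], k=1)
r3 = recall_at_k(rankings["children playing in park"],
                 relevant["children playing in park"], k=3)
```

Some retrieval papers instead count a query as a hit if at least one relevant image appears in the top K, so it is worth stating which definition is used when reporting results.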

0

votes

1 answer

Multi-start in OpenMDAO

Is there any straightforward way to use a multi-start option for solving optimization problems in OpenMDAO? I am currently using the SLSQP driver of ScipyOptimize, and would like to start the search from multiple initial points (either in a sequence…

optimization openmdao multimodal

asked Jan 21 '22 at 10:35

Kasia

105
6
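Independently of OpenMDAO, the multi-start idea from the question above can be sketched as a loop that runs a local optimizer from several initial points in sequence and keeps the best result. The sketch below does not use the OpenMDAO API or SLSQP: plain gradient descent on a hypothetical 1-D objective with two local minima stands in for the local solver.

```python
from typing import Callable, List, Tuple


def objective(x: float) -> float:
    """Hypothetical test function with two local minima."""
    return x**4 - 3.0 * x**2 + x


def gradient(x: float) -> float:
    return 4.0 * x**3 - 6.0 * x + 1.0


def local_descent(x0: float, grad: Callable[[float], float],
                  lr: float = 0.01, steps: int = 2000) -> float:
    """Stand-in local optimizer: fixed-step gradient descent."""
    x = x0
    for _ in range(steps):
        x -= lr * grad(x)
    return x


def multi_start(starts: List[float]) -> Tuple[float, float]:
    """Run the local optimizer from each start and keep the best optimum."""
    results = [local_descent(x0, gradient) for x0 in starts]
    best_x = min(results, key=objective)
    return best_x, objective(best_x)


best_x, best_f = multi_start([-2.0, -1.0, 0.0, 1.0, 2.0])
```

With an actual driver, the body of the loop would set the initial design variable values, run the problem once, and record the final objective, while the outer loop stays the same.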

0

votes

1 answer

Deep Learning - generate patches for 3D multimodal data

I have chosen the problem of Semantic Segmentation of Brain Tumors using Deep Learning. I am using the BRATS2015 dataset. It has 274 patient MRI scans each of size 240x240x155. There are four modalities for each patient (T1, T2, T1c, FLAIR). So I am…

python tensorflow deep-learning multimodal

asked Aug 06 '21 at 21:54

Pranjal

8,083
3
8
13
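For the 240x240x155 volumes mentioned above, patch extraction can start from a list of patch origins that cover the whole volume. The sketch below computes those origins only. The 64x64x64 patch size, the stride, and the rule of shifting the last patch back so it ends at the border are assumptions for illustration, not part of the question.

```python
from itertools import product
from typing import List, Tuple


def axis_starts(size: int, patch: int, stride: int) -> List[int]:
    """Start indices along one axis, with a final patch flush to the edge."""
    if patch > size:
        raise ValueError("Patch is larger than the volume along this axis")
    starts = list(range(0, size - patch + 1, stride))
    if starts[-1] != size - patch:
        starts.append(size - patch)  # shift the last patch back to fit
    return starts


def patch_origins(volume_shape: Tuple[int, int, int],
                  patch_shape: Tuple[int, int, int],
                  stride: Tuple[int, int, int]) -> List[Tuple[int, ...]]:
    """All (x, y, z) origins of patches covering the volume."""
    per_axis = [axis_starts(s, p, t)
                for s, p, t in zip(volume_shape, patch_shape, stride)]
    return list(product(*per_axis))


origins = patch_origins((240, 240, 155), (64, 64, 64), (64, 64, 64))
```

Since the four modalities (T1, T2, T1c, FLAIR) share the same grid, the same origins could be used to slice each modality and the segmentation mask, with the modalities stacked as channels.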

0

votes

0 answers

How to create a multimodal dataset?

I want to build a music recommendation system based on a multimodal dataset. I've been looking for a publicly available dataset but couldn't find one anywhere. I want to know where I can get a multimodal dataset for music or books. I watched many YouTube…

machine-learning recommendation-engine multimodal

asked Dec 11 '20 at 06:27

user12613694

0

votes

1 answer

ValueError: Input 0 of layer conv1_pad is incompatible with the layer: expected ndim=4, found ndim=2. Full shape received: [None, 260]

I'm getting an error for a multimodal problem. Input shapes: image input: (3740, 150, 150, 3), word input: (3740, 260), where 3740 is the number of samples. I have attached the model as a function, where build_img_encoder describes the image encoder model and…

tensorflow keras deep-learning tensorflow2.0 multimodal

asked Nov 04 '20 at 08:47

Bhuvan S

213
1
4
10
