Questions tagged [sequence-to-sequence]

This tag is used for Google's deprecated seq2seq framework, an encoder-decoder framework for TensorFlow (the revamped version is called Neural Machine Translation)

94 questions
1
vote
2 answers

How do I convert the output data from the decoder LSTM back to words?

I have an encoder-decoder network mimicking the one produced in this tutorial: https://towardsdatascience.com/how-to-implement-seq2seq-lstm-model-in-keras-shortcutnlp-6f355f3e5639 However, the output of the decoder LSTM will be numbers between 0 and…
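The usual answer to this question is that each decoder time step emits a softmax distribution over the vocabulary, and you recover words by taking the argmax and inverting the tokenizer's word-to-index mapping. A minimal plain-Python sketch (the vocabulary and probabilities here are illustrative, not from the tutorial):

```python
# Map decoder softmax outputs back to words by inverting the word index
# kept from training. Vocabulary and distributions below are made up.
word_index = {"<pad>": 0, "hello": 1, "world": 2, "<eos>": 3}
index_word = {i: w for w, i in word_index.items()}

# One probability distribution over the vocabulary per decoder time step.
decoder_output = [
    [0.1, 0.7, 0.1, 0.1],   # step 1: "hello" is most probable
    [0.0, 0.1, 0.8, 0.1],   # step 2: "world"
    [0.1, 0.1, 0.1, 0.7],   # step 3: "<eos>" ends generation
]

def decode_greedy(probs):
    words = []
    for dist in probs:
        idx = max(range(len(dist)), key=dist.__getitem__)  # argmax
        word = index_word[idx]
        if word == "<eos>":
            break
        words.append(word)
    return " ".join(words)

print(decode_greedy(decoder_output))  # -> "hello world"
```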
1
vote
1 answer

Embedding layer in neural machine translation with attention

I am trying to understand how to implement a seq-to-seq model with attention from this website. My question: does nn.Embedding just return some ID for each word, so that the embedding for each word would be the same during the whole training? Or are they…
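The point the excerpt is circling around is that an embedding layer is not a table of fixed IDs: it is a trainable weight matrix, indexed by token ID, whose rows are updated by backpropagation. A plain-Python sketch of that behavior (toy dimensions and a pretend gradient step, not the actual PyTorch internals):

```python
import random

random.seed(0)

# An embedding layer is a vocab_size x embed_dim table of learnable floats.
vocab_size, embed_dim = 5, 3
table = [[random.uniform(-1, 1) for _ in range(embed_dim)]
         for _ in range(vocab_size)]

def embed(token_id):
    return table[token_id]          # lookup by integer index

before = list(embed(2))

# During training, backprop updates the rows that were looked up, so a
# word's vector changes over the course of training. The gradient and
# learning rate below are made-up illustrative values.
grad = [0.1, -0.2, 0.05]
lr = 0.5
table[2] = [w - lr * g for w, g in zip(table[2], grad)]

after = embed(2)
print(before != after)  # True: the word's vector changed after the update
```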
1
vote
1 answer

Exception in Sequence-to-Sequence model in Keras

I am trying to build a sequence-to-sequence model in Keras using an LSTM and a dense neural network. The encoder encodes the input; the encoded state and the inputs are then concatenated and fed into a decoder, which is an LSTM + dense neural network…
Abdul Rahman
1
vote
0 answers

What is the num_units argument in BahdanauAttention?

I don't understand the argument num_units in TensorFlow's BahdanauAttention. It's defined as "num_units: The depth of the query mechanism," which is unclear to me.
edoost
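In the additive (Bahdanau) attention scheme, num_units is the width of the hidden projection that both the decoder state (query) and each encoder output are mapped into before the tanh; it need not match either input dimension. A plain-Python sketch of where it appears (toy dimensions, random weights, no TensorFlow):

```python
import math
import random

random.seed(1)

# num_units is the size of the shared projection space in the additive
# score: score = v . tanh(W_query @ query + W_memory @ memory_slot).
query_dim, memory_dim, num_units = 4, 6, 3

def rand_matrix(rows, cols):
    return [[random.uniform(-1, 1) for _ in range(cols)] for _ in range(rows)]

def matvec(m, v):
    return [sum(mi * vi for mi, vi in zip(row, v)) for row in m]

W_query = rand_matrix(num_units, query_dim)    # projects the decoder state
W_memory = rand_matrix(num_units, memory_dim)  # projects one encoder output
v = [random.uniform(-1, 1) for _ in range(num_units)]

def score(query, memory_slot):
    hidden = [math.tanh(q + m) for q, m in
              zip(matvec(W_query, query), matvec(W_memory, memory_slot))]
    return sum(vi * hi for vi, hi in zip(v, hidden))

s = score([0.1] * query_dim, [0.2] * memory_dim)
print(isinstance(s, float))  # one scalar score per (query, memory slot) pair
```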
1
vote
1 answer

Merging sequence embedding with Time Series Features

I am having trouble with certain aspects of the Keras implementation of LSTM. This is a description of my problem: I am trying to train a model for word-correctness prediction. My model has two types of inputs: a word sequence (sentence), and a…
1
vote
1 answer

How to build a decoder using dynamic RNN in TensorFlow?

I know how to build an encoder using dynamic RNN in TensorFlow, but my question is: how can we use it for the decoder? Because in the decoder, at each time step we should feed in the prediction of the previous time step. Thanks in advance!
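The reason dynamic RNN does not directly apply is exactly the one stated in the excerpt: the decoder's input at each step is its own previous prediction, so it cannot be unrolled over known inputs. In TensorFlow 1.x this loop is roughly what tf.contrib.seq2seq.dynamic_decode with a GreedyEmbeddingHelper performs. A plain-Python sketch of the loop, with a toy deterministic `step` function standing in for one RNN cell invocation (hypothetical, not a TensorFlow API):

```python
# Greedy decoding loop: feed the previous prediction back in at each step.
GO, EOS = 0, 3

def step(token, state):
    # Toy stand-in for one RNN cell call: next token cycles 0 -> 1 -> 2 -> 3.
    next_token = min(token + 1, EOS)
    return next_token, state

def greedy_decode(initial_state, max_len=10):
    token, state = GO, initial_state
    outputs = []
    for _ in range(max_len):
        token, state = step(token, state)  # previous prediction is the input
        outputs.append(token)
        if token == EOS:
            break
    return outputs

print(greedy_decode(None))  # -> [1, 2, 3]
```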
1
vote
0 answers

TensorFlow BeamSearchDecoder outputs the sample_id as (actual sample_id+1)

Have I written custom code (as opposed to using a stock example script provided in TensorFlow): Yes. Based on the NMT tutorial, I am writing customized code for my own task. OS Platform and Distribution (e.g., Linux Ubuntu 16.04): Linux Ubuntu…
1
vote
1 answer

What is the difference between ensembling and averaging models?

In machine translation, sequence-to-sequence models have become very popular. They often use a few tricks to improve performance, such as ensembling or averaging a set of models. The logic here is that the errors will then "average out". As I…
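The distinction the question asks about can be shown with toy models standing in for networks: ensembling averages the models' *predictions*, while checkpoint/weight averaging averages their *parameters* into a single model. The two coincide only for models that are linear in their parameters, which neural networks are not. A sketch with made-up ReLU "models":

```python
# Two toy one-parameter-pair models: f(x) = relu(w * x + b).
models = [
    lambda x: max(0.0, 2.0 * x - 1.0),   # model 1: w = 2,  b = -1
    lambda x: max(0.0, -2.0 * x + 1.0),  # model 2: w = -2, b = 1
]

def ensemble(x):
    # Ensembling: average the predictions of the individual models.
    return sum(m(x) for m in models) / len(models)

def averaged_model(x):
    # Averaging: average the parameters first -> w = 0, b = 0.
    return max(0.0, 0.0 * x + 0.0)

x = 1.0
print(ensemble(x), averaged_model(x))  # 0.5 vs 0.0: not the same model
```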
1
vote
0 answers

No op named GatherTree when using BeamSearchDecoder

I'm implementing a Seq2Seq model with TensorFlow. My code works using the greedy decoder, but when I used BeamSearchDecoder to improve the performance, I encountered this error: Traceback (most recent call last): File…
1
vote
2 answers

Using CNTK to generate sequence by sampling at each generation step

In a seq2seq model with an encoder and a decoder, at each generation step a softmax layer outputs a distribution over the entire vocabulary. In CNTK, a greedy decoder can be implemented easily by using the C.hardmax function. It looks like this. def…
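Independent of the CNTK specifics, the contrast between hardmax (greedy) selection and sampling at each generation step can be sketched in plain Python: greedy decoding takes the argmax of the step's distribution, while sampling draws a token according to its probability. The vocabulary and distribution below are illustrative:

```python
import random

random.seed(42)

# One step's softmax output over an illustrative four-token vocabulary.
vocab = ["a", "b", "c", "d"]
dist = [0.1, 0.6, 0.2, 0.1]

# Greedy (hardmax-style) selection: always the most probable token.
greedy_token = vocab[max(range(len(dist)), key=dist.__getitem__)]

# Sampling: draw from the distribution, so any token can be generated.
sampled_token = random.choices(vocab, weights=dist, k=1)[0]

print(greedy_token)            # always "b"
print(sampled_token in vocab)  # True
```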
1
vote
0 answers

Create input/output sequences from separate columns in csv for seq2seq decoder in tensorflow

I am trying to experiment with TensorFlow seq2seq and I am having trouble coming up with a good way to add the "GO", "EOS", and "PAD" elements to a sequence of labels. I am reading this data from .csv using tf.TextLineReader, and the .csv I have created…
reese0106
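The conventional layout the question is after, independent of how the data is read: the decoder *input* starts with GO, the decoder *target* ends with EOS, and both are padded to a fixed length with PAD. A plain-Python sketch (the token IDs are illustrative):

```python
# Illustrative special-token IDs; real values come from your vocabulary.
PAD, GO, EOS = 0, 1, 2

def prepare(labels, max_len):
    decoder_input = [GO] + labels        # shifted right: starts with GO
    decoder_target = labels + [EOS]      # ends with EOS
    pad = lambda seq: seq + [PAD] * (max_len - len(seq))
    return pad(decoder_input), pad(decoder_target)

inp, tgt = prepare([5, 6, 7], max_len=6)
print(inp)  # [1, 5, 6, 7, 0, 0]
print(tgt)  # [5, 6, 7, 2, 0, 0]
```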
1
vote
1 answer

TensorFlow continuous text sequence-to-sequence. Why batch?

I'm working through building a sequence-to-sequence Shakespeare predictor, and looking at sample code it seems to do batching in groups of 50 characters. I'm a little confused by this. If the text is continuous and you are processing in…
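The usual reasoning behind the 50-character groups: backpropagation through time has to be truncated somewhere even on continuous text, and stacking several truncated windows into a batch lets them be processed in parallel. A sketch of the data preparation (text and sizes are illustrative):

```python
# Cut a continuous character stream into fixed-length windows, then group
# the windows into batches that are trained on in parallel.
text = "the quick brown fox jumps over the lazy dog " * 10
seq_len = 50

windows = [text[i:i + seq_len] for i in range(0, len(text) - seq_len, seq_len)]

batch_size = 4
batches = [windows[i:i + batch_size] for i in range(0, len(windows), batch_size)]

print(len(windows[0]))  # each window is 50 characters of contiguous text
print(len(batches[0]))  # each batch holds 4 such windows
```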
1
vote
0 answers

Batching Seq2Seq model with Tensorflow Experiment

I'm trying to convert a Seq2Seq model I've run locally to use TensorFlow's distribution capabilities using Estimator and Experiment. The basic features and targets are set up as follows: for every input and response (translation or prompt and…
1
vote
0 answers

TensorFlow 1.0 Seq2Seq Decoder function

I'm trying to make a Seq2Seq regression example for time-series analysis, and I've used the Seq2Seq library as presented at the Dev Summit, which is currently the code on the TensorFlow GitHub r1.0 branch. I have difficulties understanding how the…
1
vote
0 answers

Sampled softmax loss over variable sequence batches?

Background info: I'm working on sequence-to-sequence models, and right now my model accepts variable-length input tensors (not lists) with input shapes corresponding to [batch size, sequence length]. However, in my implementation, sequence length is…