Questions tagged [tensorflow-gradient]

The tag for questions regarding gradient computation, auto-differentiation and evaluation in tensorflow

24 questions

votes

1 answer

How to monitor gradient vanish and explosion in keras with tensorboard?

I would like to monitor the gradient changes in tensorboard with keras to decide whether gradient vanish or explosion. What should I do?

asked Apr 26 '18 at 00:59

Joey Chia

votes

1 answer

How to print the gradients during training in Tensorflow?

In order to debug a Tensorflow model I need to see whether the gradients get changed or if there any nans in them. Simply printing a variable in Tensorflow does not work because all you see is:

python variables tensorflow machine-learning tensorflow-gradient

asked Mar 04 '18 at 19:48

minerals

6,090
17
62
107

votes

1 answer

OpenAI Gradient Checkpointing with Tensorflow Eager Execution

I have recently switched to Tensorflow Eager (currently working with TF 1.8.0) and like it a lot. However, I now have quite a large model which does not fit into my GPU Memory (GTX 1080Ti, 12GB VRAM) when run with the Gradient Tape which is needed…

python tensorflow gradient-descent tensorflow-gradient

asked Jun 08 '18 at 09:42

Nemorior

votes

0 answers

RAM Usage keeps growing while training reinforcement learning agent

The other day I started training my Atari Breakout reinforcement learning agent. But after an hour and a half or so I noticed my screen started freezing and it became very difficult to interact with the computer via mouse. So, I decided I'd rerun…

tensorflow tensorflow2.0 tf.keras tensorflow-gradient

asked Feb 17 '23 at 12:32

Bryan Carty

votes

1 answer

How to accumulate my loss over mini batches then calculate my gradient

My main question is; is averaging the loss the same thing as averaging the gradient and how do i accumulate my loss over mini batches then calculate my gradient? I have been trying to implement policy gradient in Tensorflow and run into the issue…

python tensorflow reinforcement-learning tensorflow-gradient policy-gradient-descent

asked Mar 17 '19 at 16:59

Mike Jankowiak

votes

0 answers

making a memory efficient GAN in keras: `clear_session` causing conflicting tensorflow graphs

I have written the code for a general adversarial network that will run for 4000 epochs, however, after 2000 epochs- the model compiling time and memory usage become very inefficient and the code runs extremely slowly. I would like to make my code…

tensorflow keras tensorflow-gradient

asked Feb 06 '19 at 23:24

xenopus

votes

1 answer

stop_gradient in tensorflow

I am wondering if tf.stop_gradient stops the gradient computation of just a given op, or stops the update of its input tf.variable ? I have the following problem - During the forward path computation in MNIST, I would like to perform a set of…

tensorflow tensorflow-gradient

asked May 07 '18 at 20:15

user1193538

vote

1 answer

TensorFlow gradient of matrix wrt a matrix is not making sense

Let's say i have two matrix tf_t (shape : 5x3 ) and tf_b ( shape : 3x3). y_tf = tf.matmul(tf_t, tf_b) and then I've computed dy/dt using tf.gradient api import tensorflow as tf mat = [[0.8363, 0.4719, 0.9783], [0.3379, 0.6548, 0.3835], …

tensorflow autograd tensorflow-gradient

asked Nov 16 '20 at 11:09

Anish

vote

2 answers

How to assign custom gradient to TensorFlow op with multiple inputs

I'm trying to use TensorFlow's @tf.custom_gradient functionality to assign a custom gradient to a function with multiple inputs. I can put together a working setup for only one input, but not for two or more. I've based my code on TensorFlow's…

python tensorflow tensorflow-gradient

asked Jan 04 '19 at 23:45

aedificatori

vote

1 answer

Using tensorflow, how can you average parameter gradient values over a number of batches and update using that average?

Like many people developing deep learning models, I am hindered by my hardware resources, namely GPU memory. I have an audio classification problem for which I am trying out a number of RNNs. The data is very large and I am only able to use small…

python tensorflow recurrent-neural-network tensorflow-gradient

asked Sep 26 '18 at 15:02

michael_question_answerer

vote

1 answer

Can I get the gradient of a tensor with respect to the input without applying the input?

For example, I need to compute the gradient of the cross_entropy with respect to x, but I need to apply another value to the gradient function. That is: f'(x)|x = x_t I think tf.gradients() function will only give the gradient at x = x. So does…

python tensorflow machine-learning tensor tensorflow-gradient

asked Apr 10 '18 at 13:52

Sharyn Hu

vote

2 answers

tensorflow: change in parameter weights is different than it should be based on gradients

EDIT: Solved -- it was the stupidity of using different training examples for the gradients vs the optimizer update. OK this has me totally stumped. I have a parameter vector, let's call it w. w = [-1.34554319, 0.86998659, 0.52366061, 2.6723526…

python tensorflow tensorflow-gradient

asked Mar 21 '18 at 08:21

anon

vote

1 answer

Tensorflow - No gradients provided for any variable

I am experimenting some code on Jupyter and keep getting stuck here. Things work actually fine if I remove the line starting with "optimizer = ..." and all references to this line. But if I put this line in the code, it gives an error. I am not…

tensorflow neural-network deep-learning tensorflow-gradient

asked Mar 15 '18 at 00:56

edn

1,981
3
26
56

vote

1 answer

Updating a tensor in tensorflow

I have defined an unsupervised problem in tensorflow, I need to update my B and my tfZ with every iteration, but I don't know how to update my tfZ using the tensorflow session. tfY = tf.placeholder(shape=(15, 15), dtype=tf.float32) with…

python tensorflow variable-assignment tensorflow-gradient

asked Mar 07 '18 at 09:29

NicolaiF

1,283
1
20
44

vote

1 answer

Tensorflow: Gradient Calculation from Input to Output

I would like to calculate the gradients of the output of a neural network with respect to the input. I have the following tensors: Input: (num_timesteps, features) Output: (num_timesteps, 1) For the gradients from the inputs to the entire output…

python tensorflow machine-learning artificial-intelligence tensorflow-gradient

asked Mar 22 '17 at 10:55

Torben

2 Next