Questions tagged [automatic-differentiation]

Also known as algorithmic differentiation, or AD for short. Techniques that take a procedure evaluating a numerical function and transform it into a procedure that additionally evaluates directional derivatives, gradients, and higher-order derivatives.

Techniques include

  • operator overloading for dual numbers,
  • operator overloading to extract the operation sequence as a tape,
  • code analysis and transformation.

For a function with input of dimension n and output of dimension m that requires L elementary operations for its evaluation, one directional derivative or one gradient can be computed with no more than about 3*L operations.
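As a rough illustration of the first technique above, here is a minimal forward-mode sketch built on operator overloading for dual numbers. It uses only the Python standard library; the Dual class and the function f are invented for this illustration, not taken from any particular package.

    # Minimal forward-mode AD via dual numbers (illustrative sketch only).
    class Dual:
        def __init__(self, value, deriv=0.0):
            self.value = value   # f(x)
            self.deriv = deriv   # f'(x), carried alongside the value

        def __add__(self, other):
            other = other if isinstance(other, Dual) else Dual(other)
            return Dual(self.value + other.value, self.deriv + other.deriv)

        __radd__ = __add__

        def __mul__(self, other):
            other = other if isinstance(other, Dual) else Dual(other)
            # product rule: (u*v)' = u'*v + u*v'
            return Dual(self.value * other.value,
                        self.deriv * other.value + self.value * other.deriv)

        __rmul__ = __mul__

    def f(x):
        return 3 * x * x + 2 * x   # any code built from the overloaded ops works

    x = Dual(2.0, 1.0)             # seed the derivative dx/dx = 1
    y = f(x)
    print(y.value, y.deriv)        # 16.0 and f'(2) = 14.0

The derivative comes out alongside the value in a single evaluation, which is where the small constant-factor overhead quoted above comes from.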

The accuracy of the derivative is, automatically, nearly as good as the accuracy of the function evaluation.

Other differentiation methods are

  • symbolic differentiation, where an expanded expression for the derivatives is obtained first, which can become very large depending on the implementation, and
  • numerical differentiation by divided differences, which provides less accuracy at comparable effort, or comparable accuracy at higher effort.
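For contrast with the dual-number sketch above, a quick divided-difference example in Python (the function and step size are invented for illustration): the step h trades truncation error against round-off error, which is why the accuracy is limited compared to AD at similar cost.

    import math

    def f(x):
        return math.sin(x)

    x, h = 1.0, 1e-5
    forward = (f(x + h) - f(x)) / h             # first-order divided difference
    central = (f(x + h) - f(x - h)) / (2 * h)   # second-order divided difference
    exact = math.cos(x)
    print(abs(forward - exact), abs(central - exact))   # roughly 4e-6 vs roughly 1e-11

No choice of h removes both error sources at once, whereas the dual-number derivative above is exact up to ordinary floating-point round-off.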

See Wikipedia and autodiff.org.

192 questions
4
votes
0 answers

Convoluted tree structure causes the GC to pause indefinitely

I am doing some machine learning self study and currently I am implementing reverse mode automatic differentiation as practice. The way the program works is by essentially overloading common expressions like multiplication, addition and so on and…
Marko Grdinić
  • 3,798
  • 3
  • 18
  • 21
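The question above concerns reverse mode built on operator overloading. A minimal, unoptimized sketch of that idea in Python (the Var class and backward function are invented for this example) looks roughly like this; note that the parent references keep the whole expression graph alive, which is exactly the kind of deeply nested object tree that can stress a garbage collector.

    # Minimal reverse-mode AD sketch: overloaded ops record a graph, backward() replays it.
    class Var:
        def __init__(self, value, parents=()):
            self.value = value
            self.parents = parents   # pairs of (parent node, local partial derivative)
            self.grad = 0.0

        def __add__(self, other):
            return Var(self.value + other.value, ((self, 1.0), (other, 1.0)))

        def __mul__(self, other):
            return Var(self.value * other.value,
                       ((self, other.value), (other, self.value)))

    def backward(output):
        # visit nodes in reverse topological order so each node's grad is
        # complete before it is propagated to its parents
        order, seen = [], set()
        def visit(node):
            if id(node) not in seen:
                seen.add(id(node))
                for parent, _ in node.parents:
                    visit(parent)
                order.append(node)
        visit(output)
        output.grad = 1.0
        for node in reversed(order):
            for parent, local in node.parents:
                parent.grad += local * node.grad

    x, y = Var(2.0), Var(3.0)
    z = x * y + x
    backward(z)
    print(x.grad, y.grad)   # dz/dx = y + 1 = 4.0, dz/dy = x = 2.0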
4
votes
1 answer

How to do automatic differentiation on complex datatypes?

Given a very simple Matrix definition based on Vector: import Numeric.AD import qualified Data.Vector as V newtype Mat a = Mat { unMat :: V.Vector a } scale' f = Mat . V.map (*f) . unMat add' a b = Mat $ V.zipWith (+) (unMat a) (unMat b) sub' a b…
fho
  • 6,787
  • 26
  • 71
4
votes
2 answers

How does theano implement computing every function's gradient?

I have a question about Theano's implementation. How does Theano get the gradient of every loss function via the following function (T.grad)? Thank you for your help. gparams = T.grad(cost, self.params)
Issac
  • 311
  • 1
  • 5
  • 10
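In short, T.grad works symbolically: it walks the computational graph of the cost expression, applies each op's registered gradient rule, and returns new symbolic expressions rather than numbers. A minimal sketch of the usual pattern (variable names invented; Theano itself is unmaintained, but its successors Aesara/PyTensor keep essentially the same API):

    import theano
    import theano.tensor as T

    x = T.dscalar('x')
    cost = x ** 2 + 3 * x           # a symbolic expression, not a number
    g = T.grad(cost, x)             # symbolic derivative: 2*x + 3
    f = theano.function([x], g)     # compile the gradient expression
    print(f(2.0))                   # 7.0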
4
votes
2 answers

Java - Computation of Derivatives with Apache Commons Math Library

I have a problem using the Apache Commons Math library. I just want to create functions like f(x) = 4x^2 + 2x and compute the derivative of this function, f'(x) = 8x + 2. I read the article about Differentiation…
4
votes
1 answer

Haskell ad package

I want to use the ad automatic differentiation package for learning neural network weights in Haskell. I have found some functions that might just have what I need, however I can't figure out what they expect as the first parameter. It must be the…
laci37
  • 510
  • 4
  • 17
3
votes
1 answer

Automatic Differentiation with respect to rank-based computations

I'm new to automatic differentiation programming, so this may be a naive question. Below is a simplified version of what I'm trying to solve. I have two input arrays - a vector A of size N and a matrix B of shape (N, M), as well as a parameter vector…
3
votes
0 answers

Why does tf.GradientTape() use less GPU memory when watching model variables manually?

So when I use tf.GradientTape() to automatically monitor the trainable variables in a resnet model, the computer threw an out of memory error. Below is the code: x_mini = preprocess_input(x_train) with tf.GradientTape() as tape: outputs =…
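One pattern relevant to this question is turning off automatic variable watching and watching only the tensors you need, so the tape records fewer intermediates. A small sketch with an invented toy layer (not the asker's resnet):

    import tensorflow as tf

    x = tf.random.normal((8, 4))
    dense = tf.keras.layers.Dense(2)
    _ = dense(x)                                   # build the layer so its variables exist

    with tf.GradientTape(watch_accessed_variables=False) as tape:
        tape.watch(dense.trainable_variables)      # watch only these variables
        loss = tf.reduce_sum(dense(x) ** 2)

    grads = tape.gradient(loss, dense.trainable_variables)
    print([g.shape for g in grads])                # kernel (4, 2) and bias (2,)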
3
votes
0 answers

How to obtain the Jacobian Matrix with respect to the inputs of a keras model neural network?

I recently started learning and using automatic differentiation to determine the gradients and Jacobian matrix of a neural network with respect to a given input. The methods suggested by TensorFlow are tape.gradient and tape.jacobian.…
Derrick
  • 31
  • 1
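For a Jacobian with respect to the inputs (rather than the trainable variables), one commonly watches the input tensor explicitly and then calls tape.jacobian. A small sketch with an invented toy model, not the asker's network:

    import tensorflow as tf

    model = tf.keras.Sequential([
        tf.keras.layers.Dense(3, activation='tanh'),
        tf.keras.layers.Dense(2),
    ])

    x = tf.random.normal((4, 5))    # batch of 4 inputs with 5 features

    with tf.GradientTape() as tape:
        tape.watch(x)               # inputs are plain tensors, so watch them explicitly
        y = model(x)

    J = tape.jacobian(y, x)         # shape (4, 2, 4, 5): d y[i, j] / d x[k, l]
    print(J.shape)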
3
votes
1 answer

Plotting output of ForwardDiff in Julia

I would just like to use the ForwardDiff.jl functionality to define a function and plot its gradient (evaluated using ForwardDiff.gradient). It seems not to be working because the output of ForwardDiff.gradient is this weird Dual type thing, and it's…
Conor
  • 691
  • 5
  • 14
3
votes
1 answer

How to use promote rule in Julia?

I'm trying to write a struct to compute the gradient (following https://www.youtube.com/watch?v=rZS2LGiurKY) this is what I have so far: struct GRAD{F <: Array{Float64,2}, ∇F <:Array{Float64,2}} f::F ∇f::∇F end begin import Base:…
3
votes
1 answer

Representing a computational graph in Haskell

I'm trying to write a simple automatic differentiation package in Haskell. What are the efficient ways to represent a type-safe (directed) computational graph in Haskell? I know that the ad package uses the "data-reify" method for that but I'm not…
3
votes
1 answer

Why do TensorFlow and PyTorch gradients of the eigenvalue decomposition differ from each other and the analytic solution?

The following code computes the eigenvalue decomposition of a real symmetric matrix. Then, the gradient of the first eigenvalue with respect to the matrix is computed. This is done three times: 1) using the analytic formula, 2) using TensorFlow, 3)…
3
votes
1 answer

Ranges with Dual Numbers

I am having an issue dealing with Dual numbers inside of ranges. Specifically: using ForwardDiff: Dual t = Dual.((0.0,10.0),0) (t[1]:1/60:t[2])[end] The issue seems to be that [end] uses last, which then wants to compute the number of steps, so…
Chris Rackauckas
  • 18,645
  • 3
  • 50
  • 81
3
votes
2 answers

Automatic Differentiation with CoDiPack

The following code: #include ... codi::RealForward Gcodi[l]; for (int p = 0; p < l; p++) { ... double a = Gcodi[p]; } gives me the compilation error: nnBFAD.cpp: In function ‘void OptBF()’: nnBFAD.cpp:156:25: error: cannot…
3
votes
2 answers

Julia ReverseDiff: how to take a gradient w.r.t. only a subset of inputs?

In my data flow, I'm querying a small subset of a database, using those results to construct about a dozen arrays, and then, given some parameter values, computing a likelihood value. Then repeating for a subset of the database. I want to compute…
James
  • 630
  • 1
  • 6
  • 15