Questions tagged [gradient-descent]

Gradient Descent is an algorithm for finding the minimum of a function. It iteratively computes the gradient (the vector of partial derivatives) of the function and takes steps proportional to the negative of that gradient. One major application of Gradient Descent is fitting a parameterized model to a set of data: the function to be minimized is the model's error function.
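For illustration, a minimal sketch of that idea in Python/NumPy, fitting a line to toy data by minimizing a squared-error function (the data, learning rate, and iteration count below are arbitrary choices, not part of the tag description):

    import numpy as np

    # Toy data: y is roughly 2*x + 1 plus noise.
    rng = np.random.default_rng(0)
    x = np.linspace(0, 1, 50)
    y = 2 * x + 1 + 0.1 * rng.standard_normal(50)

    w, b = 0.0, 0.0          # parameters of the model y_hat = w*x + b
    lr = 0.1                 # step size (learning rate)

    for _ in range(2000):
        y_hat = w * x + b
        # Partial derivatives of the mean squared error with respect to w and b.
        grad_w = 2 * np.mean((y_hat - y) * x)
        grad_b = 2 * np.mean(y_hat - y)
        # Step in the direction of the negative gradient.
        w -= lr * grad_w
        b -= lr * grad_b

    print(w, b)  # roughly 2 and 1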

Wiki:

Gradient descent is a first-order iterative optimization algorithm used to find the values of the parameters (coefficients) of a function (f) that minimize a cost function (cost).

To find a local minimum of a function using gradient descent, one takes steps proportional to the negative of the gradient (or of the approximate gradient) of the function at the current point.

Gradient descent is best used when the parameters cannot be calculated analytically (e.g. using linear algebra) and must be searched for by an optimization algorithm.

Gradient descent is also known as steepest descent, or the method of steepest descent.


Tag usage:

Questions with this tag should be about implementation and programming problems, not about the theoretical properties of the optimization algorithm. Consider whether your question might be better suited to Cross Validated, the Stack Exchange site for statistics, machine learning and data analysis.


1428 questions
4 votes, 0 answers

Proper asynchronous stochastic gradient descent with celery

I have to use celery to parallelize a stochastic gradient descent algorithm. Celery might not be the best choice for this, but that is still my question =) The algorithm looks like this, where datas is the matrix of samples: #Random…
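Not the asker's code, but one hedged way to split gradient work across celery workers is a synchronous parameter-averaging step (simpler than a truly asynchronous scheme); the broker URL, task name, and data layout below are assumptions:

    from celery import Celery, group
    import numpy as np

    # Hypothetical broker/backend; adjust to your setup.
    app = Celery("sgd", broker="redis://localhost:6379/0",
                 backend="redis://localhost:6379/0")

    @app.task
    def batch_gradient(w, batch):
        """Least-squares gradient for one data shard; w and batch are plain
        lists so they serialize cleanly through the broker."""
        w = np.asarray(w)
        X = np.asarray([row[:-1] for row in batch])
        y = np.asarray([row[-1] for row in batch])
        grad = 2 * X.T @ (X @ w - y) / len(y)
        return grad.tolist()

    def parallel_step(w, shards, lr=0.01):
        # Dispatch one gradient task per shard and average the results.
        job = group(batch_gradient.s(w.tolist(), shard) for shard in shards)
        grads = np.asarray(job.apply_async().get())
        return w - lr * grads.mean(axis=0)
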
4 votes, 1 answer

Gradient descent stochastic update - Stopping criterion and update rule - Machine Learning

My dataset has m features and n data points. Let w be a vector (to be estimated). I'm trying to implement gradient descent with the stochastic update method. My function to minimize is the least mean square. The update algorithm is shown below: for i = 1 ...…
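For reference, a hedged NumPy sketch of that kind of per-sample update with one common stopping criterion (the names X, y, lr, and the tolerance are illustrative, not taken from the question):

    import numpy as np

    def sgd_lms(X, y, lr=0.01, epochs=100, tol=1e-6):
        """Stochastic gradient descent for the per-sample loss 0.5*(w.x - y)^2."""
        n, m = X.shape
        w = np.zeros(m)
        for _ in range(epochs):
            w_old = w.copy()
            for i in np.random.permutation(n):
                err = X[i] @ w - y[i]
                w -= lr * err * X[i]          # per-sample gradient step
            # One common stopping criterion: small change in w over an epoch.
            if np.linalg.norm(w - w_old) < tol:
                break
        return w
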
3 votes, 1 answer

Does the choice of an activation function and initial weights have any bearing on whether a Neural Network gets stuck in a local minimum?

I posted this question yesterday asking if my Neural Network (which I'm training via backpropagation using stochastic gradient descent) was getting stuck in a local minimum. The following papers talk about the problem of local minima in an XOR…
3 votes, 1 answer

Optimizing with non-negative constraints

Consider the following functions import numpy as np import scipy.optimize as opt import math # Periodic indexation def pl(list, i): return list[i % len(list)] # Main function (index j) def RT(list, j, L): return…
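Not the question's exact functions, but non-negativity constraints can often be imposed by passing bounds to scipy.optimize.minimize; the objective below is a stand-in:

    import numpy as np
    from scipy import optimize

    def objective(x):
        # Stand-in objective; replace with the real function to be minimized.
        return np.sum((x - np.array([1.0, -2.0, 3.0])) ** 2)

    x0 = np.ones(3)
    # One (0, None) bound pair per variable enforces x >= 0.
    res = optimize.minimize(objective, x0, method="L-BFGS-B",
                            bounds=[(0, None)] * len(x0))
    print(res.x)   # the second coordinate is clipped at 0
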
3 votes, 1 answer

Autodiff implementation for gradient calculation

I have worked through some papers about the autodiff algorithm to implement it for myself (for learning purposes). I compared my algorithm in test cases to the output of tensorflow and their outputs did not match in most cases. Therefore I worked…
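One hedged way to get reference values for such a comparison is tf.GradientTape plus an independent finite-difference check; the test expression below is made up:

    import tensorflow as tf

    def f(x):
        return tf.sin(x) * x ** 2      # example test expression

    x = tf.Variable(2.0)
    with tf.GradientTape() as tape:
        y = f(x)
    ref = float(tape.gradient(y, x))   # TensorFlow's gradient

    # Central finite difference as an independent sanity check.
    h = 1e-5
    fd = (float(f(tf.constant(2.0 + h))) - float(f(tf.constant(2.0 - h)))) / (2 * h)
    print(ref, fd)                     # both should match your own autodiff output
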
3 votes, 1 answer

Logistic regression from scratch: error keeps increasing

I have implemented logistic regression from scratch; however, when I run the script the algorithm always predicts the wrong label. I've tried changing the training output and test_output by switching all 1 to 0 and vice versa, but it always predicts the…
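For comparison, a hedged NumPy sketch of batch gradient descent for logistic regression; a sign error in the update is a common cause of a rising loss (the names X, y, lr are illustrative):

    import numpy as np

    def sigmoid(z):
        return 1.0 / (1.0 + np.exp(-z))

    def fit_logistic(X, y, lr=0.1, epochs=1000):
        """Batch gradient descent on the cross-entropy loss.
        The update must subtract the gradient; adding it makes the error grow."""
        w = np.zeros(X.shape[1])
        b = 0.0
        for _ in range(epochs):
            p = sigmoid(X @ w + b)
            grad_w = X.T @ (p - y) / len(y)
            grad_b = np.mean(p - y)
            w -= lr * grad_w
            b -= lr * grad_b
        return w, b
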
3 votes, 1 answer

Understanding gradient computation using backward() in PyTorch

I'm trying to understand the basic pytorch autograd system: x = torch.tensor(10., requires_grad=True) print('tensor:',x) x.backward() print('gradient:',x.grad) output: tensor: tensor(10., requires_grad=True) gradient: tensor(1.) since x is a…
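As a hedged illustration of the same API on a non-trivial function (not the asker's code):

    import torch

    x = torch.tensor(10., requires_grad=True)
    y = x ** 2            # some function of x
    y.backward()          # populates x.grad with dy/dx
    print(x.grad)         # tensor(20.) because dy/dx = 2*x

    # In the question's snippet, backward() is called on x itself,
    # so the "function" is the identity and dx/dx = 1.
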
3 votes, 1 answer

How to find 3d point with minimum sum of euclidean distances to all given segments?

N segments in 3d space are given. Each segment is represented by 2 points. The problem is to find the point with the minimal possible sum of distances to all segments.
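One hedged way to attack this numerically is to minimize the summed point-to-segment distance with a general-purpose optimizer; the segments below are made up:

    import numpy as np
    from scipy import optimize

    def point_segment_distance(p, a, b):
        """Euclidean distance from point p to segment [a, b] in 3D."""
        ab = b - a
        t = np.clip(np.dot(p - a, ab) / np.dot(ab, ab), 0.0, 1.0)
        return np.linalg.norm(p - (a + t * ab))

    def total_distance(p, segments):
        return sum(point_segment_distance(p, a, b) for a, b in segments)

    # Example with made-up segments.
    segments = [(np.array([0., 0., 0.]), np.array([1., 0., 0.])),
                (np.array([0., 1., 0.]), np.array([0., 1., 1.]))]
    res = optimize.minimize(total_distance, x0=np.zeros(3), args=(segments,))
    print(res.x)
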
3 votes, 1 answer

How to plot gradient descent using plotly

I have been trying to replicate some work similar to this code below, but when I try to use the data from this link https://raw.githubusercontent.com/plotly/datasets/master/api_docs/mt_bruno_elevation.csv it's throwing some error. I think it's because…
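Not the asker's code, but a hedged sketch of plotting a surface together with a gradient-descent path using plotly.graph_objects (the surface and step size are arbitrary):

    import numpy as np
    import plotly.graph_objects as go

    # Simple bowl-shaped surface and a gradient-descent path on it.
    xs = np.linspace(-2, 2, 50)
    ys = np.linspace(-2, 2, 50)
    X, Y = np.meshgrid(xs, ys)
    Z = X ** 2 + Y ** 2

    path = [np.array([1.8, -1.5])]
    for _ in range(30):
        px, py = path[-1]
        path.append(path[-1] - 0.1 * np.array([2 * px, 2 * py]))
    path = np.array(path)

    fig = go.Figure(data=[
        go.Surface(x=X, y=Y, z=Z, opacity=0.7),
        go.Scatter3d(x=path[:, 0], y=path[:, 1],
                     z=path[:, 0] ** 2 + path[:, 1] ** 2,
                     mode="lines+markers"),
    ])
    fig.show()
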
3 votes, 0 answers

Can this nested for-loop be rewritten using tensorflow functions to allow for gradient calculation?

I wrote a function that sums only certain q-values from a tensor, those being the values corresponding to previous actions taken. I need this function to be auto-differentiable, but my current implementation uses a numpy array with nested for-loops,…
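A commonly used graph-friendly pattern for that kind of selection is one-hot masking, which keeps the operation differentiable; the tensors below are made-up stand-ins:

    import tensorflow as tf

    q_values = tf.constant([[0.1, 0.9, 0.3],
                            [0.5, 0.2, 0.7]])      # (batch, n_actions)
    actions = tf.constant([1, 2])                  # action taken per sample

    # One-hot masking stays inside the graph, so gradients flow through it.
    mask = tf.one_hot(actions, depth=q_values.shape[-1], dtype=q_values.dtype)
    chosen_q = tf.reduce_sum(q_values * mask, axis=1)   # [0.9, 0.7]
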
3 votes, 3 answers

SGDRegressor() constantly not increasing validation performance

The model fit of my SGDRegressor won't increase or decrease its performance on the validation set (test) after around 20'000 training records. Even if I try to switch penalty, early_stopping (True/False), or alpha, eta0 to extremely high or low levels,…
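One hedged thing to check in such cases is feature scaling, since SGD is very sensitive to feature scale; a sketch with a StandardScaler pipeline (the hyperparameters are placeholders, not a recommendation):

    from sklearn.pipeline import make_pipeline
    from sklearn.preprocessing import StandardScaler
    from sklearn.linear_model import SGDRegressor

    # Standardizing the inputs often matters more than tuning alpha or eta0.
    model = make_pipeline(
        StandardScaler(),
        SGDRegressor(penalty="l2", max_iter=1000, tol=1e-3),
    )
    # model.fit(X_train, y_train); model.score(X_val, y_val)
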
3 votes, 2 answers

Understanding Gradient Tape with mini batches

In the below example taken from the Keras documentation, I want to understand how grads is computed. Does the gradient grads correspond to the average gradient computed using the batch (x_batch_train, y_batch_train)? In other words, does the algorithm…
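For context, a hedged sketch of the usual setup: when the loss reduces with a mean over the batch, tape.gradient returns the batch-averaged gradient (the model and data below are made up):

    import tensorflow as tf

    model = tf.keras.Sequential([tf.keras.layers.Dense(1)])
    loss_fn = tf.keras.losses.MeanSquaredError()   # averages over the batch by default

    x_batch = tf.random.normal((32, 4))
    y_batch = tf.random.normal((32, 1))

    with tf.GradientTape() as tape:
        pred = model(x_batch, training=True)
        loss = loss_fn(y_batch, pred)              # mean over the 32 examples
    grads = tape.gradient(loss, model.trainable_variables)
    # Because the loss is a mean, these grads equal the per-example gradients
    # averaged over the mini-batch.
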
3 votes, 2 answers

How to write cost function formula from Andrew Ng assignment in Octave?

My implementation (see below) gives the scalar value 3.18, which is not the right answer. The value should be 0.693. Where does my code deviate from the equation? Here are the instructions to solve for the data to run the cost function method in…
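For reference, the same vectorized cross-entropy cost written in NumPy rather than Octave (variable names are assumptions):

    import numpy as np

    def sigmoid(z):
        return 1.0 / (1.0 + np.exp(-z))

    def cost(theta, X, y):
        m = len(y)
        h = sigmoid(X @ theta)
        # A frequent mistake is squaring (h - y) here; the logistic cost uses logs.
        return -(y @ np.log(h) + (1 - y) @ np.log(1 - h)) / m

    # With theta = 0, h = 0.5 everywhere and the cost is -log(0.5), about 0.693.
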
3 votes, 1 answer

What is the difference between clipnorm and clipval on Keras

What is the difference between clipnorm and clipval? Ex: opt = SGD(lr=0.01, momentum=0.9, clipnorm=1.0)
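A hedged illustration of the two options as they appear in the Keras optimizer API (the keyword is spelled clipvalue; the thresholds are arbitrary):

    from tensorflow.keras.optimizers import SGD

    # clipnorm rescales each gradient tensor whose L2 norm exceeds 1.0,
    # preserving its direction.
    opt_norm = SGD(learning_rate=0.01, momentum=0.9, clipnorm=1.0)

    # clipvalue clips every gradient component independently into [-0.5, 0.5],
    # which can change the gradient's direction.
    opt_value = SGD(learning_rate=0.01, momentum=0.9, clipvalue=0.5)
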
3 votes, 1 answer

Find minimum return value of function with two parameters

I have an error function and the sum of all errors over self.array: #'array' looks something like this [[x1,y1],[x2,y2],[x3,y3],...,[xn,yn]] #'distances' is an array with same length as array with different int values in it def calcError(self,n,X,Y):…
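Not the asker's calcError, but a hedged sketch of minimizing a two-parameter error over a list of [x, y] points with scipy.optimize.minimize (the stand-in error is made up):

    import numpy as np
    from scipy import optimize

    points = np.array([[0.0, 0.1], [1.0, 0.9], [2.0, 2.2]])   # made-up [[x, y], ...]

    def total_error(params):
        X, Y = params
        # Stand-in error: sum of distances from (X, Y) to every point.
        return np.sum(np.hypot(points[:, 0] - X, points[:, 1] - Y))

    res = optimize.minimize(total_error, x0=[0.0, 0.0])
    print(res.x, res.fun)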