correct usage of scipy.optimize.fmin_bfgs

Question

I am playing around with logistic regression in Python. I have implemented a version where the minimization of the cost function is done via gradient descent, and now I'd like to use the BFGS algorithm from scipy (scipy.optimize.fmin_bfgs).

I have a set of data (features in matrix X, with one sample in every row of X, and correpsonding labels in vertical vector y). I am trying to find parameters Theta to minimize:

enter image description here

I have trouble understanding how fmin_bfgs works exactly. As far as I get it, I have to pass a function to be minimized and a set of initial values for Thetas.

I do the following:

initial_values = numpy.zeros((len(X[0]), 1))
myargs = (X, y)
theta = scipy.optimize.fmin_bfgs(computeCost, x0=initial_values, args=myargs)

where computeCost calculates J(Thetas) as illustrated above. But I get some index-related errors, so I think I am not supplying what fmin_bfgs expects.

Can anyone shed some light on this?

Ah, [rubber duck debugging](http://en.wikipedia.org/wiki/Rubber_duck_debugging) =) — Katriel, Apr 21 '12 at 11:04

score 5 · Accepted Answer · answered Jun 13 '12 at 10:26

5

After wasting hours on it, solved again by power of posting...I was defining computeCost(X, y, Thetas), but as Thetas is the target parameter for optimization, it should have been the first parameter in the signature. Fixed and works!

answered Jun 13 '12 at 10:26

ACEG

1,981
6
33
61

score 0 · Answer 2 · answered Jun 06 '12 at 09:58

0

i don't know your whole code, but have you tried

initial_values = numpy.zeros(len(X[0]))

? This x0 should be a 1d vector, i think.

answered Jun 06 '12 at 09:58

Harald Schilly

1,098
1
14
15

It's a 1D vector, because the second parameter of the shape in numpy.zeros() is 1. Thanks for taking the time to suggest it, though! – ACEG Jun 08 '12 at 06:46

correct usage of scipy.optimize.fmin_bfgs

2 Answers2