I've built caffe from source (based on the BVLC installation guide). The problem is that in CPU mode it always uses single core, although I set OPENBLAS_NUM_THREADS
to different values.
On my CentOS box, I installed openblas-threads64
, openblas-serial64
, and openblas-openmp64
rpm packages and set BLAS := open
in Makefile.config
. Also, under /user/lib64/
I see both single and parallel shared library, but ldd
output on caffe binary shows that it loads libopenblas.so.0
and not libopenblasp.so.0
(presumably the parallel version).
I'm not sure if there's anything else that I am missing or it's a bug in caffe.