I ported the sgemm function from weblas to C/C++ using OpenGL ES 3.0 (I call it esgemm). I have an application that uses this sgemm function, but I find that it is very slow on Android. Here are my test results.
PC (Core™ i7-6700 CPU @ 3.40GHz × 8, HD Graphics 530 (Skylake GT2)):
openBlas 52.91ms
esgemm 67.7ms
CL blas 68.88ms
Android S6 (Exynos 7420, 2.1GHz × 4 + 1.5GHz × 4, Mali-T760):
openBlas 62.96ms
esgemm 1027.59ms
CL blas 1054.55ms
In the results above, CL blas is my OpenCL version of sgemm (ported from clMathLibraries). Its performance is about the same as the OpenGL ES version.
Has anyone done similar work? Why is it so slow on the S6? Are mobile GPUs unable to handle this kind of computation?
The code is uploaded to GitHub: esgemm