Questions tagged [flops]

FLOPS (FLoating point Operations Per Second): a unit of measurement used to quantify the performance of the implementation of a numerical algorithm.

See Wikipedia page on FLOPS.
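
As a rough illustration of the definition above, a FLOPS figure is a count of floating point operations divided by the time taken to execute them. The sketch below is only an assumption-laden example: the helper names `flops_rate` and `measure_multiply_add_rate` are hypothetical, and the count of two operations per loop iteration is a manual estimate, not something measured by hardware counters.

```python
import time


def flops_rate(operation_count: int, elapsed_seconds: float) -> float:
    """Return floating point operations per second for a measured run."""
    if elapsed_seconds <= 0:
        raise ValueError("elapsed_seconds must be positive")
    return operation_count / elapsed_seconds


def measure_multiply_add_rate(iterations: int = 1_000_000) -> float:
    """Time a loop of multiply-add steps and report the achieved rate.

    Assumption: each iteration is counted as 2 floating point operations
    (one multiply, one add). In CPython the result mostly reflects
    interpreter overhead, not the hardware's peak throughput.
    """
    x = 0.0
    start = time.perf_counter()
    for _ in range(iterations):
        x = x * 0.5 + 1.0  # 1 multiply + 1 add
    elapsed = time.perf_counter() - start
    return flops_rate(2 * iterations, elapsed)


if __name__ == "__main__":
    print(f"{measure_multiply_add_rate() / 1e6:.1f} MFLOPS (interpreted loop)")
```

The number printed will vary from machine to machine and run to run; comparing it against a vendor's peak figure would need compiled, vectorized code and hardware counters.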

132 questions
6
votes
4 answers

Counting Flops for a code!

This is really taking my time. I could not find a simple way to estimate FLOPS for the following code (the loop). How many FLOPs are there in a single iteration of the loop: float func(float * atominfo, float energygridItem, int xindex, int yindex) { …
usman
  • 1,285
  • 1
  • 14
  • 25
6
votes
2 answers

Compiler Optimizations effect on FLOPs and L2/L3 Cache Miss Rate using PAPI

So we've been tasked with an assignment to compile some code (we're supposed to treat it as a black box), using different Intel compiler optimization flags (-O1 and -O3) as well as vectorization flags (-xhost and -no-vec) and to observe changes…
kfkhalili
  • 996
  • 1
  • 11
  • 24
6
votes
1 answer

Python FLOPS calculation

I've been trying to get a standardized estimate of FLOPS across all of the computers that I've implemented a Python distributed processing program on. While I currently can calculate pystones quite fine, pystones are not particularly well known, and…
Doc Sohl
  • 165
  • 1
  • 1
  • 10
5
votes
2 answers

Measuring FLOPs of an application with the linux perf tool

I want to measure the amount of floating point and arithmetic operations executed by some application with 'perf', the new command line interface to the Linux performance counter subsystem. (For testing purposes I use a simple dummy app…
5
votes
1 answer

Understanding FMA instructions performance

I'm trying to understand how I can max out the number of operations I can get on my CPU. I'm doing a simple matrix multiplication program, and I have a Skylake processor. I was looking at the Wikipedia page for the FLOPS information on this…
Peter L.
  • 157
  • 1
  • 1
  • 6
5
votes
1 answer

For XMM/YMM FP operation on Intel Haswell, can FMA be used in place of ADD?

This question is for packed, single-precision floating point ops with XMM/YMM registers on Haswell. So according to the awesome, awesome table put together by Agner Fog, I know that MUL can be done on either port p0 or p1 (with reciprocal throughput of 0.5), while…
codechimp
  • 1,509
  • 1
  • 14
  • 21
4
votes
2 answers

Why is TensorFlow's FLOPs count 2 times Caffe's MACC?

I'm trying to rewrite a model from Caffe to TensorFlow. To make sure I did not make a mistake, I counted the MACC and FLOPs, and then I found this interesting thing: for example, when the input is a 112x112x3 image, and conv2d is applied to it with 32 3x3 kernels, stride=1,…
MarStarck
  • 433
  • 7
  • 14
4
votes
1 answer

Gigaflops of a processor

I discovered my computer has NVIDIA CUDA technology and I want to measure its processing power, on both CPU and GPU. Instead of searching for a program to do this, I want to have a deeper understanding of how it works. What kind of code (C/C++) do I need?
rigon
  • 1,310
  • 4
  • 15
  • 37
4
votes
9 answers

FLOPS what really is a FLOP

I came from this thread: "FLOPS Intel core and testing it with C (innerproduct)". As I began writing simple test scripts, a few questions came into my mind. Why floating point? What is so significant about floating point that we have to consider? Why…
user185732
3
votes
1 answer

Mathematical flop count of column based back substitution function ( Julia )

I am new to linear algebra and learning about triangular systems implemented in Julia. I have a col_bs() function, which I will show here, that I need to do a mathematical flop count of. It doesn't have to be super technical; this is for learning…
3
votes
1 answer

On GPU, is it possible to get more flops by combining double and float operations?

Suppose a GPU can do N1 single precision operations per second, and N2 double precision operations per second. Is it possible, by mixing (independent) single and double precision operations, to achieve N1+N2 total operations per second, or at least…
nat chouf
  • 736
  • 5
  • 10
3
votes
0 answers

Programmatic way of counting floating point operations (Java)

I'm looking for a programmatic way of counting the number of floating point operations (flops) in a call to a function, in Java. There are several closely related questions, asking about what floating points are, and how to do big-O computational…
kabdulla
  • 5,199
  • 3
  • 17
  • 30
3
votes
1 answer

How to calculate GFLOPs for a function in a C++ program?

I have C++ code which calculates the factorial of an int, the addition of floats, and the execution time of each function, as follows: long Sample_C:: factorial(int n) { int counter; long fact = 1; for (int counter = 1; counter <= n;…
anamika email
  • 327
  • 9
  • 21
3
votes
1 answer

Estimating the efficiency of GPU in FLOPS (CUDA SAMPLES)

It seems to me that I don't completely understand the concept of FLOPS. In the CUDA samples, there is a matrix multiplication example (0_Simple/matrixMul). In this example the number of FLOPs (operations with floating point) per matrix multiplication…
Mikhail Genkin
  • 123
  • 1
  • 6
3
votes
1 answer

GPU FLOPS and FPS

I am modelling a GPU (cannot disclose which) for estimating the performance of OpenCL and OpenGL applications. The model can reasonably estimate the FLOPS of the executing app/kernel/code. Is there a way to estimate the frames per second from the…
Umair
  • 68
  • 1
  • 6