Use this tag to ask questions about Intel Advisor, a modeling and analysis tool for efficient vectorization, memory use, threading, and GPU offloading. It automates the Roofline performance model for CPUs and GPUs. The product is available on Windows and Linux as a standalone GUI tool, a Microsoft Visual Studio plug-in, or a command-line interface, and as a viewer on macOS.
Questions tagged [intel-advisor]
17 questions
5
votes
1 answer
Inefficient memory access pattern and irregular stride accesses
I'm trying to optimize this function:
bool interpolate(const Mat &im, float ofsx, float ofsy, float a11, float a12, float a21, float a22, Mat &res)
{
bool ret = false;
// input size (-1 for the safe bilinear interpolation)
const…

cplusplusuberalles
- 199
- 1
- 12
2
votes
2 answers
How to use Intel Advisor to profile my parallel MPI application?
I am working on a remote Linux server where my application runs in parallel with MPI. I want to profile it to see how well the load is balanced across the MPI processes and which parts of the code are the heaviest.
To run my application in…

Andreas Hadjigeorgiou
- 309
- 3
- 10
2
votes
2 answers
What is this Read after Write dependency?
I have this loop in this function:
Mat HessianDetector::hessianResponse(const Mat &inputImage, float norm)
{
//...
const float *in = inputImage.ptr(1);
Mat outputImage(rows, cols, CV_32FC1);
float *out =…

justHelloWorld
- 6,478
- 8
- 58
- 138
2
votes
1 answer
Roofline model - how to calculate flop/byte ratio?
I would like to create a Roofline model, and I have a problem with the algorithm's flop-per-byte ratio. Can you explain how to calculate it? The algorithm does its computation using a 5-point stencil.
Here's the algorithm:
for(int i=1; i

JudgeDeath
- 151
- 1
- 2
- 9
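For readers unfamiliar with the flop/byte ratio this question asks about, here is a minimal sketch of how it could be bounded for a 5-point stencil. The excerpt's loop is truncated, so the operation count is an assumption: 4 additions plus 1 multiplication per grid point on 4-byte floats. `arithmetic_intensity` and the constants are hypothetical names for illustration, not part of Intel Advisor.

```cpp
#include <cstddef>

// Hypothetical helper (not an Intel Advisor API): arithmetic intensity in
// FLOP per byte, from per-grid-point FLOP and memory-traffic counts.
constexpr double arithmetic_intensity(std::size_t flops_per_point,
                                      std::size_t bytes_per_point) {
    return static_cast<double>(flops_per_point) /
           static_cast<double>(bytes_per_point);
}

// ASSUMPTION: each output point is a scaled sum of 5 neighbours,
// i.e. 4 additions + 1 multiplication = 5 FLOP per point.
constexpr std::size_t kFlopsPerPoint = 5;
// ASSUMPTION: single-precision data, 4 bytes per value.
constexpr std::size_t kBytesPerValue = 4;

// Upper bound: ideal cache reuse, so each point costs 1 load + 1 store.
constexpr double kIntensityCached =
    arithmetic_intensity(kFlopsPerPoint, 2 * kBytesPerValue);

// Lower bound: no reuse at all, so each point costs 5 loads + 1 store.
constexpr double kIntensityUncached =
    arithmetic_intensity(kFlopsPerPoint, 6 * kBytesPerValue);

static_assert(kIntensityCached == 0.625, "5 FLOP / 8 bytes");
```

Under these assumptions the ratio lies between 5/24 (about 0.21) and 5/8 (0.625) FLOP/byte; the value a real run achieves depends on how much of the stencil's neighbourhood stays in cache.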
1
vote
0 answers
Intel Advisor's bandwidth information
While using Intel Advisor's Roofline analysis view, we are presented with data-bandwidth information for the different data paths of the system, i.e. DRAM and the L3, L2, and L1 caches. The program claims that it measures the bandwidths on the provided hardware…

Nitin Malapally
- 534
- 2
- 10
1
vote
1 answer
Intel Advisor: Inspect method including all submethods
Using Intel Advisor and the Roofline model, I would like to assess the performance of a certain function. This function uses the Eigen library for matrix operations, where most of the work is done.
In the output I can see my function with a…

carlosvalderrama
- 465
- 1
- 6
- 22
1
vote
1 answer
Report FLOPs with Intel Advisor XE
I am using Intel Advisor 2018 (build 523188) on Linux CentOS 7.4 to profile a collection of benchmarks (I want to plot them all in a single Roofline plot), and I am using the command-line tool advixe-cl to collect the survey, tripcounts and flops…

K. Iliakis
- 13
- 5
1
vote
1 answer
Ineffective "Peel/Remainder" Loop in my code
I have this function:
bool interpolate(const Mat &im, float ofsx, float ofsy, float a11, float a12, float a21, float a22, Mat &res)
{
bool ret = false;
// input size (-1 for the safe bilinear interpolation)
const int width =…

justHelloWorld
- 6,478
- 8
- 58
- 138
1
vote
1 answer
Where should I put ANNOTATE_ITERATION_TASK?
I'm using Intel Advisor to analyze my parallel application. I have this code, which is the main loop of my program and where most of the time is spent:
for(size_t i=0; i

justHelloWorld
- 6,478
- 8
- 58
- 138
1
vote
1 answer
What is the correct way to use Intel Advisor on a remote machine?
Intel VTune Amplifier has the possibility to profile a parallel application executed on a remote machine.
Intel Advisor doesn't have such an option. According to this document, you have to use the command-line version of Intel Advisor:
This makes…

justHelloWorld
- 6,478
- 8
- 58
- 138
1
vote
1 answer
Intel Advisor optimal flags and settings
I'm reading this tutorial about code vectorization using Intel Advisor. In particular, on this page they suggest:
Build the target sample application in release mode ... compiler options: -O2 -g
And following:
To build your own applications to…

justHelloWorld
- 6,478
- 8
- 58
- 138
0
votes
1 answer
Intel Inspector: "Source file not found. Suggestion: Specify locations in a Project Properties dialog box search tab."
I'm building a small application for my PhD research. As the application is showing some undefined behavior, I decided to use the Intel Inspector standalone GUI to locate memory problems, and it managed to find a considerable number of problems as…

Lgcos
- 91
- 7
0
votes
1 answer
Intel Advisor beta offloading analysis: No execution count
I am trying to use the Intel oneAPI Advisor beta to do a GPU offloading analysis (via analyze.py and collect.py). My problem is that all non-offloaded regions show Cannot be modelled: No Execution Count.
Furthermore I get the warning
advixe:…

lm1909
- 35
- 7
0
votes
1 answer
Intel Advisor - view results from Linux cloud on local Windows GUI
With Intel Advisor, I ran the following on Linux in the cloud and downloaded the result folders to my local Windows machine:
advixe-cl --collect=survey ./My_Program.exe
advixe-cl --collect=map ./My_Program.exe
advixe-cl --collect=dependencies…

RTC222
- 2,025
- 1
- 20
- 53
0
votes
0 answers
Theoretical Scalar Integer Performance KabyLake
I was doing some experiments with Intel Advisor 2020 and in particular with the roofline model. Something I can't quite understand is why the peak scalar integer performance (intop/cycle) is different than the theoretical one that I would expect…

Tommy95
- 183
- 2
- 12