While profiling with Intel's VTune, I noticed that the
movsxd rax, edx
instruction is attributed a very large share of the execution time. I understand that this assembly code accesses both 32-bit and 64-bit registers, but is this instruction expected to take a long time to execute? Thanks!
Machine: Intel Cascade Lake (Core i9-10980XE)
--- Edit ----
Added a screenshot of the assembly to provide more context.
Block 1                             6,735,000,000
push rbx                              210,000,000
push rsi                               30,000,000
push r14                               30,000,000
sub rsp, 0x50                         120,000,000
mov r14, rcx
nop
lock inc dword ptr [rsp+0x78]
movsxd rax, edx                     5,985,000,000
nop
mov rbx, qword ptr [rcx+rax*8+0x8]              0
lea rsi, ptr [rcx+rax*8]              360,000,000
test rbx, rbx
jnz 0x14900fdff <Block 3>