I need to reduce amount of branches in code. There is exist benchmark called median it have some code like:
if ( A < B )
return A = foo[i];
else
return B = foo[i];
I wrote a pattern in machine description file *.md to avoid branches:
(define_insn "smin<GPR:mode>3"
[
(set
(match_operand:GPR 0 "register_operand" "=r")
(if_then_else:GPR
(lt:GPR
(match_operand:GPR 1 "register_operand" " r")
(match_operand:GPR 2 "register_operand" " r"))
(match_dup 1)
(match_dup 2)))
]
""
"min\t%0,%1,%2"
[(set_attr "type" "move")
(set_attr "mode" "<MODE>")])
It works in case of simple comparison:
if ( A < B )
return A ;
else
return B;
GCC emit:
min a0,a0,a1 # 9 smindi3 [length = 4]
ret # 21 simple_return [length = 4]
But if i try same, but with indexed variable( array ): it won`t works:
if ( A < B )
return A = foo[i];
else
return B = foo[i];
GCC emit:
blt a0,a1,.L5 # 11 *branch_orderdi [length = 4]
sd a1,8(a2) # 18 *movdi_64bit/4 [length = 4]
mv a0,a1 # 8 *movdi_64bit/1 [length = 4]
ret # 34 simple_return [length = 4]
.L5:
sd a0,8(a2) # 13 *movdi_64bit/4 [length = 4]
ret # 28 simple_return [length = 4]
I need to GCC emit something like this:
min a0,a0,a1 # 9 smindi3 [length = 4]
sd a0,8(a2) # 18 *movdi_64bit/4 [length = 4]
ret # 34 simple_return [length = 4]
I appreciate any help.