Why there is no NEON intrisic to perform Signed saturating Rounding Doubling Multiply, like there is for signed 16-bit integers (vqrdmulhq_s16
) ? More generally, there are only a few intrisics to perform multiplication of 8-bit integers. Is there any particular reason behind the lack of these commands?
Right now, the only solution I can think of, is to split the int8x16 vector, and perform each int8x8 multiplication seperately, by casting int8x8 to int16x8.