I'm wondering whether it is possible to do the following calculation on four values in parallel within an MMX register:
(a*b)/256
where a is a signed 16-bit word and b is an unsigned value (the blend factor) in the range 0 to 256.
I think my problem is that I'm not sure how (or whether) pmullw and pmulhw can help me with this task.
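
To make the question concrete, here is a minimal scalar sketch in C of what the two instructions compute per 16-bit lane and how their results might be recombined. The helper names (pmullw_lane, pmulhw_lane, blend_lane, blend4) are made up for illustration and are not real intrinsics. It assumes b always stays in 0..256, so it fits in a signed word and the signed high multiply is usable. Note that the recombination behaves like an arithmetic shift right by 8, so negative products round toward negative infinity rather than toward zero as C's / would.

```c
#include <assert.h>
#include <stdint.h>

/* Low 16 bits of the signed 16x16 product: what PMULLW leaves in a lane. */
static uint16_t pmullw_lane(int16_t a, int16_t b) {
    return (uint16_t)((int32_t)a * b);
}

/* High 16 bits of the signed 16x16 product: what PMULHW leaves in a lane. */
static uint16_t pmulhw_lane(int16_t a, int16_t b) {
    return (uint16_t)((uint32_t)((int32_t)a * b) >> 16);
}

/* (a*b) >> 8 for one lane, built only from the two 16-bit halves.
   Mirrors: psrlw lo,8 / psllw hi,8 / por. */
static int16_t blend_lane(int16_t a, int16_t b) {
    assert(b >= 0 && b <= 256);            /* blend factor range from the question */
    uint16_t lo = pmullw_lane(a, b);
    uint16_t hi = pmulhw_lane(a, b);
    uint16_t r  = (uint16_t)((hi << 8) | (lo >> 8));
    /* Reinterpret the 16-bit pattern as signed without relying on
       implementation-defined narrowing. */
    return (r >= 0x8000) ? (int16_t)((int32_t)r - 0x10000) : (int16_t)r;
}

/* Four lanes at once, standing in for one 64-bit MMX register. */
static void blend4(const int16_t a[4], const int16_t b[4], int16_t out[4]) {
    for (int i = 0; i < 4; ++i) {
        out[i] = blend_lane(a[i], b[i]);
    }
}
```

If this model is right, the MMX version would be pmullw and pmulhw on copies of the same inputs, then psrlw by 8 on the low half, psllw by 8 on the high half, and por to merge them. The result always fits in a word because |a*b/256| never exceeds |a| when b is at most 256.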