Neil Roberts f16fd0ddbb cogl: Use SSE2 when possible for premultiplying
This adds a fast path for premultiplying an RGBA image using SSE2
instructions. SSE registers are 128-bit and we need at least 16-bits
per component for the intermediate result of the multiplication so we
can do two pixels in parallel with one register. The function
interleaves 2 SSE registers to multiply 4 pixels in one function call
with the hope that this will pipeline better.

http://bugzilla.openedhand.com/show_bug.cgi?id=1939
Signed-off-by: Emmanuele Bassi <ebassi@linux.intel.com>
2010-02-09 11:03:21 +00:00
..
2009-11-19 13:56:47 +00:00
2010-02-04 21:10:02 +00:00
2009-12-02 11:37:13 +00:00
2010-01-12 17:10:15 +00:00
2010-02-01 10:40:34 +00:00
2010-01-05 18:02:29 +00:00