
15 Feb
2011
12:53 p.m.
This caught my eye. A couple of years ago I looked at MMX/SIMD and tested an example from the CodeGuru website which, out of the box, showed the MMX/SIMD version to be faster. I then unrolled the loop of the non-MMX/SIMD code using Duff's device and saw no difference in performance with or without MMX/SIMD. I'm just wondering: in your experience, is there a real benefit to MMX/SIMD, even discounting the alignment issue?
MMX is old tech. SSE2+ and AltiVec have a real effect on such operations.
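For anyone who wants to repeat the comparison with SSE2 rather than MMX, here is a rough sketch of the kind of pair I would time against each other. It is not the CodeGuru example; the function names `add_scalar` and `add_simd` and the choice of 32-bit integer addition are my own assumptions for illustration. The SSE2 path uses unaligned loads, so alignment is taken out of the picture, and it falls back to the plain loop when the compiler does not define `__SSE2__`.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

#if defined(__SSE2__)
#include <emmintrin.h>  /* SSE2 intrinsics */
#endif

/* Plain scalar loop. An optimizing compiler may vectorize this by itself,
   which is one reason a hand-written SIMD version can show no gain. */
void add_scalar(uint32_t *dst, const uint32_t *a, const uint32_t *b, size_t n)
{
    assert(n == 0 || (dst && a && b));
    for (size_t i = 0; i < n; i++)
        dst[i] = a[i] + b[i];
}

/* Hypothetical hand-written SSE2 version: four 32-bit lanes per iteration. */
void add_simd(uint32_t *dst, const uint32_t *a, const uint32_t *b, size_t n)
{
    assert(n == 0 || (dst && a && b));
    size_t i = 0;
#if defined(__SSE2__)
    for (; i + 4 <= n; i += 4) {
        /* Unaligned loads and stores, so no alignment requirement on the buffers. */
        __m128i va = _mm_loadu_si128((const __m128i *)(a + i));
        __m128i vb = _mm_loadu_si128((const __m128i *)(b + i));
        _mm_storeu_si128((__m128i *)(dst + i), _mm_add_epi32(va, vb));
    }
#endif
    /* Scalar tail (or the whole array when SSE2 is unavailable). */
    for (; i < n; i++)
        dst[i] = a[i] + b[i];
}
```

Both functions should produce identical results, so any timing difference comes from the instruction mix alone. For a simple add like this, memory bandwidth often dominates on large arrays, so I would expect the gap to be clearer on heavier per-element work.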