
22 Jan
2009
22 Jan
'09
9:12 a.m.
Well, the whole discussion is about "optimal." If one doesn't care about "optimal" then a compiler will do just fine all the time and there's no need for a DSEL, asm or ugly gcc intrinsics.
What if we replace optimal with optimized?
Surely library code than gives a 4x speedup is desirable to have even if you can hand generate code that gives you a 5x. Getting a 4x speedup over naive simd-less in simple vector operations and still being able to concentrate on the problem at hand instead of low level optimization details sounds fantastic to me.
Indeed, and like I keep saying: show us the code that produces the speedup and we can all stop arguing and start rejoycing :-) John.