
5 Sep
2011
5 Sep
'11
11:05 p.m.
Phil Endecott wrote:
I've tested this on two systems: (A): Marvell ARMv5TE ("Feroceon") @ 1.2 GHz, g++ 4.4 (B): Freescale ARMv7 (i.MX53, Coretex A8) @ 1.0 GHz, g++ 4.6.1
Compiled with: --std=c++0x -O4
What do the numbers look like if you compile with -march=native switch set? or if you're cross-compiling replace 'native' with correct gcc supported arch, To futher that, is there much of a difference when compiled with pgo? -fprofile-generate run -fprofile-use. btw I've found that in some situation O2 performs better than O3 or O4 - though pgo cleans up a lot of those inconsistencies at O3+ levels.