Efficient "[][]" operator overloading (encapsulating a MultiArray)
data:image/s3,"s3://crabby-images/91406/91406bdb1e05915fe5bc3ce778a27e0230a5264e" alt=""
Hello,
I need to encapsulate a 3D MultiArray inside another class "GridArray".
The reason is that GridArray will have another implementation not using
MultiArray and it seems much easier to provide an abstration over
MultiArray, than to reimplement its interface for the other implementation.
I need to keep the [][][] operator access to the structure. I created a
class like this, but it is slow:
template<class T> class GridArray
data:image/s3,"s3://crabby-images/4782d/4782d3994261d04366069f7f5b7a7d737d904c87" alt=""
On 09-04-2013 16:36, Jiří Vyskočil wrote:
Do you have any idea what might be going on? Is there a better way to write an interface for successive [] operators?
Take a look at STLSoft's fixed_array_3d<> -Thorsten
data:image/s3,"s3://crabby-images/3cdde/3cdde99a33dd10faf821fade4b762c93ab4a4310" alt=""
On 09/04/13 16:36, Jiří Vyskočil wrote:
Hello,
I need to encapsulate a 3D MultiArray inside another class "GridArray". The reason is that GridArray will have another implementation not using MultiArray and it seems much easier to provide an abstration over MultiArray, than to reimplement its interface for the other implementation.
I need to keep the [][][] operator access to the structure. I created a class like this, but it is slow:
template<class T> class GridArray
{ public: GridArray(int x, int y, int z): _array(boost::extents[x][y][z]) { } boost::multi_array _array; struct SliceRef { struct ColRef { SliceRef &_parent; int _x, _y; ColRef(SliceRef &p, const int x, const int y): _parent(p), _x(x), _y(y) {} T& operator[](const int z) { return _parent._parent._array[_x][_y][z]; } }; GridArray &_parent; int _x; SliceRef(GridArray &p, const int x): _parent(p), _x(x) {} ColRef operator[](const int y) { return ColRef(*this,_x,y); } }; SliceRef operator[](const int x){ return SliceRef(*this,x); } }; I am unable to write a minimal test case - if I compare using multi_array directly vs. using this GridArray class in a simple test program (compiled with -O2), the performance is the same. But the real code shows about 20% slowdown - it's a numerical simulaiton, I create several arrays at the beginning and then do computations with them. The rest of the code hasn't been touched - I only chanded the constructors for the arrays and replaced a header which contained some typedefs with a header containing this class.
Do you have any idea what might be going on? Is there a better way to write an interface for successive [] operators?
Remember that those are references, so you're actually doing
*(_parent->_parent->array)[_x][_y][z].
Some compilers might not be able to avoid the redundant loads here.
You could put SliceRef by value in ColRef for starters. You can also try
to put GridArray
data:image/s3,"s3://crabby-images/fe4e4/fe4e46508dfc52e91d748c32afb36132f625d1b7" alt=""
Jiří- On 16:36 Tue 09 Apr , Jiří Vyskočil wrote:
I am unable to write a minimal test case - if I compare using multi_array directly vs. using this GridArray class in a simple test program (compiled with -O2), the performance is the same. But the real code shows about 20% slowdown - it's a numerical simulaiton, I create several arrays at the beginning and then do computations with them. The rest of the code hasn't been touched - I only chanded the constructors for the arrays and replaced a header which contained some typedefs with a header containing this class.
Do you have any idea what might be going on? Is there a better way to write an interface for successive [] operators?
I suspect that the compiler isn't able to optimize certain accesses in the GridArray case. Could you give us some more details on the numerical kernel you're running? What kind of operation do you run on the members? Is it some kind of stencil code where you update each member depending on its neighbors? I'm asking this as a compiler will typically try to avoid the potentially costly repetitive address computations and will cache the results in registers. As compilers are easily confused (e.g. via possible aliasing), and registers are scarce, even seemingly inconspicuous code changes may lead to performance degradations. Best -Andreas -- ========================================================== Andreas Schäfer HPC and Grid Computing Chair of Computer Science 3 Friedrich-Alexander-Universität Erlangen-Nürnberg, Germany +49 9131 85-27910 PGP/GPG key via keyserver http://www.libgeodecomp.org ========================================================== (\___/) (+'.'+) (")_(") This is Bunny. Copy and paste Bunny into your signature to help him gain world domination!
participants (4)
-
Andreas Schäfer
-
Jiří Vyskočil
-
Mathias Gaunard
-
Thorsten Ottosen