
On 02/16/12 12:17, Joel Falcou wrote:
On 02/16/2012 06:40 PM, Larry Evans wrote:
Although it is probably slower if the array is unrotated, if rotation is needed, as in array_dyn.diff_pde.cpp, then it may be faster, since using something like nt2 or multi_array would, AFAICT, require copying the array to a new storage order.

[snip]

Storage order adaptation never triggers a copy in NT2; it changes how the for loop nest is generated around the computation, unless you do:
a = as_< storage_order_stuff>(b)
where the copy is, of course, explicit.
NT2's core primitive is a loop nest generator over which we have some degree of control from user land. [snip]
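
To make the "change the loop nest instead of copying" idea concrete, here is a minimal C++ sketch. It is not NT2 code and does not use any NT2 API: the names row_major, col_major, matrix, loop_nest and add_into are all hypothetical, invented for illustration only. It only shows how the loop order can be chosen from the destination's storage order, so that operands with different storage orders are combined in place without a temporary.

```cpp
#include <cstddef>
#include <vector>

// Hypothetical storage-order tags (illustrative names, not NT2's).
struct row_major {};
struct col_major {};

// A tiny 2-D container whose index-to-offset mapping depends on its tag.
template <class Order>
struct matrix {
    std::size_t rows, cols;
    std::vector<double> data;

    matrix(std::size_t r, std::size_t c) : rows(r), cols(c), data(r * c, 0.0) {}

    std::size_t offset(std::size_t i, std::size_t j, row_major) const { return i * cols + j; }
    std::size_t offset(std::size_t i, std::size_t j, col_major) const { return j * rows + i; }

    double& operator()(std::size_t i, std::size_t j) { return data[offset(i, j, Order())]; }
    double operator()(std::size_t i, std::size_t j) const { return data[offset(i, j, Order())]; }
};

// The "loop nest generator": the tag selects which index runs innermost,
// so the innermost loop walks the destination's contiguous dimension.
template <class F>
void loop_nest(std::size_t rows, std::size_t cols, row_major, F f) {
    for (std::size_t i = 0; i < rows; ++i)
        for (std::size_t j = 0; j < cols; ++j) f(i, j);
}

template <class F>
void loop_nest(std::size_t rows, std::size_t cols, col_major, F f) {
    for (std::size_t j = 0; j < cols; ++j)
        for (std::size_t i = 0; i < rows; ++i) f(i, j);
}

// dst = a + b, element by element. The operands may use any storage order;
// nothing is copied or re-laid out, only the traversal order changes.
template <class OD, class OA, class OB>
void add_into(matrix<OD>& dst, const matrix<OA>& a, const matrix<OB>& b) {
    loop_nest(dst.rows, dst.cols, OD(), [&](std::size_t i, std::size_t j) {
        dst(i, j) = a(i, j) + b(i, j);
    });
}
```

In this sketch, calling add_into with a col_major destination and a row_major operand reads the row-major data through its own offset mapping while iterating in column-major order. An explicit conversion such as the as_<...> call quoted above would be the only place a copy happens.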
Hi Joel,

I've just downloaded nt2 from here:

http://voxel.dl.sourceforge.net/project/nt2/nt2/nt2%202.0/nt2_2.0.tar.gz

Are there some examples of this "loop nest generator" that I could look at to get a better idea of what you mean?

TIA.

-regards,
Larry