On 15 Sep 2014, at 18:00, boost-users-request@lists.boost.org wrote:
On Monday, September 15, 2014 07:31 PM, Tim Odenthal wrote:
Dear experts,
essentially, I use this type of reinterpret_cast for doubles, and it __seems__ to work fine. I compile-time check that the sizeof(boost::atomic<T>) == sizeof( T ), but is that enough to ensure that this will always work? Should/could I check alignment issues as well? Even if the "atomic_cast" function has a (small) run-time overhead, it would be very much worth it in my setting, where I actually want to safely add to a double at some location in memory. Obviously, I want to use boost::atomic to be system/compiler agnostic, since it's supposed to eventually run with clang, gcc, (linux, OS X) as well as MS compilers, and so far we didn't see the need to move to C++11...
I'm not sure I can figure out what problem you're trying to solve.
Well, essentially, I have an array of doubles that at exactly one point in the code needs to be added to where I cannot ensure that each thread only adds to its own part of the array, so I want to ensure that each addition there is executed atomically.
I think the cast is probably "safe", but far less can be said of it when
it's dereferenced. That would depend entirely on the type of var.
Is *var actually a boost::atomic<T> in the first place? If it is, why
are you erasing its type?
No, as I was saying, it’s a double *.
How can you be sure there is nobody else aliasing var or *var and then
reading or writing it?
That is actually a problem, but since I completely control that part of the code, I hope to not make that mistake ;-).
If *var is not a boost::atomic<T>, reinterpret_cast isn't going to
magically convert it, you have undefined behaviour. Of course, one
effect of UB is that it __seems__ to work, until, presumably, it doesn’t.
But what, then, is the difference between an atomic<double>* and a normal double*, if the sizes (and possibly alignments) are the same?
Hmmm, I just read up on what a reinterpret_cast<> is supposed to be good for, which is very little indeed, so it seems that the behaviour is undefined - but then there is no way to solve that problem? (So far I only ever used it to print the address of a pointer...)Since openMP allows for a pragma atomic, somehow, it seems that there should be a way. Using a simple spin_lock is way too slow (tested), since it doesn’t lock a specific memory location, but essentially everything, and I suspect that more complicated locking schemes will be also more expensive, just for different reasons.