run-time integer log2
data:image/s3,"s3://crabby-images/a2463/a2463ae2178ce928dcea66a07f1c68a1e57044e0" alt=""
hi all, i am aware, that boost provides a compile-time integer log2 function. but what is the best way to compile an integer log2 at run-time? thanks, tim -- tim@klingt.org http://tim.klingt.org Silence is only frightening to people who are compulsively verbalizing. William S. Burroughs
data:image/s3,"s3://crabby-images/89590/89590d82cbe9918869d50d06c72e3afad4a31b25" alt=""
Tim Blechmann wrote:
hi all,
i am aware, that boost provides a compile-time integer log2 function. but what is the best way to compile an integer log2 at run-time?
In the good'ol Bits Twiddling Hacks page: unsigned int v; // 32-bit value to find the log2 of unsigned int r; // result of log2(v) will go here unsigned int shift; r = (v > 0xFFFF) << 4; v >>= r; shift = (v > 0xFF ) << 3; v >>= shift; r |= shift; shift = (v > 0xF ) << 2; v >>= shift; r |= shift; shift = (v > 0x3 ) << 1; v >>= shift; r |= shift; r |= (v >> 1); We use a similar stuff in NT². I can be obviously fitted back to 16 and 8 bits integer for more speed.
data:image/s3,"s3://crabby-images/a2463/a2463ae2178ce928dcea66a07f1c68a1e57044e0" alt=""
hi all,
i am aware, that boost provides a compile-time integer log2 function. but what is the best way to compile an integer log2 at run-time?
In the good'ol Bits Twiddling Hacks page:
i see, maybe it would be an interesting addition to boost.integer to provide a generic and efficient version, based on bsr/ffs or the like ... tim -- tim@klingt.org http://tim.klingt.org May music never just become another way of making money. Keith Tippett
data:image/s3,"s3://crabby-images/89590/89590d82cbe9918869d50d06c72e3afad4a31b25" alt=""
Tim Blechmann wrote:
i see, maybe it would be an interesting addition to boost.integer to provide a generic and efficient version, based on bsr/ffs or the like ...
There is a lot of branchless or SWAR algorithm to salvaged from there and to be indeed made generic or at least platform independant w/r to various things. Other thign we never did is benching all of the versions of a given algorithm (with/without branch) on different platform to add a proper #ifdef/else to maximize performance.
data:image/s3,"s3://crabby-images/becfa/becfa4a02a6b5ded9b14e03841b473e0ef80f048" alt=""
Joel Falcou-3 wrote:
Tim Blechmann wrote:
i see, maybe it would be an interesting addition to boost.integer to provide a generic and efficient version, based on bsr/ffs or the like ...
There is a lot of branchless or SWAR algorithm to salvaged from there and to be indeed made generic or at least platform independant w/r to various things. Other thign we never did is benching all of the versions of a given algorithm (with/without branch) on different platform to add a proper #ifdef/else to maximize performance. _______________________________________________ Boost-users mailing list Boost-users@lists.boost.org http://lists.boost.org/mailman/listinfo.cgi/boost-users
Hi, there is a GSOC project "Bits and Ints" that is working on a lot off this kind of branch less algorithms. I can suggest him to include the integer runtime variant of some math functions, as log2. Anyway, on the meantime, for the run-time you can use the cmath header working on floats and then convert to an integer if this is what you need. Best, Vicente -- View this message in context: http://old.nabble.com/run-time-integer-log2-tp29149060p29150418.html Sent from the Boost - Users mailing list archive at Nabble.com.
data:image/s3,"s3://crabby-images/a2463/a2463ae2178ce928dcea66a07f1c68a1e57044e0" alt=""
there is a GSOC project "Bits and Ints" that is working on a lot off this kind of branch less algorithms. I can suggest him to include the integer runtime variant of some math functions, as log2.
that would be nice ... it would be even nicer, if you could suggest to make use of hardware instructions ... i am currently using this gcc-centric code [1] ... tim [1] http://tim.klingt.org/git?p=boost_heap.git;a=blob;f=boost/heap/detail/ilog2.... -- tim@klingt.org http://tim.klingt.org After one look at this planet any visitor from outer space would say "I want to see the manager." William S. Burroughs
data:image/s3,"s3://crabby-images/becfa/becfa4a02a6b5ded9b14e03841b473e0ef80f048" alt=""
Bugzilla from tim@klingt.org wrote:
there is a GSOC project "Bits and Ints" that is working on a lot off this kind of branch less algorithms. I can suggest him to include the integer runtime variant of some math functions, as log2.
that would be nice ... it would be even nicer, if you could suggest to make use of hardware instructions ... i am currently using this gcc-centric code [1] ...
tim
[1] http://tim.klingt.org/git?p=boost_heap.git;a=blob;f=boost/heap/detail/ilog2....
The GSOC project will make first a portable implementation. Once the portable implementation is finished, other architecture dependent implementation could be benchmarked and provided if they give better performances. Best, Vicente -- View this message in context: http://old.nabble.com/run-time-integer-log2-tp29149060p29151703.html Sent from the Boost - Users mailing list archive at Nabble.com.
data:image/s3,"s3://crabby-images/9360f/9360f2f2295224b5c065940b794f5c016ef6151a" alt=""
What is "NT²" I've used an inline asm instruction for that. On the x86, there is a "Bit Scan Reverse" (BSR) which stores the index of the most-significant set bit.
-----Original Message----- In the good'ol Bits Twiddling Hacks page:
unsigned int v; // 32-bit value to find the log2 of unsigned int r; // result of log2(v) will go here unsigned int shift;
r = (v > 0xFFFF) << 4; v >>= r; shift = (v > 0xFF ) << 3; v >>= shift; r |= shift; shift = (v > 0xF ) << 2; v >>= shift; r |= shift; shift = (v > 0x3 ) << 1; v >>= shift; r |= shift;
TradeStation Group, Inc. is a publicly-traded holding company (NASDAQ GS: TRAD) of three operating subsidiaries, TradeStation Securities, Inc. (Member NYSE, FINRA, SIPC and NFA), TradeStation Technologies, Inc., a trading software and subscription company, and TradeStation Europe Limited, a United Kingdom, FSA-authorized introducing brokerage firm. None of these companies provides trading or investment advice, recommendations or endorsements of any kind. The information transmitted is intended only for the person or entity to which it is addressed and may contain confidential and/or privileged material. Any review, retransmission, dissemination or other use of, or taking of any action in reliance upon, this information by persons or entities other than the intended recipient is prohibited. If you received this in error, please contact the sender and delete the material from any computer.
data:image/s3,"s3://crabby-images/614d5/614d524e16d0f404c36f1238c62a011841d7af72" alt=""
G'day all.
Quoting joel falcou
In the good'ol Bits Twiddling Hacks page:
Many modern ISAs (at least EM64T, PPC and recent ARMs; these are just the ones I know about) have a single-instruction priority encoder usually called "count leading zeroes". In GCC, it's available as a builtin: uint64_t log2v = v == 1 ? 0 : 64 - __builtin_clzll(v - 1); Note that this is branch free if your compiler is feeling speculative and has a conditional move or skip instruction available. If not, it's an easy branch to predict. Andrew Bromage
data:image/s3,"s3://crabby-images/89590/89590d82cbe9918869d50d06c72e3afad4a31b25" alt=""
ajb@spamcop.net wrote:
Many modern ISAs (at least EM64T, PPC and recent ARMs; these are just the ones I know about) have a single-instruction priority encoder usually called "count leading zeroes". In GCC, it's available as a builtin:
uint64_t log2v = v == 1 ? 0 : 64 - __builtin_clzll(v - 1);
Guess my old signal processing habits showed up ;) Of course, mapping to intrinsic is better I think. But I guess when they are not there, we need a fallback ;)
participants (6)
-
ajb@spamcop.net
-
joel falcou
-
Joel Falcou
-
John Dlugosz
-
Tim Blechmann
-
Vicente Botet Escriba