[review] Heaps: Mergeable Heaps

Andrew Sutton

5 Jun 2011 5 Jun '11

1:19 p.m.

Tim, I think you should define a MergeableHeap that describes heaps that can be merged in O(log n) (i.e., binomial, fibonacci, pairing, skew). Actually, the only heaps that don't satisfy this requirement are binary heaps, b-heaps and d-ary heaps (they all seem to be linear). This is similar to the the AdjacencyMatrix in the BGL. This poses a serious problem, however. The mutable priority queue wrapper implements merge() as, at best, a linear operation, meaning that no mutable priority queue could model the MergeableHeap concept. Andrew

Show replies by date

Tim Blechmann

5 Jun 5 Jun

1:31 p.m.

...

I think you should define a MergeableHeap that describes heaps that can be merged in O(log n) (i.e., binomial, fibonacci, pairing, skew). Actually, the only heaps that don't satisfy this requirement are binary heaps, b-heaps and d-ary heaps (they all seem to be linear). This is similar to the the AdjacencyMatrix in the BGL.

all heaps are mergable, but with different performance constraints. so maybe a LogNOrBetterMergableHeap concept would be more expressive?

...

This poses a serious problem, however. The mutable priority queue wrapper implements merge() as, at best, a linear operation, meaning that no mutable priority queue could model the MergeableHeap concept.

the mutable priority queue wrapper is only used to make container adaptors mutable ... these are b-heaps and d-ary heaps. fibonacci-heaps for example are can be merged (using merge_and_clear) in O(1). tim -- tim@klingt.org http://tim.klingt.org I don't write music for sissy ears. Charles Ives

Andrew Sutton

1:56 p.m.

...

all heaps are mergable, but with different performance constraints. so maybe a LogNOrBetterMergableHeap concept would be more expressive?

That is a terrible name for a concept :) All heaps can be merged in the same way that the distance between iterators can be computed. Is the merge implementation for container adaptors the same? It might be worthwhile implementing it is an algorithm rather than a member function for inefficiently merged heaps. void merge(Heap& l, Heap& r) { for(auto& x : r) l.push(x); } Or whatever... moves would probably be more appropriate if r is being torn down. The operation could then be specialized for MergeableHeap types to call the member function. Exactly like swap and distance. This has the benefit of making all heaps operable under a merge operation, but only efficiently mergeable heaps have a member function. A better analogy might actually be a range-based find() algorithm that calls find_if for Sequences and r.find(x) for Sets (set, unordered_set,etc.). You can always find an object in a sequence, but some data structures lend themselves towards efficient operation.

...

...
This poses a serious problem, however. the mutable priority queue wrapper is only used to make container adaptors mutable ... these are b-heaps and d-ary heaps. fibonacci-heaps for example are can be merged (using merge_and_clear) in O(1).

I see. That wasn't abundantly clear from scanning the implementation. Thanks for clearing that up. Andrew

Tim Blechmann

2:33 p.m.

...

Is the merge implementation for container adaptors the same? It might be worthwhile implementing it is an algorithm rather than a member function for inefficiently merged heaps.

void merge(Heap& l, Heap& r) { for(auto& x : r) l.push(x); }

well, it should be possible to implement it in O(N) instead of O(N log(N)) by using the heap iterators and creating a new heap ...

...

The operation could then be specialized for MergeableHeap types to call the member function. Exactly like swap and distance. This has the benefit of making all heaps operable under a merge operation, but only efficiently mergeable heaps have a member function.

i like about this approach that it would be possible to merge heaps of different types ... in fact we could have `heap operations' like merging, equivalence test and comparison of as heap functions ... but i am not sure, how the interface for merge should look like ... maybe something like: template <typename ResultHeap, typename Heap1, typename Heap2> ResultHeap merge(Heap1 const & h1, Heap2 const & h2); // consume h2 template <typename ResultHeap, typename Heap1, typename Heap2> ResultHeap merge_and_consume(Heap1 const & h1, Heap2 & h2); // consume h2 via move template <typename ResultHeap, typename Heap1, typename Heap2> ResultHeap merge_and_consume(Heap1 const & h1, Heap2 && h2); the current Heap::merge() and Heap::merge_and_clear() are somewhat odd ... thoughts? tim -- tim@klingt.org http://tim.klingt.org You can play a shoestring if you're sincere John Coltrane

Andrew Sutton

11:07 p.m.

...

...
void merge(Heap& l, Heap& r) { for(auto& x : r) l.push(x); }

well, it should be possible to implement it in O(N) instead of O(N log(N)) by using the heap iterators and creating a new heap ...

I know it's possible for binary and I suspect b-heaps (and probably d-ary as well). Is that true for all other container-based heaps?

...

i like about this approach that it would be possible to merge heaps of different types ... in fact we could have `heap operations' like merging, equivalence test and comparison of as heap functions ...

Yes, you could :)

...

ResultHeap merge(Heap1 const & h1, Heap2 const & h2);

I've always been of the opinion that merge was more like += than +, and that it's usually destructive on the RHS. The term "heap union" is often used as a synonym for merge. What if merge() was a modifying operation and heap_union implements the version above. For Mergeable heaps, the heap_union is two copies and two merges.

...

// consume h2 via move template <typename ResultHeap, typename Heap1, typename Heap2> ResultHeap merge_and_consume(Heap1 const & h1, Heap2 && h2);

I think that merge() (if it modifies both arguments) should move by default. You're really transferring data from the RHS into the LHS. In fact, I think this might be one of those perfect examples of what move is supposed to do :)

...

the current Heap::merge() and Heap::merge_and_clear() are somewhat odd ...

thoughts?

So... taking a quick look, we shouldn't call the operation merge(), we should call it heap_merge() to avoid ADL issues with std::merge. We don't want to break merge_sort :) I see two algorithms: void heap_merge(Heap1& lhs, Heap2& rhs); void heap_union(const Heap1& lhs, const Heap2& rhs, Heap3& result); The first transfers all objects owned by rhs into lhs. I think that's important. The second is more like std::merge (but without iterators). Obviously, there should be specializations for Mergeable heaps. Andrew

Tim Blechmann

6 Jun 6 Jun

8:05 a.m.

...

...
...
void merge(Heap& l, Heap& r) { for(auto& x : r) l.push(x); }

well, it should be possible to implement it in O(N) instead of O(N log(N)) by using the heap iterators and creating a new heap ...

I know it's possible for binary and I suspect b-heaps (and probably d-ary as well). Is that true for all other container-based heaps?

there are just two types of container-based heaps: d-ary heaps and b-heaps. binary heaps are d-ary heaps with an arity of 2.

...

...
ResultHeap merge(Heap1 const & h1, Heap2 const & h2);

I've always been of the opinion that merge was more like += than +, and that it's usually destructive on the RHS.

The term "heap union" is often used as a synonym for merge. What if merge() was a modifying operation and heap_union implements the version above. For Mergeable heaps, the heap_union is two copies and two merges.

i think the terms `merge' and `union' are more descriptive ...

...

...
// consume h2 via move template <typename ResultHeap, typename Heap1, typename Heap2> ResultHeap merge_and_consume(Heap1 const & h1, Heap2 && h2);

I think that merge() (if it modifies both arguments) should move by default. You're really transferring data from the RHS into the LHS. In fact, I think this might be one of those perfect examples of what move is supposed to do :)

true ... this would mean that we depend on boost::move ... but it seems that it has been accepted recently and is already merged into trunk ...

...

So... taking a quick look, we shouldn't call the operation merge(), we should call it heap_merge() to avoid ADL issues with std::merge. We don't want to break merge_sort :) I see two algorithms:

void heap_merge(Heap1& lhs, Heap2& rhs); void heap_union(const Heap1& lhs, const Heap2& rhs, Heap3& result);

The first transfers all objects owned by rhs into lhs. I think that's important. The second is more like std::merge (but without iterators). Obviously, there should be specializations for Mergeable heaps.

why pass the result by reference instead of returning it? to avoid passing a template argument? again, is there any guideline about this for boost libraries? so with move we could write: Heap1 heap_merge(Heap1 && lhs, Heap2 && rhs); ResultHeap heap_union(const Heap1& lhs, const Heap2& rhs); or void heap_union(const Heap1& lhs, const Heap2& rhs, Heap3& result); tim

BernardH

8:21 a.m.

Andrew Sutton <asutton.list <at> gmail.com> writes:

...

...
the current Heap::merge() and Heap::merge_and_clear() are somewhat odd ...

thoughts?

So... taking a quick look, we shouldn't call the operation merge(), we should call it heap_merge() to avoid ADL issues with std::merge. We don't want to break merge_sort :) I see two algorithms:

void heap_merge(Heap1& lhs, Heap2& rhs); void heap_union(const Heap1& lhs, const Heap2& rhs, Heap3& result);

The first transfers all objects owned by rhs into lhs. I think that's important. The second is more like std::merge (but without iterators). Obviously, there should be specializations for Mergeable heaps.

FWIW, I also found Heap::merge() and Heap::merge_and_clear() «somewhat odd» and was left wondering why not overload ::insert() (which copies elements from a source) and reuse splice() name from std::list<> which moves nodes from one container to the other. Best Regards, Bernard

Andrew Sutton

1:31 p.m.

...

FWIW, I also found Heap::merge() and Heap::merge_and_clear() «somewhat odd» and was left wondering why not overload ::insert() (which copies elements from a source) and reuse splice() name from std::list<> which moves nodes from one container to the other.

That's because the operations are commonly called merge and union in the literature. It's best to follow precedent. Andrew

Tim Blechmann

10:34 a.m.

...

...
Is the merge implementation for container adaptors the same? It might be worthwhile implementing it is an algorithm rather than a member function for inefficiently merged heaps.

void merge(Heap& l, Heap& r) {

for(auto& x : r) l.push(x);

}

well, it should be possible to implement it in O(N) instead of O(N log(N)) by using the heap iterators and creating a new heap ...

äh ... actually this is not as straight-forward as i thought, since boost.heap iterators cannot be used to iterate the heap in the order of the nodes. so in fact we need to copy the heap and sequentially extract the top node ... tim

Thorsten Ottosen

9:16 p.m.

Den 05-06-2011 15:19, Andrew Sutton skrev:

...

Tim,

I think you should define a MergeableHeap that describes heaps that can be merged in O(log n) (i.e., binomial, fibonacci, pairing, skew). Actually, the only heaps that don't satisfy this requirement are binary heaps, b-heaps and d-ary heaps (they all seem to be linear). This is similar to the the AdjacencyMatrix in the BGL.

Please specify what n is. Is it the combined size of the two heaps or??? -Thorsten

Andrew Sutton

9:48 p.m.

...

...
I think you should define a MergeableHeap that describes heaps that can be merged in O(log n) (i.e., binomial, fibonacci, pairing, skew). Actually, the only heaps that don't satisfy this requirement are binary heaps, b-heaps and d-ary heaps (they all seem to be linear). This is similar to the the AdjacencyMatrix in the BGL.

For binomial heaps, O(log max(x.size(), y.size())). O(1) for the others. Andrew

Thorsten Ottosen

11:40 p.m.

Den 06-06-2011 23:48, Andrew Sutton skrev:

...

...
...
I think you should define a MergeableHeap that describes heaps that can be merged in O(log n) (i.e., binomial, fibonacci, pairing, skew). Actually, the only heaps that don't satisfy this requirement are binary heaps, b-heaps and d-ary heaps (they all seem to be linear). This is similar to the the AdjacencyMatrix in the BGL.

For binomial heaps, O(log max(x.size(), y.size())). O(1) for the others.

Right. But I want the docs to clearly state that. -Thorsten

Andrew Sutton

7 Jun 7 Jun

1:14 a.m.

...

Right. But I want the docs to clearly state that.

Indeed they should :) Andrew

Tim Blechmann

8:31 a.m.

...

...
...
...
I think you should define a MergeableHeap that describes heaps that can be merged in O(log n) (i.e., binomial, fibonacci, pairing, skew). Actually, the only heaps that don't satisfy this requirement are binary heaps, b-heaps and d-ary heaps (they all seem to be linear). This is similar to the the AdjacencyMatrix in the BGL.

For binomial heaps, O(log max(x.size(), y.size())). O(1) for the others.

Right. But I want the docs to clearly state that.

right ... on my todo list ... tim

5163

Age (days ago)

5165

Last active (days ago)

List overview

Download

13 comments

4 participants

participants (4)

Andrew Sutton
BernardH
Thorsten Ottosen
Tim Blechmann