String, vector type with stack-allocated internal array (and dynamic array fallback)?

Hello,

for performance reasons, I would like to avoid having a std::wstring (one allocation/one deallocation of its character array) in one routine and instead use a wchar_t[MAXSIZE]. But I would also like my code to stay correct for sizes bigger than MAXSIZE. Is there some type (in Boost or elsewhere, preferably for both basic_string and vector) that defaults to a stack-allocated array of some size (preferably configurable by a template parameter) and, on overflow (i.e. a push_back or insert when it is full), falls back to a dynamically allocated array? This is how MSVC's STL basic_string implementation works, except that the array size is not configurable (16 bytes including the null terminator). I would obviously choose MAXSIZE so that 99.9% of cases are handled by the stack-allocated array.

Thanks,
Boris
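To make the requirement concrete, here is a minimal hand-written sketch of the idea for POD-like element types such as wchar_t. The small_buffer name and every detail in it are purely illustrative (not from Boost, MSVC, or any library mentioned in this thread): elements are stored in a stack array of capacity N, and only when N is exceeded does the buffer switch to a heap allocation.

    #include <cstddef>
    #include <cstring>

    // Illustrative only: fills a stack array first, moves to the heap on overflow.
    // Assumes T is trivially copyable (e.g. wchar_t), since growth uses memcpy.
    template <typename T, std::size_t N>
    class small_buffer {
    public:
        small_buffer() : data_(local_), size_(0), capacity_(N) {}
        ~small_buffer() { if (data_ != local_) delete[] data_; }

        void push_back(const T& value) {
            if (size_ == capacity_)
                grow();                       // first overflow leaves the stack array behind
            data_[size_++] = value;
        }

        const T* data() const { return data_; }
        std::size_t size() const { return size_; }

    private:
        void grow() {
            std::size_t new_capacity = capacity_ * 2;
            T* new_data = new T[new_capacity];
            std::memcpy(new_data, data_, size_ * sizeof(T));
            if (data_ != local_)
                delete[] data_;
            data_ = new_data;
            capacity_ = new_capacity;
        }

        small_buffer(const small_buffer&);             // non-copyable, for brevity
        small_buffer& operator=(const small_buffer&);

        T local_[N];          // the stack-allocated array that handles the common case
        T* data_;             // points at local_ until the first overflow
        std::size_t size_;
        std::size_t capacity_;
    };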

Hi,

I have a library proposal called Boost.Monotonic that does exactly this. Documentation is a work in progress, and what there is, is out of date. However, you are welcome to dig around https://svn.boost.org/svn/boost/sandbox/monotonic/. A good starting point is the test suite at http://tinyurl.com/mhwn5b.

Very quickly, it is a storage system that starts on the stack (with a size you can specify), then grows to the heap as needed. This is combined with an allocator that can use this storage, allowing containers, strings, etc. to use the stack at first and then the heap as needed.

You may also get some joy from boost::auto_buffer<>, which is in the trunk, but it can't be used as a storage area for containers or strings.

Cheers,
Christian

2009/6/30 Boris Dušek <boris.dusek@gmail.com>

Hello Christian, thanks for your reply.

On Tue, Jun 30, 2009 at 2:02 PM, Christian Schladetsch <christian.schladetsch@gmail.com> wrote:
This is great - exactly what I have been looking for. I have checked out the library from trunk and I am now in the process of trying it on my production code (first I have to finally add some sane, repeatable performance testing to my project; until today I just used to run a profiler manually and that was it). I want to ask a few questions though.

If I write

    void some_function()
    {
        typedef std::basic_string<wchar_t, std::char_traits<wchar_t>,
                                  boost::monotonic::allocator<wchar_t> > bufferwstring;
        bufferwstring key;
        bufferwstring key2;
    }

then do key and key2 each have their own buffer (and is the buffer on the stack, i.e. thread-safe)? Is there a way to specify the size of the buffer, like boost::monotonic::allocator<wchar_t, 32>? (I looked, and the other template parameters are "class"es, so not this way.) Or is the only way the one below? Or is the probably big default size not an issue? (I have no real experience with these stack-allocated buffers, other than knowing they are zero-cost.)

Is it possible to specify the size roughly like:

    void some_func()
    {
        boost::monotonic::storage<32*sizeof(wchar_t)> storage;
        std::vector<wchar_t, boost::monotonic::allocator<wchar_t> > key(storage);
    }

Also in this case (the example you mailed me):

    const size_t stack_size = 10*1024;
    monotonic::storage<stack_size> storage;
    {
        monotonic::string<> str(storage);
        monotonic::vector<Foo> vec(storage);
        // use str and vec; storage will use 10k of stack space, then the heap.
        // resources will be freed when storage goes out of scope.
    }

So str and vec are sharing the buffer (storage)? Does that mean we run into the same performance issues as malloc due to free-space management inside that buffer (at least I suppose that's where the main cost of malloc/free comes from)?

Also thanks for the pointer to auto_buffer, good to know.

I will try to make some measurements with my code and report back the results.

Cheers,
Boris

Hi Boris,

I'll make this reply here, but since this is about a library proposal and not a boost library, any further email on this should go directly to me.

2009/7/1 Boris Dušek <boris.dusek@gmail.com>
In the case you have above, key and key2 will both be using the same default global storage. This storage grows monotonically (hence the name) - storage is not released when an object is destroyed (but the object's dtor is still called, of course). This is why it is the fastest allocation system - deallocation is a no-op. The idea is to use it, then lose it all in one go.

You can supply "region tags" to the allocator to use different regions, and another tag to specify the access. For example:

    struct my_region_0 { };
    struct my_region_1 { };

    monotonic::allocator<T, my_region_0> alloc_0;                                      // global, not threadsafe
    monotonic::allocator<T, my_region_1, monotonic::shared_access_tag> alloc_1;        // threadsafe
    monotonic::allocator<T, my_region_1, monotonic::thread_local_access_tag> alloc_2;  // thread-local storage

These all use different storage. To free the storage, use monotonic::static_storage<my_region[, my_access]>::reset() to reset the usage count to zero, or ::release() to actually release any allocated memory.

Dox are on the way (what is in the sandbox is largely out of date), but in the meantime your best bet is to read the library and/or the unit tests and/or the benchmark application at http://tinyurl.com/l89llq.

Is it possible to specify the size roughly like:
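A rough end-to-end sketch of the region mechanism, pieced together only from the names used in this thread (monotonic::allocator, monotonic::thread_local_access_tag, monotonic::static_storage), so headers and exact signatures may differ from the sandbox sources:

    #include <vector>
    // assuming something like: #include <boost/monotonic/allocator.hpp>
    //                          namespace monotonic = boost::monotonic;

    struct parse_region { };   // tag type only; carries no data

    void parse_chunk()
    {
        // every allocation from this vector goes to the thread-local storage for parse_region
        std::vector<int, monotonic::allocator<int, parse_region,
                                              monotonic::thread_local_access_tag> > values;
        values.push_back(42);
        // ... use values ...
    }   // values' dtor runs here, but its memory stays in the region

    void after_all_parsing_is_done()
    {
        // reset the region's usage count to zero (memory is kept for reuse) ...
        monotonic::static_storage<parse_region, monotonic::thread_local_access_tag>::reset();
        // ... or actually return the memory to the system:
        // monotonic::static_storage<parse_region, monotonic::thread_local_access_tag>::release();
    }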
Yes. However, by initialising a std::vector with storage, you create a stateful allocator. This is not a safe thing to do with the STL. If you want to do this, then you should use monotonic::vector<> instead, which respects stateful allocators.

If you want to use std:: containers with stack-based monotonic storage, the best you can do ATM is to use a region:

    std::vector<T, monotonic::allocator<T, my_region> > vec;

This will use 64k of pre-allocated space (by default, see monotonic/config.hpp) in the BSS, which is just as fast as the stack in my testing. After the first 64k is used, it will transparently start using the heap. To clear resources, use monotonic::static_storage<my_region>::reset() to reset, or ::release() to actually release the memory.
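Put side by side, the two options look roughly like this (a sketch assembled from the snippets quoted in this thread, so the exact details may not match the sandbox sources):

    // Option 1: explicit storage + containers that respect stateful allocators
    monotonic::storage<10*1024> storage;          // 10k on the stack, then the heap
    {
        monotonic::vector<int> vec(storage);      // allocations come from 'storage'
        vec.push_back(1);
    }   // memory is reclaimed when 'storage' goes out of scope

    // Option 2: std:: containers + a stateless, region-tagged allocator
    struct my_region { };
    {
        std::vector<int, monotonic::allocator<int, my_region> > vec;   // uses the region's static storage
        vec.push_back(1);
    }
    monotonic::static_storage<my_region>::reset();   // reclaim the region once nothing uses it any more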
So str and vec are sharing the buffer (storage)?

Yes.
Monotonic does not free storage when an object is deallocated, so there is none of malloc's free-space management overhead. It uses the stack (or the BSS) first, then the heap, but it always grows until you manually reset it. It was designed for small, fast allocation for small, fast containers and similar requirements.

In the case above, you can also use the storage directly:

    Foo &foo = storage.create<Foo>();
    char *bytes = storage.allocate_bytes<3000>();

Don't forget to destroy anything you create this way, either by calling its dtor yourself or by calling storage.destroy(foo).

Also thanks for the pointer to auto_buffer, good to know.
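For example, a complete scoped use of direct storage might look like this (again only a sketch built from the calls named above - create, allocate_bytes and destroy - so details may differ from the sandbox sources):

    struct Foo { int x; };

    void work()
    {
        monotonic::storage<4096> storage;             // 4k on the stack, heap afterwards

        Foo &foo = storage.create<Foo>();             // constructed inside the storage
        char *bytes = storage.allocate_bytes<256>();  // raw, uninitialised bytes

        foo.x = 7;
        bytes[0] = 'a';

        storage.destroy(foo);                         // run Foo's dtor; the memory itself is not reclaimed
    }                                                 // storage goes out of scope; all memory is reclaimed here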
auto_buffer is safer to use ATM because it has been peer reviewed and is in boost trunk. Monotonic is a work in progress, but it is at the stage now where most of the remaining work is testing and documentation. However, it is very much a "use at your own risk" library. It hasn't been fully reviewed, it hasn't been extensively tested across many platforms, and it doesn't have extensive documentation.

You should stick with auto_buffer<> if you can get away with it, or use only stateless, regionalised monotonic allocators:

    struct my_region_tag { };

    {
        std::container<..., monotonic::allocator<T, my_region_tag> > cont;
        std::basic_string<..., monotonic::allocator<char, my_region_tag> > str;
        std::other_container<..., monotonic::allocator<U, my_region_tag> > cont2;

        // use cont, str, cont2...
    }

    // after this, cont etc. has lost its storage, so it had better be out of scope
    // before you call:
    monotonic::static_storage<my_region_tag>::reset();

This usage is safe, robust, works everywhere, and is extremely fast.
I will try to make some measurements with my code and report back the results.
I benchmarked monotonic against boost::pool, boost::fast_pool, std::allocator and Intel's TBB. The results are here: http://tinyurl.com/lj6nab. In summary, it is faster than the alternatives.

I can't recommend using monotonic in production code, and it isn't really an appropriate topic for this list, as it is not in boost trunk and may never be. It hasn't been extensively reviewed or tested across many platforms. But you are welcome to see what you can get. Performance-wise, it wins hands down.
Regards, Christian
participants (2)
- Boris Dušek
- Christian Schladetsch