[Review] UUID library (mini-)review starts today, November 23rd

newer
[bind/lambda] Matching signature...

Hartmut Kaiser

24 Nov 2008 24 Nov '08

12:13 a.m.

Hi all, The mini-review of Andy Tompkins UUID library starts today, November 23rd 2008, and will end on November 30th. I really hope to see your vote and your participation in the discussions on the Boost mailing lists! The library can be downloaded from the vault here (it's the file uuid_v13.zip in the topmost directory): http://tinyurl.com/6xx28b The initial review of the UUID library ended with a provisional acceptance (read here: http://article.gmane.org/gmane.comp.lib.boost.user/27774/). This mini review is meant to allow for a final decision after looking at the changed parts of the library. Here is a list of things fixed and changed: - fixed the licensing issues revealed by the initial review - fixed the security vulnerability revealed by the initial review - renamed to uuid, moved all classes, functions, etc. to namespace boost::uuids - implemented sha1 hash function (thus no license problem) - new class basic_uuid_generator to create random-based uuids. The random number generator is no longer hard coded. It can use any random number generator, default is boost::mt19937 - implemented a good seed function for random number generators - all functions are now reentrant, all classes are as thread safe as an int and the library is no longer dependent on Boost.Thread --------------------------------------------------- Please always state in your review, whether you think the library should be accepted as a Boost library! Additionally please consider giving feedback on the following general topics: - What is your evaluation of the design? - What is your evaluation of the implementation? - What is your evaluation of the documentation? - What is your evaluation of the potential usefulness of the library? - Did you try to use the library? With what compiler? Did you have any problems? - How much effort did you put into your evaluation? A glance? A quick reading? In-depth study? - Are you knowledgeable about the problem domain? Regards Hartmut Review Manager

Show replies by date

Dave Jenkins

24 Nov 24 Nov

1:58 p.m.

...

The library can be downloaded from the vault here (it's the file uuid_v13.zip in the topmost directory): http://tinyurl.com/6xx28b

This link didn't work for me. Is there a problem with it? Dave Jenkins

Roman Perepelitsa

2:06 p.m.

2008/11/24 Dave Jenkins <david@jenkins.net>

...

The library can be downloaded from the vault here (it's the file

...
uuid_v13.zip in the topmost directory): http://tinyurl.com/6xx28b

This link didn't work for me. Is there a problem with it?

Works for me. You can also try this: http://www.boostpro.com/vault/index.php?action=downloadfile&filename=uuid_v13.zip Roman Perepelitsa.

Christian Henning

5:45 p.m.

Hi there,

...

Please always state in your review, whether you think the library should be accepted as a Boost library!

I vote for acceptance.

...

Additionally please consider giving feedback on the following general topics:

- What is your evaluation of the design?

The design I like and fits into my world of c++ programming.

...

- What is your evaluation of the implementation?

Looks good. I like the programming style. I would recommend to clean up the files, in particular the commented out headers. Also, the file uuid_serialize.hpp contains only commented out code besides the macro instantiation. Is that functionality working? How about having a uuid_all.hpp header?

...

- What is your evaluation of the documentation?

Good.

...

- What is your evaluation of the potential usefulness of the library?

Huge. We are using UUID's a lot and this library seems to be a good candidate for the next code refactoring.

...

- Did you try to use the library? With what compiler? Did you have any problems?

Yes, I compiled the test program which Paul Bristow posted and it worked without a glitch. I'm using Visual Studio 8.

...

- How much effort did you put into your evaluation? A glance? A quick reading? In-depth study?

A quick reading.

...

- Are you knowledgeable about the problem domain?

I'm a user. Thanks for another very useful lib. Christian

Andy Tompkins

25 Nov 25 Nov

1:23 a.m.

New subject: [Review] UUID library (mini-)review starts today, November 23rd

On Mon, 24 Nov 2008 12:45:41 -0500, "Christian Henning" <chhenning@gmail.com> said: < snip >

...

...
- What is your evaluation of the implementation?

Looks good. I like the programming style. I would recommend to clean up the files, in particular the commented out headers. Also, the file uuid_serialize.hpp contains only commented out code besides the macro instantiation. Is that functionality working?

How about having a uuid_all.hpp header?

If users want this, I have no problem creating a uuid_all.hpp header. I'm also considering creating a uuid_fwd.hpp header. < snip >

...

Christian

Thanks, Andy Tompkins

Tim Blechmann

24 Nov 24 Nov

5:56 p.m.

...

The mini-review of Andy Tompkins UUID library starts today, November 23rd 2008, and will end on November 30th.

i just did a quick compile/testsuite test for now, and i see three trivial issues: 1. it doesn't compile with gcc-4.4, because of a missing include. patch: --- a/boost/uuid/seed_rng.hpp +++ b/boost/uuid/seed_rng.hpp @@ -26,6 +26,7 @@ #include <memory.h>^M #include <ctime>^M #include <cstdlib>^M +#include <cstdio>^M #include <boost/uuid/sha1.hpp>^M //#include <boost/nondet_random.hpp> //forward declare boost::random_device^M ^M 2. the testsuite fails on x86_64 because it is using hardcoded 32-bit hash values: Running 1 test case... libs/uuid/test/test_uuid.cpp(146): error in "test_main_caller( argc, argv )": check uuid_hasher(uuid()) == 3565488559U failed [17562157925649023279 != 3565488559] libs/uuid/test/test_uuid.cpp(147): error in "test_main_caller( argc, argv )": check uuid_hasher(u1) == 4159045843U failed [6622376135548557651 != 4159045843] libs/uuid/test/test_uuid.cpp(148): error in "test_main_caller( argc, argv )": check uuid_hasher(u2) == 2713274306U failed [4700797755868797762 != 2713274306] *** 3 failures detected in test suite "Test Program" 3. the sources use cr+lf eol style, i guess correctly importing the sources into subversion will fix this maybe i find some time to do a full review later this week. anyway, i have been using the library for quite some time and would like to see it being part of boost. thus i vote for acceptance. cheers, tim -- tim@klingt.org http://tim.klingt.org The composer makes plans, music laughs. Morton Feldman

Andy Tompkins

25 Nov 25 Nov

1:34 a.m.

New subject: [Review] UUID library (mini-)review starts today, November 23rd

On Mon, 24 Nov 2008 17:56:20 +0000 (UTC), "Tim Blechmann" <tim@klingt.org> said:

...

...
The mini-review of Andy Tompkins UUID library starts today, November 23rd 2008, and will end on November 30th.

i just did a quick compile/testsuite test for now, and i see three trivial issues:

1. it doesn't compile with gcc-4.4, because of a missing include. patch: --- a/boost/uuid/seed_rng.hpp +++ b/boost/uuid/seed_rng.hpp @@ -26,6 +26,7 @@ #include <memory.h>^M #include <ctime>^M #include <cstdlib>^M +#include <cstdio>^M #include <boost/uuid/sha1.hpp>^M //#include <boost/nondet_random.hpp> //forward declare boost::random_device^M ^M

I'll fix this.

...

2. the testsuite fails on x86_64 because it is using hardcoded 32-bit hash values: Running 1 test case... libs/uuid/test/test_uuid.cpp(146): error in "test_main_caller( argc, argv )": check uuid_hasher(uuid()) == 3565488559U failed [17562157925649023279 != 3565488559] libs/uuid/test/test_uuid.cpp(147): error in "test_main_caller( argc, argv )": check uuid_hasher(u1) == 4159045843U failed [6622376135548557651 != 4159045843] libs/uuid/test/test_uuid.cpp(148): error in "test_main_caller( argc, argv )": check uuid_hasher(u2) == 2713274306U failed [4700797755868797762 != 2713274306]

*** 3 failures detected in test suite "Test Program"

Hmm, I assume that the value that uuid_hasher produces is correct since I mimicked the way Boost.Hash does it. So, I believe I just need to adjust the code in test_uuid.cpp in a 64 bit environment where sizeof(std::size_t) != 32?

...

3. the sources use cr+lf eol style, i guess correctly importing the sources into subversion will fix this

I believe so this to be true.

...

maybe i find some time to do a full review later this week. anyway, i have been using the library for quite some time and would like to see it being part of boost. thus i vote for acceptance.

cheers, tim

-- tim@klingt.org http://tim.klingt.org

The composer makes plans, music laughs. Morton Feldman

Thanks, Andy Tompkins

Tim Blechmann

9:37 a.m.

...

...
libs/uuid/test/test_uuid.cpp(148): error in "test_main_caller( argc, argv )": check uuid_hasher(u2) == 2713274306U failed [4700797755868797762 != 2713274306]

*** 3 failures detected in test suite "Test Program"

Hmm, I assume that the value that uuid_hasher produces is correct since I mimicked the way Boost.Hash does it. So, I believe I just need to adjust the code in test_uuid.cpp in a 64 bit environment where sizeof(std::size_t) != 32?

right ... i suppose the hardcoded value in test_uuid.cpp is only correct on 32 bit platforms ... cheers, tim -- tim@klingt.org http://tim.klingt.org The price an artist pays for doing what he wants is that he has to do it. William S. Burroughs

Dave Jenkins

10:27 p.m.

...

- What is your evaluation of the design?

I like the design, but wish it had time-based UUIDs.

...

- What is your evaluation of the implementation?

I had no problem reading the code. But there's almost nothing in "uuid.ipp". Could you merge it into "uuid.hpp" for simplicity?

...

- What is your evaluation of the documentation?

I'd like to see the Rationale section come first, before you give the whole Class Synopsis. That would make it easier for casual readers to understand the purpose of the library before reading further. A small example program near the beginning of the document would help too. There is a spelling error in the Rationale section, s/indended/intended/.

...

- What is your evaluation of the potential usefulness of the library?

It's a nice addition to Boost.

...

- Did you try to use the library? With what compiler? Did you have any problems?

Yes, I tried running some test code and had no problems. I vote Yes, to include the UUID library in Boost. Regards, Dave Jenkins

Andy Tompkins

27 Nov 27 Nov

5:06 a.m.

New subject: [Review] UUID library (mini-)review starts today, November 23rd

On Tue, 25 Nov 2008 17:27:42 -0500, "Dave Jenkins" <david@jenkins.net> said:

...

...
- What is your evaluation of the design?

I like the design, but wish it had time-based UUIDs.

Understood. I will look into this again.

...

...
- What is your evaluation of the implementation?

I had no problem reading the code. But there's almost nothing in "uuid.ipp". Could you merge it into "uuid.hpp" for simplicity?

I could. The idea was to put non-template functions in uuid.ipp so that one could compile it in a .cpp to create a library. I agree that there is little in it. I should have moved the definition of many uuid member function to uuid.ipp. But I think you are right, I should just merge uuid.ipp into uuid.hpp.

...

...
- What is your evaluation of the documentation?

I'd like to see the Rationale section come first, before you give the whole Class Synopsis. That would make it easier for casual readers to understand the purpose of the library before reading further.

Good idea. I'll do this.

...

A small example program near the beginning of the document would help too.

Sure. I'll do this.

...

There is a spelling error in the Rationale section, s/indended/intended/.

Thanks.

...

...
- What is your evaluation of the potential usefulness of the library?

It's a nice addition to Boost.

...
- Did you try to use the library? With what compiler? Did you have any problems?

Yes, I tried running some test code and had no problems.

I vote Yes, to include the UUID library in Boost.

Regards, Dave Jenkins

Thanks, Andy Tompkins

Steven Watanabe

10:42 p.m.

New subject: [Review] UUID library (mini-)review starts today, November 23rd

AMDG Hartmut Kaiser wrote:

...

The mini-review of Andy Tompkins UUID library starts today, November 23rd 2008, and will end on November 30th. I really hope to see your vote and your participation in the discussions on the Boost mailing lists!

I only saw a couple of issues: uuid::get_showbraces_index() is not reentrant (there is a race the first time it is called.) operator<< doesn't handle setw. operator>> ignores spaces. Is this intended? In Christ, Steven Watanabe

Andy Tompkins

29 Nov 29 Nov

4:58 a.m.

New subject: [Review] UUID library (mini-)review starts today, November 23rd

On Thu, 27 Nov 2008 14:42:19 -0800, "Steven Watanabe" <watanabesj@gmail.com> said:

...

AMDG

Hartmut Kaiser wrote:

...
The mini-review of Andy Tompkins UUID library starts today, November 23rd 2008, and will end on November 30th. I really hope to see your vote and your participation in the discussions on the Boost mailing lists!

I only saw a couple of issues:

uuid::get_showbraces_index() is not reentrant (there is a race the first time it is called.)

I would like to fix this, but I'm not yet sure how.

...

operator<< doesn't handle setw. operator>> ignores spaces. Is this intended?

I intended to follow standard practices. I forgot setw. I will fix this. I didn't realize that operator>> ignores spaces. I will fix this too.

...

In Christ, Steven Watanabe

Thanks, Andy Tompkins

Christian Holmquist

2 Dec 2 Dec

2:05 a.m.

Hi, Maybe too late for a mini-review, but here's some unordered remarks.

...

From uuid.hpp

template <typename ch, typename char_traits, typename alloc> explicit uuid(std::basic_string<ch, char_traits, alloc> const& str); Creating a stringstream and parsing from a user-supplied string seems dangerous to me. It could be convenient to some, but I wouldn't go for an API where std::string str = ...; uuid id1(str); uuid id2(str.begin(), std.end()); Doesn't do the same thing. std::string to_string() const; std::wstring to_wstring() const; template <typename ch, typename char_traits, typename alloc> std::basic_string<ch, char_traits, alloc> to_basic_string() const; Any reason to use to_string instead of std::basic_ostream operators instead? size_type size() The underlying container is boost::array, shouldn't this function be made static? Or is the size of the uuid implementation defined and the user shouldn't count on it's length? is_null() What does this mean? Ah, ok, from the docs I see it's a magic uuid (all zero) that is_null. Maybe is_zero() would be more clear? static uuid create(uuid const& namespace_uuid, char const* name, int name_length); Can this be a generator instead? std::string name="www.widgets.com"; name_based_generator gen(name.begin(), name.end()); uuid id = gen(); Shouldn't basic_uuid_generator be named random_uuid_generator? It doesn't seem more basic than any other generator. - What is your evaluation of the design? Id like to see more separation between the generators and the core uuid class. IMO generating functionality should not be part of the uuid class at all, but separated into own headers with well documented behaviour. I'd like to se random_generator<class RandomGen> parsing_generator<class CharIterator> native_generator<..> (wrapper around os API) - What is your evaluation of the implementation? It seems fine, but it's not organized IMO. As a user I don't want to pay compile time for features I do not use (from uuid.hpp): #include <boost/operators.hpp> #include <boost/io/ios_state.hpp> #include <boost/random/uniform_int.hpp> #include <boost/random/variate_generator.hpp> #include <boost/random/mersenne_twister.hpp> #include <boost/uuid/seed_rng.hpp> #include <sstream> #include <iosfwd> #include <locale> - What is your evaluation of the documentation? I'd like to see more information about how to create uuids and if the library can help me doing that. Also what I can expect from the different generators (speed vs uniqueness). - What is your evaluation of the potential usefulness of the library? Very useful. - Did you try to use the library? With what compiler? Did you have any problems? Didn't try. - How much effort did you put into your evaluation? A glance? A quick reading? In-depth study? Reading the documentation and headers. About 2 hours. - Are you knowledgeable about the problem domain? Yes, as a user of them (not generation).

...

Please always state in your review, whether you think the library should be accepted as a Boost library! I vote no for now. I think most of the functionality needed is implemented (except the os_generator), but needs refactoring.

Maybe this is out of the question to most, but is the uuid class needed at all? I'd be happy to see the following working (cause then I can use our own uuid class with the algorithms provided by the library). #include <boost/uuid/random_generator.hpp> #include <boost/uuid/support/array.hpp> #include <boost/uuid/io.hpp> typedef boost::array<char, 16> id_type; boost::uuid::random_generator<id_type, boost::mt19937> uuid_gen(...); my_id id = uuid_gen(); std::cout << boost::uuid::format(id) <<std::endl; I admit the last part is from the top of my head right now, there's probably flaws with it =) But separating io, generation and container should IMO be done either way. Regards, Christian

Scott McMurray

5 a.m.

On Mon, Dec 1, 2008 at 21:05, Christian Holmquist <c.holmquist@gmail.com> wrote:

...

is_null() What does this mean? Ah, ok, from the docs I see it's a magic uuid (all zero) that is_null. Maybe is_zero() would be more clear?

I also prefer using zero in the name. That said, http://www.faqs.org/rfcs/rfc4122.html calls it the "nil" UUID. But .NET calls it Guid.Empty, and python seems to only call it uuid.UUID(), so nil may not be any clearer. Though really, why not just call it operator unspecified_bool_type? assert(!int() && !shared_ptr<T>() && !uuid());

...

static uuid create(uuid const& namespace_uuid, char const* name, int name_length); Can this be a generator instead?

std::string name="www.widgets.com"; name_based_generator gen(name.begin(), name.end()); uuid id = gen();

A generator seems logical. I certainly don't like the char*+length version. std::string name="www.widgets.com"; sha1_name_based_generator gen(dns_namespace_uuid); uuid id = gen(name); uuid id2 = gen(name.begin(), name.end()); assert(id == id2) I'd rather know whether it was v3 or v5 somewhere more obvious than the design notes, though. If I were using v3 (MD5) uuids and switched to boost not knowing that v5 (SHA1) uuids exist, I'd likely be very confused when my code stops working with my data.

Andy Tompkins

5 Dec 5 Dec

2:58 p.m.

New subject: [Review] UUID library (mini-)review starts today, November 23rd

On Tue, 2 Dec 2008 00:00:32 -0500, "Scott McMurray" <me22.ca+boost@gmail.com> said:

...

On Mon, Dec 1, 2008 at 21:05, Christian Holmquist <c.holmquist@gmail.com> wrote:

...
is_null() What does this mean? Ah, ok, from the docs I see it's a magic uuid (all zero) that is_null. Maybe is_zero() would be more clear?

I also prefer using zero in the name. That said, http://www.faqs.org/rfcs/rfc4122.html calls it the "nil" UUID. But .NET calls it Guid.Empty, and python seems to only call it uuid.UUID(), so nil may not be any clearer.

Though really, why not just call it operator unspecified_bool_type?

assert(!int() && !shared_ptr<T>() && !uuid());

Fair. I can add operator unspecified_bool_type. And, possibility remove null().

...

...
static uuid create(uuid const& namespace_uuid, char const* name, int name_length); Can this be a generator instead?

std::string name="www.widgets.com"; name_based_generator gen(name.begin(), name.end()); uuid id = gen();

A generator seems logical. I certainly don't like the char*+length version.

Agreed.

...

std::string name="www.widgets.com"; sha1_name_based_generator gen(dns_namespace_uuid); uuid id = gen(name); uuid id2 = gen(name.begin(), name.end()); assert(id == id2)

I'd rather know whether it was v3 or v5 somewhere more obvious than the design notes, though. If I were using v3 (MD5) uuids and switched to boost not knowing that v5 (SHA1) uuids exist, I'd likely be very confused when my code stops working with my data.

Hmm, I'd be surprised if your code stopped working. But regardless, would it be enough to add a/some member functions to uuid so that one could ask what type of uuid it was? For example: boost::uuids::uuid u; // generate a uuid bool bIsV3 = u.is_v3(); //or boost::uuids::uuid::version_type v = u.version(); Regards, Andy Tompkins

Scott McMurray

3:44 p.m.

On 2008-12-05, Andy Tompkins <atompkins@fastmail.fm> wrote:

...

On Tue, 2 Dec 2008 00:00:32 -0500, "Scott McMurray" said:

...
I'd rather know whether it was v3 or v5 somewhere more obvious than the design notes, though. If I were using v3 (MD5) uuids and switched to boost not knowing that v5 (SHA1) uuids exist, I'd likely be very confused when my code stops working with my data.

Hmm, I'd be surprised if your code stopped working.

Well, if I had some UUIDs in a database that were generated from a v3 algorithm, and I tried to look one up from the base string, but generated the UUID from the v5 algorithm, the lookup would fail. Though I have no idea whether this would be an issue in practice.

...

But regardless, would it be enough to add a/some member functions to uuid so that one could ask what type of uuid it was? For example:

boost::uuids::version_type v = u.version();

I think that's an excellent idea. I don't think it solves the issue, though, since I likely wouldn't think to check going from some hypothetical old C API's UuidCreateByName(&id, UUID_NAMESPACE_DNS, "www.widgets.com", 15); to id = boost::uuids::uuid::create(dns_namespace_uuid, "www.widgets.com");

Andy Tompkins

6 Dec 6 Dec

8:25 p.m.

New subject: [Review] UUID library (mini-)review starts today, November 23rd

On Fri, 5 Dec 2008 10:44:00 -0500, "Scott McMurray" <me22.ca+boost@gmail.com> said:

...

On 2008-12-05, Andy Tompkins <atompkins@fastmail.fm> wrote:

...
On Tue, 2 Dec 2008 00:00:32 -0500, "Scott McMurray" said:

...
I'd rather know whether it was v3 or v5 somewhere more obvious than the design notes, though. If I were using v3 (MD5) uuids and switched to boost not knowing that v5 (SHA1) uuids exist, I'd likely be very confused when my code stops working with my data.

Hmm, I'd be surprised if your code stopped working.

Well, if I had some UUIDs in a database that were generated from a v3 algorithm, and I tried to look one up from the base string, but generated the UUID from the v5 algorithm, the lookup would fail.

Ah, I understand. Yes this would be a problem.

...

Though I have no idea whether this would be an issue in practice.

Good question. I don't know how boost::uuids can prevent this besides documentation.

...

...
But regardless, would it be enough to add a/some member functions to uuid so that one could ask what type of uuid it was? For example:

boost::uuids::version_type v = u.version();

I think that's an excellent idea.

I don't think it solves the issue, though, since I likely wouldn't think to check going from some hypothetical old C API's

UuidCreateByName(&id, UUID_NAMESPACE_DNS, "www.widgets.com", 15);

to

id = boost::uuids::uuid::create(dns_namespace_uuid, "www.widgets.com");

I agree, it doesn't solve the issue. I think the best guard against this would be with unit tests for their code. Regards, Andy Tompkins

Andy Tompkins

2 Dec 2 Dec

10:19 p.m.

New subject: [Review] UUID library (mini-)review starts today, November 23rd

On Mon, 1 Dec 2008 20:05:58 -0600, "Christian Holmquist" <c.holmquist@gmail.com> said:

...

Hi, Maybe too late for a mini-review, but here's some unordered remarks.

...
From uuid.hpp

template <typename ch, typename char_traits, typename alloc> explicit uuid(std::basic_string<ch, char_traits, alloc> const& str);

Creating a stringstream and parsing from a user-supplied string seems dangerous to me. It could be convenient to some, but I wouldn't go for an API where

std::string str = ...; uuid id1(str); uuid id2(str.begin(), std.end());

Doesn't do the same thing.

Good point. I wonder if they should both be generators. A generator that takes a string or an iterator range for a string. And a different generator that takes an array of bytes or an iterator range for bytes.

...

std::string to_string() const; std::wstring to_wstring() const; template <typename ch, typename char_traits, typename alloc> std::basic_string<ch, char_traits, alloc> to_basic_string() const;

Any reason to use to_string instead of std::basic_ostream operators instead?

No reason to use one over the other, just preference. It has been mentioned before and I will likely remove the to_..._string functions.

...

size_type size() The underlying container is boost::array, shouldn't this function be made static? Or is the size of the uuid implementation defined and the user shouldn't count on it's length?

It could easily be made static. It will _always_ return 16.

...

is_null() What does this mean? Ah, ok, from the docs I see it's a magic uuid (all zero) that is_null. Maybe is_zero() would be more clear?

Maybe. Maybe uninitialized, nil. I don't think it matters too much because I don't think there is a name that everyone will think is obvious.

...

static uuid create(uuid const& namespace_uuid, char const* name, int name_length); Can this be a generator instead?

std::string name="www.widgets.com"; name_based_generator gen(name.begin(), name.end()); uuid id = gen();

I agree this should be a generator.

...

Shouldn't basic_uuid_generator be named random_uuid_generator? It doesn't seem more basic than any other generator.

Yes it should have random in its name.

...

- What is your evaluation of the design?

Id like to see more separation between the generators and the core uuid class. IMO generating functionality should not be part of the uuid class at all, but separated into own headers with well documented behaviour. I'd like to se random_generator<class RandomGen> parsing_generator<class CharIterator> native_generator<..> (wrapper around os API)

Yes, I like this.

...

- What is your evaluation of the implementation?

It seems fine, but it's not organized IMO. As a user I don't want to pay compile time for features I do not use (from uuid.hpp): #include <boost/operators.hpp> #include <boost/io/ios_state.hpp> #include <boost/random/uniform_int.hpp> #include <boost/random/variate_generator.hpp> #include <boost/random/mersenne_twister.hpp> #include <boost/uuid/seed_rng.hpp> #include <sstream> #include <iosfwd> #include <locale>

This could easily be fixed by separating functionality out into different header files. I will do this.

...

- What is your evaluation of the documentation?

I'd like to see more information about how to create uuids and if the library can help me doing that. Also what I can expect from the different generators (speed vs uniqueness).

Yes, I will do this.

...

- What is your evaluation of the potential usefulness of the library? Very useful.

- Did you try to use the library? With what compiler? Did you have any problems?

Didn't try.

- How much effort did you put into your evaluation? A glance? A quick reading? In-depth study? Reading the documentation and headers. About 2 hours.

- Are you knowledgeable about the problem domain? Yes, as a user of them (not generation).

...
Please always state in your review, whether you think the library should be accepted as a Boost library!

I vote no for now. I think most of the functionality needed is implemented (except the os_generator), but needs refactoring.

Maybe this is out of the question to most, but is the uuid class needed at all? I'd be happy to see the following working (cause then I can use our own uuid class with the algorithms provided by the library).

#include <boost/uuid/random_generator.hpp> #include <boost/uuid/support/array.hpp> #include <boost/uuid/io.hpp>

typedef boost::array<char, 16> id_type; boost::uuid::random_generator<id_type, boost::mt19937> uuid_gen(...); my_id id = uuid_gen(); std::cout << boost::uuid::format(id) <<std::endl;

I not keen on this. The fact that boost::uuid is implemented with boost::array is just an implementation detail. That may change. I want the public interface to remain the same.

...

I admit the last part is from the top of my head right now, there's probably flaws with it =) But separating io, generation and container should IMO be done either way.

I agree, separating io, generation, ... should be done. I will do this.

...

Regards, Christian

Thanks, Andy Tompkins

Christian Holmquist

11:26 p.m.

2008/12/2 Andy Tompkins <atompkins@fastmail.fm>

...

...
size_type size() The underlying container is boost::array, shouldn't this function be made static? Or is the size of the uuid implementation defined and the user shouldn't count on it's length?

It could easily be made static. It will _always_ return 16.

I think that should be part of the documented interface. It's just a matter of not confusing the reader that the uuid may be of variable length.

...

...
is_null() What does this mean? Ah, ok, from the docs I see it's a magic uuid (all zero) that is_null. Maybe is_zero() would be more clear?

Maybe. Maybe uninitialized, nil. I don't think it matters too much because I don't think there is a name that everyone will think is obvious.

You're probably right.

...

...
Maybe this is out of the question to most, but is the uuid class needed at all? I'd be happy to see the following working (cause then I can use our own uuid class with the algorithms provided by the library).

#include <boost/uuid/random_generator.hpp> #include <boost/uuid/support/array.hpp> #include <boost/uuid/io.hpp>

typedef boost::array<char, 16> id_type; boost::uuid::random_generator<id_type, boost::mt19937> uuid_gen(...); my_id id = uuid_gen(); std::cout << boost::uuid::format(id) <<std::endl;

I not keen on this. The fact that boost::uuid is implemented with boost::array is just an implementation detail. That may change. I want the public interface to remain the same.

Ok. For the sake of interface, does it make sense to you adding const char* uuid::data() const and static std::size_t uuid::size() that you won't change in future releases? I think it's an important feature for a uuid to store its 128bit value as a continues range of bytes, and that users can rely on using it as such where needed. Also, that the ordering of the bytes won't change in future releases. Uuids are likely to be persistent somewhere, and not necessarily trough the means of boost.serialization where version data can be attached. Regards, Christian

Andy Tompkins

6 Dec 6 Dec

7:44 p.m.

New subject: [Review] UUID library (mini-)review starts today, November 23rd

On Tue, 2 Dec 2008 17:26:13 -0600, "Christian Holmquist" <c.holmquist@gmail.com> said:

...

2008/12/2 Andy Tompkins <atompkins@fastmail.fm>

...
...
size_type size() The underlying container is boost::array, shouldn't this function be made static? Or is the size of the uuid implementation defined and the user shouldn't count on it's length?

It could easily be made static. It will _always_ return 16.

I think that should be part of the documented interface. It's just a matter of not confusing the reader that the uuid may be of variable length.

I will document this. < snip >

...

...
...
Maybe this is out of the question to most, but is the uuid class needed at all? I'd be happy to see the following working (cause then I can use our own uuid class with the algorithms provided by the library).

#include <boost/uuid/random_generator.hpp> #include <boost/uuid/support/array.hpp> #include <boost/uuid/io.hpp>

typedef boost::array<char, 16> id_type; boost::uuid::random_generator<id_type, boost::mt19937> uuid_gen(...); my_id id = uuid_gen(); std::cout << boost::uuid::format(id) <<std::endl;

I not keen on this. The fact that boost::uuid is implemented with boost::array is just an implementation detail. That may change. I want the public interface to remain the same.

Ok. For the sake of interface, does it make sense to you adding const char* uuid::data() const and static std::size_t uuid::size() that you won't change in future releases?

Hmm, I'm not sure. There is another post that I haven't got to replying to that talks about platforms with different sized types (char, int, ...). If I make sure that sizeof(boost::uuids::uuid) == 16 bytes on every platform then some platforms may get a performance penalty. On a platform where sizeof(char) != 1 or sizeof(int) != 4 then boost::uuids::uuid will have to do more work to put bits where they belong. Then again, I don't think I can actually guarantee on every platform that sizeof(boost::uuids::uuid) == 16. What I have done is make sure that each of the 16 blocks of bits in a boost::uuids::uuid is at least 8 bits. Thus boost::uuids::uuid::begin() and boost::uuids::uuid::end() will walk the 16 blocks correctly. I know this didn't really answer your question. I would like to offer const char* uuid::data() const, but I'm not sure that I can.

...

I think it's an important feature for a uuid to store its 128bit value as a continues range of bytes, and that users can rely on using it as such where needed. Also, that the ordering of the bytes won't change in future releases. Uuids are likely to be persistent somewhere, and not necessarily trough the means of boost.serialization where version data can be attached.

I agree. I did provide begin() and end() with this purpose in mind. And the order will never change.

...

Regards, Christian

Regards, Andy Tompkins

Steven Watanabe

8:10 p.m.

New subject: [Review] UUID library (mini-)review starts today, November 23rd

AMDG Andy Tompkins wrote:

...

On a platform where sizeof(char) != 1

sizeof(char) == 1 by definition. In Christ, Steven Watanabe

Agustín K-ballo Bergé

4 Dec 4 Dec

1:05 a.m.

New subject: [Review] UUID library (mini-)review starts today, November 23rd

Andy Tompkins escribió:

...

It could easily be made static. It will _always_ return 16.

Even for architectures with chars bigger than 8 bits long? From a quick read of the library source code, and boost::integer documentation, it seems that the library would not compile there. Would it make any sense in defining data_type as array< unsigned char, 128 / CHAR_BIT >? I'm interested in portability issues, but I have no access to platforms other than intel-x86, so I would like to hear some advice on the subject. Thank you. Agustín K-ballo Bergé.-

Andy Tompkins

6 Dec 6 Dec

8:18 p.m.

New subject: [Review] UUID library (mini-)review starts today, November 23rd

On Wed, 03 Dec 2008 22:05:39 -0300, "Agustín K-ballo Bergé" <kaballo86@hotmail.com> said:

...

Andy Tompkins escribió:

...
It could easily be made static. It will _always_ return 16.

Even for architectures with chars bigger than 8 bits long? From a quick read of the library source code, and boost::integer documentation, it seems that the library would not compile there.

Hmm, can anyone verify this? I will address this if it is a problem. What are the guarantees of the size of a byte? Is a byte always 8 bits?

...

Would it make any sense in defining data_type as array< unsigned char, 128 / CHAR_BIT >? I'm interested in portability issues, but I have no access to platforms other than intel-x86, so I would like to hear some advice on the subject.

I also have no access to platforms other than x86. Would this work? Would there be performance penalities on some platforms?

...

Thank you.

Agustín K-ballo Bergé.-

Regards, Andy Tompkins

Scott McMurray

8:28 p.m.

On Sat, Dec 6, 2008 at 15:18, Andy Tompkins <atompkins@fastmail.fm> wrote:

...

On Wed, 03 Dec 2008 22:05:39 -0300, "Agustín K-ballo Bergé" <kaballo86@hotmail.com> said:

...
Andy Tompkins escribió:

...
It could easily be made static. It will _always_ return 16.

Even for architectures with chars bigger than 8 bits long? From a quick read of the library source code, and boost::integer documentation, it seems that the library would not compile there.

Hmm, can anyone verify this? I will address this if it is a problem.

What are the guarantees of the size of a byte? Is a byte always 8 bits?

By C++ definition, a byte is the size of a char, and contains at least 8 bits. My understanding is that posix sockets require CHAR_BIT == 8, so outside of DSP chips and other special hardware, that's almost always the case. Considering that UUIDs were originally designed for RPC -- typically over sockets -- and that the v1 algorithm is defined using MAC addresses, it's probably a safe assumption.

Andy Tompkins

9 Dec 9 Dec

4:56 p.m.

New subject: [Review] UUID library (mini-)review starts today, November 23rd

On Sat, 6 Dec 2008 15:28:30 -0500, "Scott McMurray" <me22.ca+boost@gmail.com> said:

...

On Sat, Dec 6, 2008 at 15:18, Andy Tompkins <atompkins@fastmail.fm> wrote:

...
On Wed, 03 Dec 2008 22:05:39 -0300, "Agustín K-ballo Bergé" <kaballo86@hotmail.com> said:

...
Andy Tompkins escribió:

...
It could easily be made static. It will _always_ return 16.

Even for architectures with chars bigger than 8 bits long? From a quick read of the library source code, and boost::integer documentation, it seems that the library would not compile there.

Hmm, can anyone verify this? I will address this if it is a problem.

What are the guarantees of the size of a byte? Is a byte always 8 bits?

By C++ definition, a byte is the size of a char, and contains at least 8 bits.

My understanding is that posix sockets require CHAR_BIT == 8, so outside of DSP chips and other special hardware, that's almost always the case. Considering that UUIDs were originally designed for RPC -- typically over sockets -- and that the v1 algorithm is defined using MAC addresses, it's probably a safe assumption.

Thank you. So I think the uuid library will require either that CHAR_BIT == 8 or CHAR_BIT % 8 == 0. That is to say that the platform must have 8 bit bytes. Andy.

Daniel Krügler

10 Dec 10 Dec

8:31 a.m.

New subject: [Review] UUID library (mini-)review starts today, November 23rd

Andy Tompkins wrote:

...

On Sat, 6 Dec 2008 15:28:30 -0500, "Scott McMurray" <me22.ca+boost@gmail.com> said:

...
My understanding is that posix sockets require CHAR_BIT == 8, so outside of DSP chips and other special hardware, that's almost always the case. Considering that UUIDs were originally designed for RPC -- typically over sockets -- and that the v1 algorithm is defined using MAC addresses, it's probably a safe assumption.

Thank you. So I think the uuid library will require either that CHAR_BIT == 8 or CHAR_BIT % 8 == 0. That is to say that the platform must have 8 bit bytes.

The last condition (CHAR_BIT % 8 == 0) does not prove that the platform has 8 bit bytes, just consider DSP controllers with CHAR_BIT = 32 - we have those. I have not completely followed the discussion, but it is of-course possible to realize effective 8 bit calculations even on those systems with CHAR_BIT > 8 by effectively using a math with corresponding bit masks hiding everything beyond the 8 bit limit. Side note: The C++ is going to introduce non-optional new character types char16_t and char32_t. But these are not required to exactly fit into 16 bit or 32 bit, just they are required to be capable to *represent* corresponding values. This seems quite similar to the discussion here, if I did not misunderstand it. Greetings from Bremen, Daniel Krügler

muhammadchang

3 Dec 3 Dec

5:21 a.m.

New subject: [Review] UUID library (mini-)review starts today, November 23rd

Hi boosters, First of all, my opinion is that this kind of classes, sometimes called concrete types, are the very plus plus suggested by the language's name. A well designed user type behaves and cost, exactly the same as any built in type. This makes C++ unique in the plethora of available languages. That said, I must mention that I am aware of the flaws pointed out by many reviewers, specially those concerning interface. In particular, I don't like the "default null constructor". For me, the default constructor MUST generate a valid object, period. I have already copied/used Andy's code, which I luckily found in your vault, and one thing I changed was this default constructor [uuid_t uuid; // generates a valid random uuid]. If the user is short of resources, she must use the copy constructor [uuid_t nil(uuid_t::null());]. But I don't provide a bool is_null() const; method, for operator==() does the job pretty well. Despite this weaknesses, I think the library is ready for out-of-the-box usage, so my vote is YES, this library should be part of boost. Sincerely, Kenneth -- View this message in context: http://www.nabble.com/-Review--UUID-library-%28mini-%29review-starts-today%2... Sent from the Boost - Users mailing list archive at Nabble.com.

Scott McMurray

6:11 a.m.

On Wed, Dec 3, 2008 at 00:21, muhammadchang <kennethlaskoski@gmail.com> wrote:

...

First of all, my opinion is that this kind of classes, sometimes called concrete types, are the very plus plus suggested by the language's name. A well designed user type behaves and cost, exactly the same as any built in type. [...] In particular, I don't like the "default null constructor". For me, the default constructor MUST generate a valid object, period. I have already copied/used Andy's code, which I luckily found in your vault, and one thing I changed was this default constructor [uuid_t uuid; // generates a valid random uuid].

"behaves and cost, exactly the same as any built in type" and the "default constructor [...] generates a valid random uuid" seem rather opposed. int() is not a random int, it's 0. No fundamental type -- no standard library type either, that I can think of -- has T() != T(), yet you want uuid() != uuid(). And why is the random UUID generator deemed more important than the time-based one, so much so that it gets the default constructor?

Rodrigo Madera

12:42 p.m.

...

... No fundamental type -- no standard library type either, that I can think of -- has T() != T(), yet you want uuid() != uuid().

I totally agree. Sometimes it "feels" that you must implement something the "practical" way, but the theory behind it that makes the building blocks possible has to be respected at all times to remain mathematically consistent and somewhat predictable for future uses to come. IMHO, it should always be true that T() == T(), and uuid should not be the exception: uuid() == uuid(). Rodrigo

muhammadchang

8:19 p.m.

New subject: [Review] UUID library (mini-)review starts today, November 23rd

...

"behaves and cost, exactly the same as any built in type" and the "default constructor [...] generates a valid random uuid" seem rather opposed. int() is not a random int, it's 0. No fundamental type -- no standard library type either, that I can think of -- has T() != T(), yet you want uuid() != uuid().

You are right, there is an apparent contradiction in my wording above. By "behaves and cost", I mean implementing comparison operators, inserters and extractors, for example. Where applicable, of course, and with minimum overhead. But an UUID is more than its bare 128-bit representation. IMHO, what justifies the truly exceptional uuid() != uuid() is the "unique" in "universally unique id". Yours, Kenneth

...

And why is the random UUID generator deemed more important than the time-based one, so much so that it gets the default constructor?

Because Andy has not [yet] implemented the time-based generator, which poses portability problems. _______________________________________________ Boost-users mailing list Boost-users@lists.boost.org http://lists.boost.org/mailman/listinfo.cgi/boost-users -- View this message in context: http://www.nabble.com/-Review--UUID-library-%28mini-%29review-starts-today%2... Sent from the Boost - Users mailing list archive at Nabble.com.

Scott McMurray

8:57 p.m.

On Wed, Dec 3, 2008 at 15:19, muhammadchang <kennethlaskoski@gmail.com> wrote:

...

...
"behaves and cost, exactly the same as any built in type" and the "default constructor [...] generates a valid random uuid" seem rather opposed. int() is not a random int, it's 0. No fundamental type -- no standard library type either, that I can think of -- has T() != T(), yet you want uuid() != uuid().

You are right, there is an apparent contradiction in my wording above. By "behaves and cost", I mean implementing comparison operators, inserters and extractors, for example. Where applicable, of course, and with minimum overhead. But an UUID is more than its bare 128-bit representation. IMHO, what justifies the truly exceptional uuid() != uuid() is the "unique" in "universally unique id".

But it's the mapping of the ID to something that's unique, not the objects themselves. The only time you can do something useful with a UUID is when you have two that are the same, so I still don't think breaking normal semantics is justified. The usage I was looking at for my project never needs the UUID library to generate one itself, in fact. They're all generated in the database (http://dev.mysql.com/doc/refman/5.0/en/miscellaneous-functions.html#function...), but the class is still useful for parsing and serialization. Changing tacks somewhat... I was just looking deeper into the implementation, and saw some things that worry me. For example, this looks like an aliasing violation in uuid.hpp line 279: *reinterpret_cast<unsigned long*>(&data[i]) = _v_gen(); It's at least clearly byte-order-dependant, which isn't good, since I'd expect a generator seeded the same way to always produce the same UUIDs, regardless of architecture. I'm pleased to see proper shifting and masking in the SHA1 code, which made me think of something: Why not just make the UUID class a PoD? It has an architecture-independent in-memory format (rfc4122, section 4.1.2, which is already followed), so turning it into one would be straight-forward, and would allow it to be very easily & safely splatted to and from files. This would require moving the constructors and such to generators, stream operators, and similar, but I think those are positive changes in any case. It certainly solves the "what should the constructor do" problem, and places all the various generators (nil, random, name, time, ...) at equal footing. It's a value type, so let's make it as much like an int as possible.

muhammadchang

6 Dec 6 Dec

2:07 a.m.

New subject: [Review] UUID library (mini-)review starts today, November 23rd

I see, our use cases are radically different. You ain't talking about objects at all. If all you need is a PoD, why not just declare typedef boost::array<uint8_t, 16> uuid;//? Inter, Kenneth Scott McMurray-2 wrote:

...

On Wed, Dec 3, 2008 at 15:19, muhammadchang <kennethlaskoski@gmail.com> wrote:

...
...
"behaves and cost, exactly the same as any built in type" and the "default constructor [...] generates a valid random uuid" seem rather opposed. int() is not a random int, it's 0. No fundamental type -- no standard library type either, that I can think of -- has T() != T(), yet you want uuid() != uuid().

You are right, there is an apparent contradiction in my wording above. By "behaves and cost", I mean implementing comparison operators, inserters and extractors, for example. Where applicable, of course, and with minimum overhead. But an UUID is more than its bare 128-bit representation. IMHO, what justifies the truly exceptional uuid() != uuid() is the "unique" in "universally unique id".

But it's the mapping of the ID to something that's unique, not the objects themselves. The only time you can do something useful with a UUID is when you have two that are the same, so I still don't think breaking normal semantics is justified.

The usage I was looking at for my project never needs the UUID library to generate one itself, in fact. They're all generated in the database (http://dev.mysql.com/doc/refman/5.0/en/miscellaneous-functions.html#function...), but the class is still useful for parsing and serialization.

Changing tacks somewhat...

I was just looking deeper into the implementation, and saw some things that worry me. For example, this looks like an aliasing violation in uuid.hpp line 279:

*reinterpret_cast<unsigned long*>(&data[i]) = _v_gen();

It's at least clearly byte-order-dependant, which isn't good, since I'd expect a generator seeded the same way to always produce the same UUIDs, regardless of architecture.

I'm pleased to see proper shifting and masking in the SHA1 code, which made me think of something: Why not just make the UUID class a PoD? It has an architecture-independent in-memory format (rfc4122, section 4.1.2, which is already followed), so turning it into one would be straight-forward, and would allow it to be very easily & safely splatted to and from files.

This would require moving the constructors and such to generators, stream operators, and similar, but I think those are positive changes in any case. It certainly solves the "what should the constructor do" problem, and places all the various generators (nil, random, name, time, ...) at equal footing.

It's a value type, so let's make it as much like an int as possible. _______________________________________________ Boost-users mailing list Boost-users@lists.boost.org http://lists.boost.org/mailman/listinfo.cgi/boost-users

-- View this message in context: http://www.nabble.com/-Review--UUID-library-%28mini-%29review-starts-today%2... Sent from the Boost - Users mailing list archive at Nabble.com.

Andy Tompkins

8:11 p.m.

New subject: [Review] UUID library (mini-)review starts today, November 23rd

On Wed, 3 Dec 2008 15:57:45 -0500, "Scott McMurray" <me22.ca+boost@gmail.com> said:

...

On Wed, Dec 3, 2008 at 15:19, muhammadchang <kennethlaskoski@gmail.com> wrote:

...
...
"behaves and cost, exactly the same as any built in type" and the "default constructor [...] generates a valid random uuid" seem rather opposed. int() is not a random int, it's 0. No fundamental type -- no standard library type either, that I can think of -- has T() != T(), yet you want uuid() != uuid().

You are right, there is an apparent contradiction in my wording above. By "behaves and cost", I mean implementing comparison operators, inserters and extractors, for example. Where applicable, of course, and with minimum overhead. But an UUID is more than its bare 128-bit representation. IMHO, what justifies the truly exceptional uuid() != uuid() is the "unique" in "universally unique id".

But it's the mapping of the ID to something that's unique, not the objects themselves. The only time you can do something useful with a UUID is when you have two that are the same, so I still don't think breaking normal semantics is justified.

I also think that uuid() == uuid() is a good thing. I do understand the other point of view and where it is coming from. It really does make me think that there should not be a default constructor. Then, one must be explicit if they want a 'null' uuid, something like: boost::uuids::uuid u1(boost::uuids::null()); and one must be explicit if they want a unique uuid, something like: boost::uuids::uuid u2(boost::uuids::uuid_generator());

...

The usage I was looking at for my project never needs the UUID library to generate one itself, in fact. They're all generated in the database (http://dev.mysql.com/doc/refman/5.0/en/miscellaneous-functions.html#function...), but the class is still useful for parsing and serialization.

Changing tacks somewhat...

I was just looking deeper into the implementation, and saw some things that worry me. For example, this looks like an aliasing violation in uuid.hpp line 279:

*reinterpret_cast<unsigned long*>(&data[i]) = _v_gen();

I do not believe this is an aliasing violation.

...

It's at least clearly byte-order-dependant, which isn't good, since I'd expect a generator seeded the same way to always produce the same UUIDs, regardless of architecture.

Hmm, I was only trying to be efficient. The generator produces values of size = sizeof(unsigned long) and I wanted to use all those bytes. Do you have a suggestion that would improve this?

...

I'm pleased to see proper shifting and masking in the SHA1 code, which made me think of something: Why not just make the UUID class a PoD? It has an architecture-independent in-memory format (rfc4122, section 4.1.2, which is already followed), so turning it into one would be straight- forward, and would allow it to be very easily & safely splatted to and from files.

I do really like having a class encapsulate the data and functions. Also, I don't have much experience with platforms where sizeof(int) != 4 or sizeof(char) != 1, and I want to make sure that I can do as you suggest.

...

This would require moving the constructors and such to generators, stream operators, and similar, but I think those are positive changes in any case. It certainly solves the "what should the constructor do" problem, and places all the various generators (nil, random, name, time, ...) at equal footing.

I also think that all the ways to create a uuid can be made as generators, and by not providing a default constructor will keep them equal.

...

It's a value type, so let's make it as much like an int as possible.

Regards, Andy Tompkins

Scott McMurray

8:59 p.m.

On Sat, Dec 6, 2008 at 15:11, Andy Tompkins <atompkins@fastmail.fm> wrote:

...

On Wed, 3 Dec 2008 15:57:45 -0500, "Scott McMurray" <me22.ca+boost@gmail.com> said:

...
I was just looking deeper into the implementation, and saw some things that worry me. For example, this looks like an aliasing violation in uuid.hpp line 279:

*reinterpret_cast<unsigned long*>(&data[i]) = _v_gen();

I do not believe this is an aliasing violation.

3.10/15 from n2723 (and essentially the same as C++03): If a program attempts to access the stored value of an object through an lvalue of other than one of the following types the behavior is undefined: — the dynamic type of the object, — a cv-qualified version of the dynamic type of the object, — a type similar (as defined in 4.4) to the dynamic type of the object, — a type that is the signed or unsigned type corresponding to the dynamic type of the object, — a type that is the signed or unsigned type corresponding to a cv-qualified version of the dynamic type of the object, — an aggregate or union type that includes one of the aforementioned types among its members (including, recursively, a member of a subaggregate or contained union), — a type that is a (possibly cv-qualified) base class type of the dynamic type of the object, — a char or unsigned char type. The dynamic type of the stored objects is unsigned char, and they are being accessed through an lvalue of type unsigned long.

...

...
It's at least clearly byte-order-dependant, which isn't good, since I'd expect a generator seeded the same way to always produce the same UUIDs, regardless of architecture.

Hmm, I was only trying to be efficient. The generator produces values of size = sizeof(unsigned long) and I wanted to use all those bytes. Do you have a suggestion that would improve this?

Actually, the default generator produces uint32_t, so (in the thread on boost-dev I started earlier) I just switched to using that, and used the same code that the sha1 uses to put 4-byte ints into 4 bytes.

...

...
I'm pleased to see proper shifting and masking in the SHA1 code, which made me think of something: Why not just make the UUID class a PoD? It has an architecture-independent in-memory format (rfc4122, section 4.1.2, which is already followed), so turning it into one would be straight- forward, and would allow it to be very easily & safely splatted to and from files.

I do really like having a class encapsulate the data and functions.

In general, I agree, but POD doesn't prevent having a class. Also, the UUID itself imposes no invariant on the stored data other than that it's initialized, and allowing arbitrary modification can't break that. That said, POD-ness is the least important of the aspects I've raised. I would like mutating iterators into the array, though. ~ Scott

Andy Tompkins

9 Dec 9 Dec

5:07 p.m.

New subject: [Review] UUID library (mini-)review starts today, November 23rd

On Sat, 6 Dec 2008 15:59:41 -0500, "Scott McMurray" <me22.ca+boost@gmail.com> said:

...

On Sat, Dec 6, 2008 at 15:11, Andy Tompkins <atompkins@fastmail.fm> wrote:

...
On Wed, 3 Dec 2008 15:57:45 -0500, "Scott McMurray" <me22.ca+boost@gmail.com> said:

...
I was just looking deeper into the implementation, and saw some things that worry me. For example, this looks like an aliasing violation in uuid.hpp line 279:

*reinterpret_cast<unsigned long*>(&data[i]) = _v_gen();

I do not believe this is an aliasing violation.

3.10/15 from n2723 (and essentially the same as C++03):

If a program attempts to access the stored value of an object through an lvalue of other than one of the following types the behavior is undefined: — the dynamic type of the object, — a cv-qualified version of the dynamic type of the object, — a type similar (as defined in 4.4) to the dynamic type of the object, — a type that is the signed or unsigned type corresponding to the dynamic type of the object, — a type that is the signed or unsigned type corresponding to a cv-qualified version of the dynamic type of the object, — an aggregate or union type that includes one of the aforementioned types among its members (including, recursively, a member of a subaggregate or contained union), — a type that is a (possibly cv-qualified) base class type of the dynamic type of the object, — a char or unsigned char type.

The dynamic type of the stored objects is unsigned char, and they are being accessed through an lvalue of type unsigned long.

Thank you. I understand now. I will change this.

...

...
...
It's at least clearly byte-order-dependant, which isn't good, since I'd expect a generator seeded the same way to always produce the same UUIDs, regardless of architecture.

Hmm, I was only trying to be efficient. The generator produces values of size = sizeof(unsigned long) and I wanted to use all those bytes. Do you have a suggestion that would improve this?

Actually, the default generator produces uint32_t, so (in the thread on boost-dev I started earlier) I just switched to using that, and used the same code that the sha1 uses to put 4-byte ints into 4 bytes.

Yes, I like that.

...

...
...
I'm pleased to see proper shifting and masking in the SHA1 code, which made me think of something: Why not just make the UUID class a PoD? It has an architecture-independent in-memory format (rfc4122, section 4.1.2, which is already followed), so turning it into one would be straight- forward, and would allow it to be very easily & safely splatted to and from files.

I do really like having a class encapsulate the data and functions.

In general, I agree, but POD doesn't prevent having a class. Also, the UUID itself imposes no invariant on the stored data other than that it's initialized, and allowing arbitrary modification can't break that.

That said, POD-ness is the least important of the aspects I've raised. I would like mutating iterators into the array, though.

I understand your desire. I would like to put in mutating iterators. I wrote in my last post, should the uuid library require CHAR_BITS == 8, or can it relax and only require that CHAR_BITS % 8 == 0. That is the uuid library only needs 8 bit bytes. I guess the difference will be in implementating the iterators.

...

~ Scott

Regards, Andy Tompkins

Scott McMurray

5:40 p.m.

On Tue, Dec 9, 2008 at 12:07, Andy Tompkins <atompkins@fastmail.fm> wrote:

...

I understand your desire. I would like to put in mutating iterators. I wrote in my last post, should the uuid library require CHAR_BITS == 8, or can it relax and only require that CHAR_BITS % 8 == 0. That is the uuid library only needs 8 bit bytes. I guess the difference will be in implementating the iterators.

I think it's reasonable for the uuid class itself to work iff boost/cstdint.hpp provides a uint8_t, and for the generators to work if boost/cstdint.hpp provides a uint32_t. (The latter is mostly for the hash, but requiring it for the random too is convenient). I don't see the added implementation complexity to support unusual architectures as being worth it. For example, even the C compiler for the last mcu I used (one of the weaker MSP430s) provided both, despite it being about 500 (or 65000, depending) times slower than and having one millionth the ram of my laptop. That said, I recall a discussion on the LLVM list a bit back about an architecture with 96-bit registers separated into four 24-bit channels. Is anyone running Boost on such things?

Daryle Walker

8:34 a.m.

New subject: [Boost-announce] [Review] UUID library (mini-)review starts today, November 23rd

[Sorry for the lateness. Haven't looked at any other reviews yet.] On Nov 23, 2008, at 7:13 PM, Hartmut Kaiser wrote:

...

The mini-review of Andy Tompkins UUID library starts today, November 23rd 2008, and will end on November 30th. I really hope to see your vote and your participation in the discussions on the Boost mailing lists!

The library can be downloaded from the vault here (it's the file uuid_v13.zip in the topmost directory): http://tinyurl.com/6xx28b

The initial review of the UUID library ended with a provisional acceptance (read here: http://article.gmane.org/gmane.comp.lib.boost.user/ 27774/).

This mini review is meant to allow for a final decision after looking at the changed parts of the library. Here is a list of things fixed and changed:

- fixed the licensing issues revealed by the initial review - fixed the security vulnerability revealed by the initial review - renamed to uuid, moved all classes, functions, etc. to namespace boost::uuids - implemented sha1 hash function (thus no license problem)

Others, including myself, are working on encoding libraries. If this library, and an encoding library are added, we could change the implementation to use the new encoding library.

...

- new class basic_uuid_generator to create random-based uuids. The random number generator is no longer hard coded. It can use any random number generator, default is boost::mt19937 - implemented a good seed function for random number generators - all functions are now reentrant, all classes are as thread safe as an int and the library is no longer dependent on Boost.Thread

---------------------------------------------------

Please always state in your review, whether you think the library should be accepted as a Boost library!

Yes, please accept it.

...

Additionally please consider giving feedback on the following general topics:

- What is your evaluation of the design?

Why is there a container interface for inspecting the value's octets? For a similar problem in my code (the MD5 stuff in the Sandbox SVN), I defined a copying-out member function template that takes an output iterator and returns the updated version of same. You could do that, making sure to define some (compile-time) item indicating the length (16), to facilitate easier changes to internal storage. Don't some operating systems provide UUID generation? Could there be a way to add a creation function that uses the OS's code? Is default construction useful besides when initialization can't be done in one step? Instead of an "is_null" member function, maybe (pseudo-)Boolean conversion (and "operator not") can be used. I don't think the multitude of string conversion techniques are needed. Keep the constructor with "char const *" for pseudo- literals. Everything else, in either direction, could use "lexical_cast".

...

- What is your evaluation of the implementation?

The "generator_iterator" substitute in "seed_rng.hpp" doesn't completely help. It is supposed to improve on the version in "boost/ generator_iterator.hpp" in the area of end-iterator support. I originally said more here, but I erased it because when I filed a ticket on this issue, I found one already (#2428) and put my notes there. The implementation of the constructor that takes a byte-returning input-iterator shows why we need a double-bounded copy algorithm. (Now in new ticket #2578.) The copy constructor, destructor, and copy-assignment operator don't do anything different than the automatically-defined versions would. So they could be omitted. Maybe the "create_name_based" and "get_showbraces_index" member functions need to be in a new mandatory source file. If so, you could move some common error strings there too. BTW, why do "create" (and "create_name_based") need the length based in as a separate argument? Just use "std::strlen". The I/O function templates could be tightened up some. (I noticed it since I work on I/O quite a bit. It could be done later.) The optimization would be more important if my dump-most-of-the-string- conversions-for-lexical_cast idea is used. Should the serialization be done in a separate header? BTW, how is a custom primitive type handled? Is it just a byte-level save/load of memory? (What if the type isn't POD, like this one?)

...

- What is your evaluation of the documentation?

It's adequate, but maybe detailed Doxygen information on each item should be added.

...

- What is your evaluation of the potential usefulness of the library?

It can be useful for (temporary) IDs. We have to make sure that the type, especially its object representation, doesn't get heavy (i.e. stay quasi-POD).

...

- Did you try to use the library? With what compiler? Did you have any problems?

I used it on Mac OS X 10.4 (PowerPC) with XCode. All the tests, and Paul Bristow's example, seemed to work.

...

- How much effort did you put into your evaluation? A glance? A quick reading? In-depth study?

Two days of reading and trying out the code. Not too much of the theory, though.

...

- Are you knowledgeable about the problem domain?

Not really. -- Daryle Walker Mac, Internet, and Video Game Junkie darylew AT hotmail DOT com

Andy Tompkins

5:17 p.m.

New subject: [Boost-announce] [Review] UUID library (mini-)review starts today, November 23rd

On Tue, 9 Dec 2008 03:34:01 -0500, "Daryle Walker" <darylew@hotmail.com> said:

...

[Sorry for the lateness. Haven't looked at any other reviews yet.]

On Nov 23, 2008, at 7:13 PM, Hartmut Kaiser wrote:

...
The mini-review of Andy Tompkins UUID library starts today, November 23rd 2008, and will end on November 30th. I really hope to see your vote and your participation in the discussions on the Boost mailing lists!

The library can be downloaded from the vault here (it's the file uuid_v13.zip in the topmost directory): http://tinyurl.com/6xx28b

The initial review of the UUID library ended with a provisional acceptance (read here: http://article.gmane.org/gmane.comp.lib.boost.user/ 27774/).

This mini review is meant to allow for a final decision after looking at the changed parts of the library. Here is a list of things fixed and changed:

- fixed the licensing issues revealed by the initial review - fixed the security vulnerability revealed by the initial review - renamed to uuid, moved all classes, functions, etc. to namespace boost::uuids - implemented sha1 hash function (thus no license problem)

Others, including myself, are working on encoding libraries. If this library, and an encoding library are added, we could change the implementation to use the new encoding library.

Yes, I would gladly change the uuid library to use the new encoding library.

...

...
- new class basic_uuid_generator to create random-based uuids. The random number generator is no longer hard coded. It can use any random number generator, default is boost::mt19937 - implemented a good seed function for random number generators - all functions are now reentrant, all classes are as thread safe as an int and the library is no longer dependent on Boost.Thread

---------------------------------------------------

Please always state in your review, whether you think the library should be accepted as a Boost library!

Yes, please accept it.

...
Additionally please consider giving feedback on the following general topics:

- What is your evaluation of the design?

Why is there a container interface for inspecting the value's octets? For a similar problem in my code (the MD5 stuff in the Sandbox SVN), I defined a copying-out member function template that takes an output iterator and returns the updated version of same. You could do that, making sure to define some (compile-time) item indicating the length (16), to facilitate easier changes to internal storage.

This was a requested feature.

...

Don't some operating systems provide UUID generation? Could there be a way to add a creation function that uses the OS's code?

Yes, I will write generators that use OS code.

...

Is default construction useful besides when initialization can't be done in one step? Instead of an "is_null" member function, maybe (pseudo- )Boolean conversion (and "operator not") can be used.

The default constructor is unintuitive. I will likely remove it. I will add a 'boolean' conversion and possibility operator!.

...

I don't think the multitude of string conversion techniques are needed. Keep the constructor with "char const *" for pseudo- literals. Everything else, in either direction, could use "lexical_cast".

I agree, lexical_cast is good. I will remove the to_..._string functions. The constructor that takes a string will likely move to a generator.

...

...
- What is your evaluation of the implementation?

The "generator_iterator" substitute in "seed_rng.hpp" doesn't completely help. It is supposed to improve on the version in "boost/ generator_iterator.hpp" in the area of end-iterator support. I originally said more here, but I erased it because when I filed a ticket on this issue, I found one already (#2428) and put my notes there.

Thanks. Ticket #2428 was added in response to the uuid library by someone else.

...

The implementation of the constructor that takes a byte-returning input- iterator shows why we need a double-bounded copy algorithm. (Now in new ticket #2578.)

The copy constructor, destructor, and copy-assignment operator don't do anything different than the automatically-defined versions would. So they could be omitted.

Fair.

...

Maybe the "create_name_based" and "get_showbraces_index" member functions need to be in a new mandatory source file. If so, you could move some common error strings there too. BTW, why do "create" (and "create_name_based") need the length based in as a separate argument? Just use "std::strlen".

The create_name_based function will be moved to a generator. I added the length so that one could use it with any given object. But this is not what people want, so yes, it can just use std::strlen (as well as have overloads for std::basic_string<>).

...

The I/O function templates could be tightened up some. (I noticed it since I work on I/O quite a bit. It could be done later.) The optimization would be more important if my dump-most-of-the-string- conversions-for-lexical_cast idea is used.

Yes, the i/o functions need improving. I will likely need some help when the time comes.

...

Should the serialization be done in a separate header? BTW, how is a custom primitive type handled? Is it just a byte-level save/load of memory? (What if the type isn't POD, like this one?)

Yes, it is a byte-level save/load. My reason was so that exactly 128 bytes are saved/loaded.

...

...
- What is your evaluation of the documentation?

It's adequate, but maybe detailed Doxygen information on each item should be added.

Agreed.

...

...
- What is your evaluation of the potential usefulness of the library?

It can be useful for (temporary) IDs. We have to make sure that the type, especially its object representation, doesn't get heavy (i.e. stay quasi-POD).

Agreed.

...

...
- Did you try to use the library? With what compiler? Did you have any problems?

I used it on Mac OS X 10.4 (PowerPC) with XCode. All the tests, and Paul Bristow's example, seemed to work.

...
- How much effort did you put into your evaluation? A glance? A quick reading? In-depth study?

Two days of reading and trying out the code. Not too much of the theory, though.

...
- Are you knowledgeable about the problem domain?

Not really.

-- Daryle Walker Mac, Internet, and Video Game Junkie darylew AT hotmail DOT com

Thanks, Andy Tompkins

Daryle Walker

11:15 p.m.

New subject: [Boost-announce] [Review] UUID library (mini-)review starts today, November 23rd

On Dec 9, 2008, at 12:17 PM, Andy Tompkins wrote:

...

On Tue, 9 Dec 2008 03:34:01 -0500, "Daryle Walker" <darylew@hotmail.com> said:

...
[SNIP]

...
Is default construction useful besides when initialization can't be done in one step? Instead of an "is_null" member function, maybe (pseudo- )Boolean conversion (and "operator not") can be used.

The default constructor is unintuitive. I will likely remove it. I will add a 'boolean' conversion and possibility operator!.

...
I don't think the multitude of string conversion techniques are needed. Keep the constructor with "char const *" for pseudo- literals. Everything else, in either direction, could use "lexical_cast".

I agree, lexical_cast is good. I will remove the to_..._string functions. The constructor that takes a string will likely move to a generator. [SNIP]

So, there won't be any constructors besides the constructor template that takes an input iterator for sixteen octet reads (and an implicit copy constructor)? You could make UUID a POD type if you could move that constructor too. (It could probably be moved to a creation function, done as a static member function for UUID. The Nil-UUID can be implemented this way too.)

...

...
Should the serialization be done in a separate header? BTW, how is a custom primitive type handled? Is it just a byte-level save/load of memory? (What if the type isn't POD, like this one?)

Yes, it is a byte-level save/load. My reason was so that exactly 128 bytes are saved/loaded.

[TRUNCATE] You mean 128 bits, or 16 octets. An alternative I used, for a MD5 bit-queue, was to group the bits into sextets (6-packs), map that to a base-64 string, and serialize that string. For 128 bits, you get 21 sextets and 2 remaining bits, for a string of 22 characters. -- Daryle Walker Mac, Internet, and Video Game Junkie darylew AT hotmail DOT com

Andy Tompkins

10 Dec 10 Dec

6:29 p.m.

New subject: [Boost-announce] [Review] UUID library (mini-)review starts today, November 23rd

On Tue, 9 Dec 2008 18:15:44 -0500, "Daryle Walker" <darylew@hotmail.com> said:

...

On Dec 9, 2008, at 12:17 PM, Andy Tompkins wrote:

...
On Tue, 9 Dec 2008 03:34:01 -0500, "Daryle Walker" <darylew@hotmail.com> said:

...
[SNIP]

...
Is default construction useful besides when initialization can't be done in one step? Instead of an "is_null" member function, maybe (pseudo- )Boolean conversion (and "operator not") can be used.

The default constructor is unintuitive. I will likely remove it. I will add a 'boolean' conversion and possibility operator!.

...
I don't think the multitude of string conversion techniques are needed. Keep the constructor with "char const *" for pseudo- literals. Everything else, in either direction, could use "lexical_cast".

I agree, lexical_cast is good. I will remove the to_..._string functions. The constructor that takes a string will likely move to a generator. [SNIP]

So, there won't be any constructors besides the constructor template that takes an input iterator for sixteen octet reads (and an implicit copy constructor)? You could make UUID a POD type if you could move that constructor too. (It could probably be moved to a creation function, done as a static member function for UUID. The Nil-UUID can be implemented this way too.)

Yes, all the user defined constructors could be moved into generators / static functions / free functions. The uuid class could be a POD. What are the benefits of making it a POD? I don't yet understand the advantages.

...

...
...
Should the serialization be done in a separate header? BTW, how is a custom primitive type handled? Is it just a byte-level save/load of memory? (What if the type isn't POD, like this one?)

Yes, it is a byte-level save/load. My reason was so that exactly 128 bytes are saved/loaded. [TRUNCATE]

You mean 128 bits, or 16 octets. An alternative I used, for a MD5 bit- queue, was to group the bits into sextets (6-packs), map that to a base- 64 string, and serialize that string. For 128 bits, you get 21 sextets and 2 remaining bits, for a string of 22 characters.

Yes, thanks, I mean 128 bits or 16 octets. One will be able to use lexical_cast to convert a uuid to/from a string. I can also see the usefulness to be able to convert to/from a base-64 string, a single integer value, or other types of binary to text encoding (like ascii85), but I'm not sure these are in the scope of uuid.

...

-- Daryle Walker Mac, Internet, and Video Game Junkie darylew AT hotmail DOT com

Regards, Andy Tompkins

6100

Age (days ago)

6116

Last active (days ago)

List overview

Download

39 comments

14 participants

participants (14)

Agustín K-ballo Bergé
Andy Tompkins
Christian Henning
Christian Holmquist
Daniel Krügler
Daryle Walker
Dave Jenkins
Hartmut Kaiser
muhammadchang
Rodrigo Madera
Roman Perepelitsa
Scott McMurray
Steven Watanabe
Tim Blechmann