boost infrastructure questions

For quite a while people have been discussing various enhancements to the boost infrastructure. What are these projects up to ? Notably:
* Are there still plans to move to subversion ? If so, what is missing ?
* What is the status of the move to boost.build version 2 ?
* Are there still plans to 'officially' use buildbot to coordinate builds, test runs, and checkins ?
I find it quite hard to interpret the regression test matrix since it isn't obvious which test runs are really in sync with the latest revision (or any revision, for that matter), and because there doesn't appear to be an obvious way to distinguish real test failures from failed prerequisites (incl. configuration errors, insufficient resources, etc.)
Thanks, Stefan

On Dec 5, 2005, at 10:23 AM, Stefan Seefeld wrote:
For quite a while people have been discussing various enhancements to the boost infrastructure. What are these projects up to ? Notably:
* Are there still plans to move to subversion ? If so, what is missing ?
Yes. Nothing is "missing" per se, but the people driving the move to Subversion have been busy with other things. Once the dust has settled from 1.33.1, we'll have a perfect opportunity to make the Subversion switch. Doug

Stefan Seefeld <seefeld@sympatico.ca> writes:
For quite a while people have been discussing various enhancements to the boost infrastructure. What are these projects up to ? Notably:
* Are there still plans to move to subversion ? If so, what is missing ? * What is the status of the move to boost.build version 2 ?
Those two projects are on hold until 1.33.1 is out, in some sense. I hope we'll start with the BBv2 transition on the main trunk immediately after the 1.33.1 release.
* Are there still plans to 'officially' use buildbot to coordinate builds, test runs, and checkins ?
I don't know if that ever got to the point of being an "official" plan. Rene was looking into it.
I find it quite hard to interpret the regression test matrix since it isn't obvious which test runs are really in sync with the latest revision (or any revision, for that matter), and because there doesn't appear to be an obvious way to distinguish real test failures from failed prerequisites (incl. configuration errors, insufficient resources, etc.)
I don't think BuildBot fixes that anyway. Would you care to suggest some specific test matrix improvements (on the boost-testing list, preferably)?
-- Dave Abrahams
Boost Consulting
www.boost-consulting.com

David Abrahams wrote:
Stefan Seefeld <seefeld@sympatico.ca> writes:
For quite a while people have been discussing various enhancements to the boost infrastructure. What are these projects up to ? Notably:
* Are there still plans to move to subversion ? If so, what is missing ? * What is the status of the move to boost.build version 2 ?
Those two projects are on hold until 1.33.1 is out, in some sense. I hope we'll start with the BBv2 transition on the main trunk immediately after the 1.33.1 release.
Although some work has been getting done on that, in the form of performance improvements by me and Volodya, and of course the autoconfiguration, PCH support, etc. I think we need help in creating and testing the various toolsets.
* Are there still plans to 'officially' use buildbot to coordinate builds, test runs, and checkins ?
I don't know if that ever got to the point of being an "official" plan. Rene was looking into it.
They have been mostly on hold because I've been spending most of my free time on bjam/BB (code and docs), and on website restructuring.
-- Grafik - Don't Assume Anything
-- Redshift Software, Inc. - http://redshift-software.com
-- rrivera/acm.org - grafik/redshift-software.com
-- 102708583/icq - grafikrobot/aim - Grafik/jabber.org

David Abrahams wrote:
I find it quite hard to interpret the regression test matrix since it isn't obvious which test runs are really in sync with the latest revision (or any revision, for that matter), and because there doesn't appear to be an obvious way to distinguish real test failures from failed prerequisites (incl. configuration errors, insufficient resources, etc.)
I don't think BuildBot fixes that anyway. Would you care to suggest some specific test matrix improvements (on the boost-testing list, preferably)?
Ok, I'll cross-post (the boost-testing page suggests that questions relating to the boost.test framework should go to the main list).

Test runs on the test matrix displayed at http://engineering.meta-comm.com are annotated by date, not revisions. Switching to subversion should fix that, or rather, enable the infrastructure to pull out a single revision number for each checkout that is used in the test run.

Another issue is about result interpretation and result interdependencies. (I was reminded of this issue by the recent mingw failure that issued a 'fail' for each single test, instead of simply flagging the whole test run as 'untested' because some precondition (environment, configuration, whatever) wasn't met.) In a similar vein, people have reported in the past that failures of tests were caused by the disk being full or similar. That, too, may be caught upfront, or otherwise be flagged not as a test failure but as some external precondition failure.

What would it take in the current testing framework to enhance that ?

Regards, Stefan
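[For illustration only: a minimal sketch, assuming the tree is in Subversion, of how a test-run script might record the exact revision of the checkout it actually tested, so the matrix could be annotated by revision rather than by date. The helper name and the way it would be wired into the Boost regression scripts are hypothetical; only the standard 'svnversion' command is assumed.]

# Hypothetical helper, not part of the existing Boost regression scripts.
import subprocess

def checkout_revision(checkout_dir):
    """Return the revision string of a Subversion working copy, e.g. '31942'.

    'svnversion' prints a mixed-revision string such as '31940:31942M'
    when the working copy is not uniformly updated, which usefully makes
    that situation visible in the report as well.
    """
    return subprocess.run(
        ["svnversion", checkout_dir],
        capture_output=True, text=True, check=True
    ).stdout.strip()

if __name__ == "__main__":
    rev = checkout_revision("boost")   # path of the checkout used for the run
    # The regression report could then carry this revision string next to
    # (or instead of) the run timestamp.
    print("testing revision", rev)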

Ok, I'll cross-post (the boost-testing page suggests that questions relating to the boost.test framework should go to the main list).
...
What would it take in the current testing framework to enhance that ?
I don't believe what you describe belongs to the Boost.Test realm. I think it's a Boost.Build/Boost testing scripts one. Gennadiy

Stefan Seefeld <seefeld@sympatico.ca> writes:
Ok, I'll cross-post (the boost-testing page suggests that questions relating to the boost.test framework should go to the main list).
Hmm, a bit of explanation: Boost.Test is a _C++ library_ that "provides a matched set of components for writing test programs, organizing tests into simple test cases and test suites, and controlling their runtime execution". While I can see how the last part could create an impression that Boost.Test covers the infrastructure behind the regression results on the website, it doesn't. "Boost testing infrastructure" would be a more accurate reference to the whole thing, and that's exactly what this list is dedicated to.
Test runs on the test matrix displayed at http://engineering.meta-comm.com are annotated by date, not revisions. Switching to subversion should fix that, or rather, enable the infrastructure to pull out a single revision number for each checkout that is used in the test run.
FWIW, you can still get any particular file's revision by the date. Why is it an issue, anyway?
Another issue is about result interpretation and result interdependencies. (I was reminded of this issue by the recent mingw failure that issued a 'fail' for each single test, instead of simply flagging the whole test run as 'untested' because some precondition (environment, configuration, whatever) wasn't met.)
Personally, I don't feel that this happens often enough to warrant a special effort, but if there is a consensus that it's a real issue, sure, we could set up a mechanism for manually marking up the whole set of results like you suggest above. Meanwhile, there is always an option of taking the results out of the FTP.
In a similar vein, people have reported in the past that failures of tests were caused by the disk being full or similar. That, too, may be caught upfront,
Not really...
or otherwise be flagged not as a test failure but some external precondition failure.
... but sure.
What would it take in the current testing framework to enhance that ?
In case with "disk full" situation, I guess it's easy enough to detect and flag it automatically. Boost.Build could even terminate the build/test process as soon as it discovered the first failure due to a lack of space. In case of other environment/configuration problems, well, it depends on how exactly do you envision this and what we eventually agree upon. -- Aleksey Gurtovoy MetaCommunications Engineering

Aleksey Gurtovoy wrote:
Stefan Seefeld <seefeld@sympatico.ca> writes:
Ok, I'll cross-post (the boost-testing page suggests that questions relating to the boost.test framework should go to the main list).
Hmm, a bit of explanation: Boost.Test is a _C++ library_ that "provides a matched set of components for writing test programs, organizing tests into simple test cases and test suites, and controlling their runtime execution". While I can see how the last part could create an impression that Boost.Test covers the infrastructure behind the regression results on the website, it doesn't. "Boost testing infrastructure" would be a more accurate reference to the whole thing, and that's exactly what this list is dedicated to.
I see. In my mind, testing large C++ projects has at least two aspects: on one hand, an API that the to-be-tested C++ code uses to formalize and unify fine-grained tests and associated (log) output; on the other hand, the infrastructure to actually run the tests, manage results, etc. Both parts should (IMO) not be mixed, for a variety of reasons (robustness, for example).
Test runs on the test matrix displayed at http://engineering.meta-comm.com are annotated by date, not revisions. Switching to subversion should fix that, or rather, enable the infrastructure to pull out a single revision number for each checkout that is used in the test run.
FWIW, you can still get any particular file's revision by the date. Why is it an issue, anyway?
Because from looking at the online test matrix I can't identify what file revision(s) was/were used for a given test run, and thus, whether the failure I see is still there after my checkin, or whether it just hasn't been updated since.
Another issue is about result interpretation and result interdependencies. (I was reminded of this issue by the recent mingw failure that issued a 'fail' for each single test, instead of simply flagging the whole test run as 'untested' because some precondition (environment, configuration, whatever) wasn't met.)
Personally, I don't feel that this happens often enough to warrant a special effort, but if there is a consensus that it's a real issue, sure, we could set up a mechanism for manually marking up the whole set of results like you suggest above. Meanwhile, there is always an option of taking the results out of the FTP.
I'm not suggesting any manual intervention, but a mechanism to catch such a situation before the actual test run is executed, i.e. as a precondition.
In a similar vein, people have reported in the past that failures of tests were caused by the disk being full or similar. That, too, may be caught upfront,
Not really...
Ok, not in the general sense. Maybe you do want to test how a particular piece of code handles 'no space left on device' errors...
or otherwise be flagged not as a test failure but some external precondition failure.
... but sure.
What would it take in the current testing framework to enhance that ?
In case with "disk full" situation, I guess it's easy enough to detect and flag it automatically. Boost.Build could even terminate the build/test process as soon as it discovered the first failure due to a lack of space.
In the case of other environment/configuration problems, well, it depends on how exactly you envision this and what we eventually agree upon.
At this point I am only wondering what infrastructure is available to deal with preconditions / requirements. I'm developing and maintaining the QMTest test automation tool (http://www.codesourcery.com/qmtest), and there we provide means specifically to declare tests as dependent on 'resources' that need to be set up first; if that setup fails, all dependent tests are marked as 'untested'. It is thus easy to conceive of a variety of resources that prepare the testing environment reliably and efficiently.
Regards, Stefan
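[A minimal sketch of the resource/precondition pattern described above. This is only an illustration of the idea, not QMTest's actual API; the class names and the example 'CompilerAvailable' resource are hypothetical. Tests declare the resources they depend on, and if a resource fails to set up, every dependent test is reported as 'untested' rather than 'failed'.]

class Resource:
    """A precondition shared by several tests (compiler present, disk space, ...)."""
    def set_up(self):
        raise NotImplementedError

class CompilerAvailable(Resource):
    """Hypothetical example resource: require g++ on the PATH."""
    def set_up(self):
        import shutil
        if shutil.which("g++") is None:
            raise RuntimeError("g++ not found on PATH")

def run_suite(tests):
    """tests: list of (name, test_callable, [resources]) tuples."""
    results = {}
    resource_error = {}              # resource -> error message, or None if set up OK
    for name, test, resources in tests:
        blocked = None
        for r in resources:
            if r not in resource_error:
                try:
                    r.set_up()
                    resource_error[r] = None
                except Exception as e:
                    resource_error[r] = str(e)
            if resource_error[r] is not None:
                blocked = resource_error[r]
                break
        if blocked is not None:
            # Precondition failure: mark as untested, not failed.
            results[name] = ("UNTESTED", blocked)
        else:
            try:
                test()
                results[name] = ("PASS", "")
            except Exception as e:
                results[name] = ("FAIL", str(e))
    return results

if __name__ == "__main__":
    compiler = CompilerAvailable()
    suite = [
        ("trivial_pass", lambda: None, []),
        ("needs_compiler", lambda: None, [compiler]),
    ]
    for name, (status, reason) in run_suite(suite).items():
        print(name, status, reason)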
participants (6)
- Aleksey Gurtovoy
- David Abrahams
- Doug Gregor
- Gennadiy Rozental
- Rene Rivera
- Stefan Seefeld