
Craig Henderson wrote:
I have considerably revised and updated my MapReduce library and committed the changes to the Sandbox. I've also uploaded to the Vault for easy access
I am very interested to hear any comment on design, code, performance or any other area.
Hi Craig, Quoting from the start of your docs: "The Boost.MapReduce library is a MapReduce implementation across a plurality of CPU cores rather than machines." Isn't that rather missing the point of what MapReduce is supposed to be about? If I'm limited to one machine, I can write parallel code using the full repertoire of techniques. By re-designing my application to fit into the MapReduce pattern I can potentially scale it over multiple machines. But if I can't scale over multiple machines, why bother? Are you planning to support scaling over multiple machines in the future? Regards, Phil.