
Hello, The html page is located at " http://photography.nationalgeographic.com/photography/photo-of-the-day/north... " btw, when googling around for xpressive the search results pointed to http://lists.boost.org/boost-users/2008/08/39761.php, which says xpressive is not directly usable with utf-8. I did not find any examples of xpressive with utf-8 strings. does boost::regex with icu have answer to my question ? (i'm going over it now) Surya On Thu, Oct 1, 2009 at 11:02 AM, OvermindDL1 <overminddl1@gmail.com> wrote:
On Wed, Sep 30, 2009 at 10:54 PM, Surya Kiran Gullapalli <suryakiran.gullapalli@gmail.com> wrote:
Hello all,
I'm reading an html file and parsing data from it. I've encountered a spurious string (researcher’s). Notepad++ shows that string as researcher's .
I'm reading the file using std::fstream class. Is there any way i can read the string and get the spurious characters replaced using boost string/regex algorithms ?
It is not spurious, it is probably a UTF-8 file or something. Can you attach it, if so then we can confirm that.
But yes, you can do such a replacement with Boost.Regex, although I would recommend doing it with Boost.Xpressive instead (it even has an example doing such a replace in the docs, and if you use the static version it will run faster). _______________________________________________ Boost-users mailing list Boost-users@lists.boost.org http://lists.boost.org/mailman/listinfo.cgi/boost-users