Hello,
The html page is located at "
http://photography.nationalgeographic.com/photography/photo-of-the-day/north...
"
btw, when googling around for xpressive the search results pointed to
http://lists.boost.org/boost-users/2008/08/39761.php, which says xpressive
is not directly usable with utf-8. I did not find any examples of xpressive
with utf-8 strings.
does boost::regex with icu have answer to my question ? (i'm going over it
now)
Surya
On Thu, Oct 1, 2009 at 11:02 AM, OvermindDL1
On Wed, Sep 30, 2009 at 10:54 PM, Surya Kiran Gullapalli
wrote: Hello all,
I'm reading an html file and parsing data from it. I've encountered a spurious string (researcher’s). Notepad++ shows that string as researcher's .
I'm reading the file using std::fstream class. Is there any way i can read the string and get the spurious characters replaced using boost string/regex algorithms ?
It is not spurious, it is probably a UTF-8 file or something. Can you attach it, if so then we can confirm that.
But yes, you can do such a replacement with Boost.Regex, although I would recommend doing it with Boost.Xpressive instead (it even has an example doing such a replace in the docs, and if you use the static version it will run faster). _______________________________________________ Boost-users mailing list Boost-users@lists.boost.org http://lists.boost.org/mailman/listinfo.cgi/boost-users