[xpressive] multiple matches for a group
data:image/s3,"s3://crabby-images/7b2ed/7b2ed578b3e996d414df9531b2aab8484a0dcb34" alt=""
I'm attempting to learn xpressive by writing a simple IRC regex. The IRC protocol goes like: :prefix numericreplyorcommandstring param1 param2 :trailing param The regex looks like this (ignore comments and newlines): ^ (?::(\S+)\s)? // prefix (?:(\d+)|(\S+)) // command (?:\s([^:]\S*))* // parameters (?:\s:(.+))? // trailing parameter $ And my static xpressive looks like this: bos >> optional(':' >> (s1=+~_s) >> _s) >> // prefix ((s2=+_d)|(s3=+~_s)) >> // command *(_s >> (s4 = ~as_xpr(':') >> *~_s)) >> // parameters optional(_s >> ':' >> (s5=+_)) >> // trailing parameter eos I'm not sure how to retrieve the multiple matches for s4 - any help? -- Cory Nelson http://www.int64.org
data:image/s3,"s3://crabby-images/459b0/459b05c510e36271c5487efcfc0bde5e3554adf1" alt=""
Cory Nelson wrote:
I'm attempting to learn xpressive by writing a simple IRC regex.
The IRC protocol goes like: :prefix numericreplyorcommandstring param1 param2 :trailing param
The regex looks like this (ignore comments and newlines): ^ (?::(\S+)\s)? // prefix (?:(\d+)|(\S+)) // command (?:\s([^:]\S*))* // parameters (?:\s:(.+))? // trailing parameter $
And my static xpressive looks like this: bos >> optional(':' >> (s1=+~_s) >> _s) >> // prefix ((s2=+_d)|(s3=+~_s)) >> // command *(_s >> (s4 = ~as_xpr(':') >> *~_s)) >> // parameters optional(_s >> ':' >> (s5=+_)) >> // trailing parameter eos
I'm not sure how to retrieve the multiple matches for s4 - any help?
In this regard, xpressive behaves like Perl and like Boost.Regex. For captures in repeats, only the last iteration is remembered. If you want to remember all the places s4 matches, you should use a nested regex instead: sregex param = ~as_xpr(':') >> keep(*~_s); sregex protocol = bos >> optional(':' >> (s1=+~_s) >> _s) >> // prefix ((s2=+_d)|(s3=+~_s)) >> // command *(_s >> parameter) >> // parameters optional(_s >> ':' >> (s5=+_)) >> // trailing parameter eos; Now, when you match "protocol", the match_results struct will contain nested matches for the "parameter" regex, which you can access with match_results<>::nested_results(). (You have nested quantifiers in your pattern. I tried to make it more efficient with the keep() directive.) HTH, -- Eric Niebler Boost Consulting www.boost-consulting.com
data:image/s3,"s3://crabby-images/39fcf/39fcfc187412ebdb0bd6271af149c9a83d2cb117" alt=""
In this regard, xpressive behaves like Perl and like Boost.Regex. For captures in repeats, only the last iteration is remembered.
Minor correction (and advertisement!) Boost.Regex has optional support for repeated captures: presenting all the matches found as a list rather than just the last found. John.
participants (3)
-
Cory Nelson
-
Eric Niebler
-
John Maddock