regex problem

japi · Jul 29, 2006

Hi,

As a regex beginner, I am having a little trouble here.

Suppose I want to parse the following HTML fragment:

<li>
a
</li>
<li>
b
</li>

I would like a regular expression that matches each li start tag and
its corresponding end tag, including the inner text.

When I use the following regex (in SingleLine mode), it matches from the
first <li> to the last </li> tag, so it returns only one match instead
of two.

So

<li>(?<itemcontents>.*)</li>

seems to match the following as a whole (which is not my intention):

<li>
a
</li>
<li>
b
</li>

I hope my problem is clear and that somebody here can help me.

Thanks
Jaap

Markus Stoeger · Jul 29, 2006

japi said:
When I use the following regex (in SingleLine mode), it matches from the
first <li> to the last </li> tag, so it returns only one match instead
of two.

So

<li>(?<itemcontents>.*)</li>

Use .*? instead of .*

The ? makes the quantifier lazy (without it, the quantifier is greedy).
The difference is that a lazy .*? consumes as little as possible, so it
stops at the _next_ </li> that lets the pattern match, while a greedy .*
consumes as much as possible, so it runs up to the _last_ </li>.
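
To see the difference side by side, here is a minimal sketch. It assumes Python's re module rather than the regex engine used in the question, so two details are substitutions: re.DOTALL stands in for SingleLine mode, and the named group is written (?P<itemcontents>...) instead of (?<itemcontents>...). The variable names are only for illustration.

```python
import re

# The HTML fragment from the question.
html = "<li>\na\n</li>\n<li>\nb\n</li>\n"

# Greedy: .* runs to the last </li>, so the whole fragment is one match.
greedy = re.findall(r"<li>(?P<itemcontents>.*)</li>", html, flags=re.DOTALL)

# Lazy: .*? stops at the first </li> that lets the pattern succeed,
# so each list item becomes its own match.
lazy = re.findall(r"<li>(?P<itemcontents>.*?)</li>", html, flags=re.DOTALL)

print(len(greedy))                      # 1
print([item.strip() for item in lazy])  # ['a', 'b']
```

The lazy version works here because the list items are not nested. For nested or irregular markup, an HTML parser is usually the safer choice.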

hth,
Max

japi · Jul 29, 2006

It works like a charm!

Thank you very much Markus!
