Extract HTML + Reg Ex

Ori · Feb 11, 2004

Hi,

I have a HTML text which I need to parse in order to extract data from
it.

My html contain a table contains few rows and two columns. I want to
extract the data from the 2nd column in the most efficient way (using
Reg Ex.) either than using the "indexOf" function of String.

Thanks,

Ori.

Here is the HTML table:

<table BORDER="1" CELLSPACING="0" CELLPADDING="1">
<tr>
<td>Licensee Name</td>
<td BGCOLOR="#ffffcc">JOHN Doo</td>
</tr>
<tr>
<td><a HREF=>Primary Status</a></td>
<td BGCOLOR="#ffffcc">Data_To_Be_Extracted</td>
</tr>
<tr>
<td>License Number</td>
<td BGCOLOR="#ffffcc">Data_To_Be_Extracted</td>
</tr>
<tr>
<td><a >License Type</a></td>
<td BGCOLOR="#ffffcc">Data_To_Be_Extracted</td>
</tr>
<tr>
<td>Header</td>
<td BGCOLOR="#ffffcc">Data_To_Be_Extracted</td>
</tr>
<tr>
<td>Address</td>
<td BGCOLOR="#ffffcc">Data_To_Be_Extracted</td>
</tr>
<tr>
<td>City State State Zip </td>
<td BGCOLOR="#ffffcc">Data_To_Be_Extracted</td>
</tr>
</table>

Matthias Kwiedor · Feb 11, 2004

Hi!

Try this:

// First split the HTML into Table Lines
string[] arrLines = Regex.Split(strContent, @"<tr.*?>",
RegexOptions.IgnoreCase);

// Go through each line
forearch (string strLine in arrLines)
{
// Split into Rows Array
string[] strCol = Regex.Split(strLine, @"<td.*?>",
RegexOptions.IgnoreCase);
// Remove HTML Tags?
strCol[1] = Regex.Replace(strCol[1], @"<[^>]*>", "");
// second Column
MessageBox.Show(strCol[1]);
}

Hope thats what you want!

Greetings

Matthias

(e-mail address removed) (Ori) wrote in @posting.google.com:

Button not appearing in DataGrid column	1	Oct 27, 2011
How to set first column's property of header row?	4	Nov 29, 2005
Parsing some info out of HTML - can you help me please?	3	Nov 16, 2009
Repeater Problem	1	Feb 10, 2006
file download question in C# and asp.net	2	Dec 2, 2008
Regular Expression Help	15	Sep 17, 2007
External data from HTML document	2	Dec 12, 2009
Using regex in html code	6	May 23, 2007

Extract HTML + Reg Ex

Ori

Matthias Kwiedor

Ask a Question

Similar Threads