regular expression handling the chinese string

G

guozhen

hey.I got a problem
I want to parse the string below with regular expression
¡°1¡¢¾«Í¨»ùÓÚJ2EE¼Ü¹¹µÄJavaºËÐÄÓï·¨¡¢Java±à³Ì£¬3ÄêÒÔÉÏÈí¼þ¿ª·¢»òÅàѵÏà¹Ø
¹¤×÷¾­ÑéÕߣ»<?xml:namespace prefix = o ns =
"urn:schemas-microsoft-com:blush:ffice:blush:ffice" /><o:p></o:p></SPAN></P>
<P class=MsoNormal style="MARGIN: 0cm 0cm 0pt; WORD-BREAK: break-all;
LINE-HEIGHT: 150%; TEXT-ALIGN: left; mso-pagination: widow-orphan"
align=left><SPAN lang=EN-US style="FONT-SIZE: 9pt; COLOR: black;
FONT-FAMILY: ËÎÌå; mso-font-kerning: 0pt; mso-bidi-font-family: ËÎÌå">
2¡¢¾ßÓÐ2ÄêÒÔÉÏÏîÄ¿¹ÜÀí¾­Ñ飬¾ßÓнÏÇ¿µÄÓïÑÔ¹µÍ¨ÄÜÁ¦¡±
the result I want is ¡°¾«Í¨»ùÓÚJ2EE¼Ü¹¹µÄJavaºËÐÄÓï·¨¡¢Java±à³Ì£¬3ÄêÒÔÉÏ
Èí¼þ¿ª·¢»òÅàѵÏà¹Ø¹¤×÷¾­ÑéÕß¡±and ¡°¾ßÓÐ2ÄêÒÔÉÏÏîÄ¿¹ÜÀí¾­Ñ飬¾ßÓнÏÇ¿µÄÓï
ÑÔ¹µÍ¨ÄÜÁ¦¡± two strings
so I have to do the filtering and cutting into the string ,I want to
filter the string without the html tags and the other chars.
what should I do?
 
M

Mihai N.

I want to parse the string below with regular expression
....
so I have to do the filtering and cutting into the string ,I want to
filter the string without the html tags and the other chars.
what should I do?
If you want to do complex processing of generic html files
(not one single format, always the same), then regex is
not strong enough, Chinese or not.
 

Ask a Question

Want to reply to this thread or ask your own question?

You'll need to choose a username for the site, which only take a couple of moments. After that, you can post your question and our members will help you out.

Ask a Question

Top