Parsing HTML to remove pictures and stylesheets

  • Thread starter Thread starter Seb
  • Start date Start date
S

Seb

Hello,

I am trying to find some object/function able to take an HTML page
(code) as an input, strip out all images, stylesheets and other
external references, and returns "cleaned" HTML only (without external
references) or a text-only version of the page.

Any ideas?

Thanks,
Seb
 

Ask a Question

Want to reply to this thread or ask your own question?

You'll need to choose a username for the site, which only take a couple of moments. After that, you can post your question and our members will help you out.

Ask a Question

Back
Top