----- Original Message -----
From: "PaulH" <>
Newsgroups: microsoft.public.word.docmanagement
Sent: Saturday, December 13, 2003 12:11 PM
Subject: Re: word 2002 - Office XP and the HTML Filter/Save as HTML
Hi Shauna
I save many documents "as html" in Word 97, then edit/apply templates etc in
Dreamweaver before uploading them as small as possible in file size to the
intranet >server in our organisation.
With anybody who has looked at html content this just doesn't make sense?
Why not just save the page using either Dreamweaver or Internet Explorer?
No Word bloat would be added in either of those processes.
We are moving onto Windows XP and Office XP (and Dreamweaver MX) but my
concern is that we did a test: We opened a document in Word XP and did
"Save >as HTML" and the htm file is a lot larger than Word 97 used to create
with a lot of >extra tags in the htm code.
did you turn OFF the image embedding which is accomplished with VML? (VML
was not available with Word 97.)
Word is going to add bloat and unnecessary markup, no matter how many
features you turn off. Word was never intended to create cross-compliant
html, rather to create webpages with out knowledge of html (or conseqences)
while assuring return of the webpage to Word's original document.
You mean besides taking Word off the computer you use to create webpages ;-)
TIC
STOP creating web pages with Word is the simpliest solution.
Is there a way we can configure Word XP (and for that matter >Excel XP I
guess) to >make a smaller more basic output?
There are two general exceptions which create bloat and larger webpages.
1) NEVER cut and/or copy and paste from Word to any HTML document.
2) Turn OFF embedding of images.
The two steps above will not eliminate the bloated html caused by Word,
however they will drastcially reduce the bloat.
I thought we could do a serach and replace to remove tags using a macro,
but our >technical team says no because Word XP does not allow to see the
htm code in >Word (apparently it appears in a script editor instead?!?)
Your technician hasn't exactly told a whole truth :-(
Rather, Word does not create pages in any method which cannot be viewed with
NotePad or any other text editor. (unless you add ASP or PHP or perhaps SQL,
Flash or any other extravagant media.)
Word however does add so much bloat and confusion to compliant html that it
appears unrecognizable to many who create webpages. In the process, it would
require MUCH more time to repair the bloated html than it would to create
new pages from scratch. (Most webmasters will not even attempt to correct
pages which they have been condemming as created by Front Page [actually
Word ] )
Is your technician inclined to design webpages for you? Is that part of his
position responibilities?
Would appreciate any advice or direction to look for help.
Save your pages with either Internet Explorer (NOT with the MHT option, Web
Archive,) or DreamWeaver.
Then afterwards editing the saved webpage with either DreamWeaver or NotePad
or any standard text editor.
Hello again Paul,
I neglected to mention in the previous mail
that I am NOT "Shauna."
Although I despise tearing apart documents? You have a variety of questions
which require that method. (see above)