Jack Brewster said:
I think the point, which has been lost in the thread, is that Terry _wants_
the contents of this Word file in HTML format, not in PDF format.
Correct! I don't want to *link* to such files - I want to use their
content within the website, on 'standard' page. With consistent colour
scheme, font, header styles, etc.
My apologies for not catching that one earlier, Terry.
No problem, I *thought* there had to be some misunderstanding!
As for the Office Filter, I can't say. When I used it I had really great
results. You may need to play with the options a bit. Maybe you haven't
set it to remove all the nasty bits yet?
You're right, many thanks! I've just had another crack at it, this
time enabling *all* options, not just the top set, and it worked OK.
For others who end up googling here with a similar problem, the filter
has one set of options at the top enabled by default, then the
following set which are initially disabled:
Use VML for displaying graphics
Remove standard CSS
Remove all STYLE elements
Remove standard @rule constructs
By trial/error I established that the critical option to enable (which
maybe should have been obvious to me) was 'Remove standard CSS'. In
this case, that gave me a 42 KB file from the original 174 KB HTM
file. Enabling all 4 of that lower set gave the same size.
For background (and possibly to assist with ongoing queries) here are
more details:
This temporary (unfinished) page
http://www.cupod-mentoring.com/swprojectdoc_copy.htm
shows the page content I tediously created the way I described
earlier, e.g. pasting into each cell of table from original Word
table, etc. I have yet to continue that process. Need to master it, as
in many ways it's the preferred way I'd like to work, as it lets me
add content to an existing formatted page. Next time will try your and
Stefan's recommendations about the copy/paste procedure. Anyway, that
page gives you an idea of what result I want.
The original Word doc is here:
http://www.cupod-mentoring.com/simonoriginal.doc
I used File|Save As in Word 2000 to convert that to an HTML file,
which I've copied to the site for the time being:
http://www.cupod-mentoring.com/simonoriginal-save1.htm
(I changed its name, as the Office Filter overwrites same name later.)
Opening that shows the flawed result I described, i.e. left part of
text cut off in places.
BTW, I note that opening the HTML file creates a subfolder
'simonoriginal-save1_files', containing 2 files: filelist.xml and
header.htm. Not entirely sure what if any impact this has on FP - for
time being I've cheerfully ignored it!
I now run Office HTML Filter 2.0 and use Add to select
simonoriginal.htm (identical to simonoriginal-save1.htm), then Apply,
and finally Close. BTW, I see there's no indication when it's
finished? However, if you watch the folder, it's signaled by the
appearance of filename.bak. Happily, that filtered result now looks
fairly good. Some work needed on headers, but the left cut-off is
cured.
Of course, that gives me a plain, separate page. So next step is to
work out how to get content on a standard page, like my manually
prepared example.
Thanks for sticking with me on this!