How to create hyperlink to a caption for foérm Word exported HTML

J

Jan Kratochvil

Now I am staying in front of this problem:

I have a task to create from a large Word document a HTML document with all
the links which are now working in normal DOC file, e. g. cross-referneces
and links to the image captions (through cross-references).

AFAIK it is not posible, if I save the DOC file as HTML or HTM file from
Word, to get functional cross-reference. Cross-references are lost in taht
file.

1/ So I must go now trough the whole document and compare the functional
cross-reference links and replace them in the HTML file with HYPERLINKs.

2/ And what about captions - they was in DOC as well functional through
cross-references?

I do not know how to create in a Word created HTML file the HYPERLINKs for
capptions.

One possibility, I can imagen would be to create the hyperlinks in external
HTML editor like Dremaweaver, but it wil be troublesome and it takes much
time.
 
B

Bob Buckland ?:-\)

Hi Jan,

As Stefan mentioned in the reply to your previous message, Word fields that use a built in hyperlink switch ( \h) don't necessarily
retain the link behavior when converting to HTML. This hasn't really changed much since Word started supporting conversion to HTML
in Word v6, other than the Table of Contents field supporting the linking.

You can select a cross ref in the document and apply a Word hyperlink field to to it if you have a bookmark or heading as a target,
or but part of what you're working on may depend on how much the web page is supposed to look like the Word document, it may hold a
few surprises for you in that area as well.

It may be possible to use a macro (Stefan or one of the other folks may be able to help with tackling that) to convert the xrefs to
hyperlinks.

Another approach that may work for you is to use the Office 2007 save as PDF feature and then a 3rd party PDF-to-HTML converter, if
your project doesn't let you use PDF. The PDF add-in will keep the xrefs.

You may need to try a few different ones to see if you can find one that supports all of the formatting you need. Many of the ones
I tried some time ago didn't do particularly well . A number of the simpler ones, such as the one built in to G-mail, or Adobe's
email (e-mail address removed) converter will strip out graphics, as these tools were in part designed for the use with screen reading
software.

There are also Doc to HTML converters you may want to try as well to see if you can find one that would eliminate the PDF 'middle'
step.

Another possibility would be to try working with Save to XML in Word and apply an XML2HTML transform that would need to be written
to be able to handle {NoteRef} fields from Word.

You mentioned DreamWeaver. If I recall it does offer a .DOC to HMTL conversion capability.

===================

Now I am staying in front of this problem:

I have a task to create from a large Word document a HTML document with all
the links which are now working in normal DOC file, e. g. cross-referneces
and links to the image captions (through cross-references).

AFAIK it is not posible, if I save the DOC file as HTML or HTM file from
Word, to get functional cross-reference. Cross-references are lost in taht
file.

1/ So I must go now trough the whole document and compare the functional
cross-reference links and replace them in the HTML file with HYPERLINKs.

2/ And what about captions - they was in DOC as well functional through
cross-references?

I do not know how to create in a Word created HTML file the HYPERLINKs for
capptions.

One possibility, I can imagen would be to create the hyperlinks in external
HTML editor like Dremaweaver, but it wil be troublesome and it takes much
time.


--
Regards
Jan Kratochvil
WIN XP Pro SP2, Office 2007 SP 1 >> >>
--

Bob Buckland ?:)
MS Office System Products MVP

*Courtesy is not expensive and can pay big dividends*
 
J

Jan Kratochvil

Hi Bob

Thank you for your large and benefitial answer for mz question.
I use the DOC as PDF file and now we need to have the DOC in HTML file as
well.

Problem is, that I did not know, the Word HTML file does not support
conversion from cross-reference to hyperlink.

What I did now, I manually changed the cross-references to hyperlink.
The problem is now with the image caption already. But I have not time and
power to change hundreds of not active references to the active hyperlinks
manually.

I tried befor the conversion from PDF to HTML too, but without success.
Through the Adobe.com conversion was the conversion failed and with my Adobe
Acrobat 6 Pro was the conversion unusable.

Do you think is it for the future better to use hyperlinks instead of
cross-references? I mean for links to headings, bookmarks and so on.

Thank you again

P.S. The Dreamweaver MX can clean up the Word HTML, but not open a DOC file.
But the clean up function is good too, thanks for that.
 

Ask a Question

Want to reply to this thread or ask your own question?

You'll need to choose a username for the site, which only take a couple of moments. After that, you can post your question and our members will help you out.

Ask a Question

Top