What is all that stuff?

G

Guest

Hello. I am a newbie.
I've begun practicing by tagging some existing documents that I have
attached a schema to. If I save the document as a Word 2007 XML file, and
then open that file in Notepad, I see at least 100 lines of Word XML "stuff"
(Types, Targets, Packages, etc.). What is all this? How can I make it go
away? I'd like to be able to open a Word-produced XML file in Notepad, and
just see the XML file. What am I doing wrong?
Thanks much.
 
P

Peter Jamieson

What to do depends on exactly what you are hoping to end up with.

If you want to end up with a formatted Word document with your markup, all
in XML format, you must save the document either as a Word 2007 XML format
document (e.g. a .docx) or an older Word 2003 XML format document. Both
these formats store all the information needed to reconstitute the entire
Word document with formatting, styles and anything else you put in. The Word
2003 .xml format consists of a single file attached to the Word 2003
WordProcessingML schemas. The Word 2007 .docx format is a ZIP file that
contains a number of .xml format files which may be related in varius ways
(hence all that relationship stuff you probably see at the top of your XML
file). If you are only using Word 2007 you would typically be better off
using the Word 2007 formats.

If you just want to apply XML tags to your text and save as XML data only
(no Word formatting information) then you need to save in Word 2003 XML
Document format, and check the "Save data only" box in the Save As dialog.
There is no "Word 2007" equivalent of this format, as far as I know.

Peter Jamieson
 
G

Guest

I saved it as a Word 2003 XML file and got exactly what I wanted. You are a
good man.
 

Ask a Question

Want to reply to this thread or ask your own question?

You'll need to choose a username for the site, which only take a couple of moments. After that, you can post your question and our members will help you out.

Ask a Question

Top