From: <su...@mx...> - 2004-06-21 08:00:03
|
> But can MS Office XML files be re-imported losslessly back into MS=20 > Office? > That's the question. the text i quoted included the following sentence: >> WordprocessingML is a lossless format, which means that it contains=20= >> all the information that Word needs to re-open a document, just as if=20= >> it had been saved in the traditional .doc format=97all text,=20 >> formatting, styles, Which pretty looks like an answer to your question. But (there is a but) it looks like the xml dialect created by M$ is=20 actually not very well written: it does not allow tags to contain tags=20= _and_ text: <b>something bold<i>something bold and in italics</i> something bold=20 and not in italics</b> would be translated in WPML to something like: <b>something bold</b> <b i>something bold and in italics</b i> <b> something bold and not in italics</b> which means (and i think that is why a word html file was so long to be=20= loaded in OmT) that tag information is considerably more present than=20 in standard (x|ht)ml and thus leads to more cpu usage. Also, working on an xml parser for M$Office would work only with=20 Windows versions since the Mac Office versions (even the recently=20 released Office 2004 for Mac) terribly lag behind in terms of xml=20 support. It is only possible to do exports to html. And I don't have=20 much info on the losslessness (?) of M$Office html export/import=20 function. Besides, and that is a similar but different subject, I checked the=20 pocket RTF reference sample chapter at oreilly: "The RTF Pocket Guide"=20= "Part 1: RTF tutorial" and it is extremely well explained. For people=20 who are not familiar with how RTF works that's a really good start. JC= |