#2 PRINTDATE field not parsed

v1.0_(example)
closed-fixed
nobody
None
5
2005-02-02
2004-12-21
No

Hello there.

I ran the rtf2xml filter on a reasonably small MS WORD
document, and the output looks reasonable (albeit a
little cluttered). Good job.

I also found that the PRINTDATE fieldcode is not
processed by the script. In the code is a note saying
you want bugs reported, so I'm reporting it. ;-)
Output from stderr below.

* As an added note:

1. It would be 'nice' if people can add fieldcodes in
such cases.
2. Could the script OUTPUT something to the XML
stream nominating the "skipped" field code in-line
(perhaps as a comment)? In that way I could fix-up
the XML with a subsequent processing step, or
manually if it was a small issue.

In general I find with these sorts of filter programs
it is better to KEEP all information, even if there is
an error, so something can be done with the partial
result. If you use a comment, then the comment could
remind "me" to submit a bug report to sourceforge (as
now). :-)
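
A minimal sketch of the in-line comment idea, assuming a converter that handles one field instruction string at a time. The names `KNOWN_FIELDS` and `field_to_xml` and the `<field type="..."/>` element are hypothetical and are not taken from rtf2xml. Because an XML comment may not contain `--` or end in `-`, the skipped text is adjusted before it is written out.

```python
# Hypothetical set of field codes the converter already understands.
KNOWN_FIELDS = {"DATE", "PAGE", "AUTHOR"}


def field_to_xml(instruction):
    """Return an XML fragment for one field instruction string.

    Known fields become an (assumed) <field/> element; anything else is
    kept in-line as an XML comment instead of being dropped.
    """
    parts = instruction.split()
    name = parts[0].upper() if parts else ""
    if name in KNOWN_FIELDS:
        return '<field type="%s"/>' % name.lower()

    # XML comments must not contain "--" and must not end with "-".
    safe = instruction.strip()
    while "--" in safe:
        safe = safe.replace("--", "- -")
    if safe.endswith("-"):
        safe += " "
    return "<!-- skipped field: %s -->" % safe


print(field_to_xml(' PRINTDATE \\@ "MMMM yyyy" \\* MERGEFORMAT '))
```

A later processing step could then search the output for `skipped field:` comments and fix them up, or simply count them as a reminder to file a bug report.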

Well done & thanks.

Regards,
William
_______
module is field_strings.py
function is FieldStrings.__field_instruction_func
no key for "PRINTDATE" " PRINTDATE \@ "MMMM yyyy" \*
MERGEFORMAT "
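
For reference, the instruction in the stderr output above has a field name followed by switches. As far as I understand Word's field syntax, `\@` introduces a date-time picture and `\*` a general format switch. The sketch below splits such a string into its parts. It is an illustration only, and `parse_field_instruction` is a hypothetical helper, not the function in field_strings.py.

```python
import re

# A switch is a backslash plus one character, optionally followed by
# a quoted argument or a bare word.
SWITCH_RE = re.compile(r'\\(\S)\s*(?:"([^"]*)"|([^\s\\]+))?')


def parse_field_instruction(instruction):
    """Split a field instruction into (NAME, {switch: argument})."""
    text = instruction.strip()
    name, _, rest = text.partition(" ")
    switches = {}
    for match in SWITCH_RE.finditer(rest):
        flag = "\\" + match.group(1)
        quoted, bare = match.group(2), match.group(3)
        switches[flag] = quoted if quoted is not None else bare
    return name.upper(), switches


name, switches = parse_field_instruction(r' PRINTDATE \@ "MMMM yyyy" \* MERGEFORMAT ')
print(name, switches)
```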

Discussion

  • Paul Henry Tremblay

    • status: open --> pending
     
  • Paul Henry Tremblay

    Logged In: YES
    user_id=663081

    I'll try to fix this problem. I agree that *all* information
    should be kept when converting RTF to XML, and the script
    rtf2xml tries to do just this. However, it proves very
    difficult to achieve this goal because RTF contains so much
    information, has many different versions, and undergoes
    arbitrary changes.

    Can you send me the file? That would help me fix this
    shortcoming.

     
  • Anonymous - 2004-12-23

    Logged In: YES
    user_id=937473

    Example file attached...

    As far as keeping all the information goes: yes, there is a
    lot of dreck that is not important. However, _fieldCodes_
    are user or word processor dynamic variable values that the
    person put there for a reason. PrintDate, for instance, is
    the date I last printed the document (as this is a
    report, it should be dated by printing).

    Footnotes are fieldcodes; page numbers, section numbers,
    author, etc. are ALL fieldcodes -- and the user can define
    new ones. That's another reason to make it some kind of
    .configfile and always output any you can't recognise as a
    comment, so people can match their own fieldcode variables
    (custom document information) with rtf2xml too.

    The attachment also includes pictures -- like an <img ...> tag.
    Not sure what could be done about that. Again, I think
    outputting a comment with the missing PRESENTATION and CONTENT
    must be done, or even outputting it as text so the missing
    content is visible on any printed version, not just "inside
    the XML" as a comment.
    Cheers,
    Will
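
The config-file suggestion in the comment above could look roughly like this. It is a sketch under assumptions: the `[fields]` section, the element names, and the helpers `load_field_map` and `convert_field` are all hypothetical and not part of rtf2xml. Unrecognised fields are written out as visible, escaped text, so the gap also shows up in a printed version.

```python
import configparser
from xml.sax.saxutils import escape

# Hypothetical user config: field code name -> XML element name.
SAMPLE_CONFIG = """
[fields]
PRINTDATE = print-date
PROJECTCODE = project-code
"""


def load_field_map(config_text):
    """Read the [fields] section into a dict keyed by upper-case field name."""
    parser = configparser.ConfigParser()
    parser.optionxform = str  # keep key case as written (default lowercases)
    parser.read_string(config_text)
    return {key.upper(): value for key, value in parser["fields"].items()}


def convert_field(name, field_map):
    """Emit the mapped element, or a visible text placeholder if unmapped."""
    element = field_map.get(name.upper())
    if element is None:
        return escape("[unrecognised field: %s]" % name)
    return "<%s/>" % element


field_map = load_field_map(SAMPLE_CONFIG)
print(convert_field("PrintDate", field_map))
print(convert_field("MYCODE", field_map))
```

Keeping the mapping in a file means a user-defined field code only needs a new line in the config, not a change to the script.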

     
  • Anonymous - 2004-12-23
    • status: pending --> open
     
  • Paul Henry Tremblay

    Logged In: YES
    user_id=663081

    The problem should be fixed.

     
  • Paul Henry Tremblay

    • status: open --> open-fixed
     
  • Paul Henry Tremblay

    Logged In: YES
    user_id=663081

    I'm not seeing any attached file.

     
  • Paul Henry Tremblay

    • status: open-fixed --> closed-fixed
     
