Arno Peters

Show:

What's happening?

  • Comment: PDF input filter

    I have taken down the link mentioned in the request details. I was unaware of the license conflict between OmegaT and PDFBox. I have included the patch in this request so people may experiment to determine if including PDF content extraction is in fact a useful addition to OmegaT. If so, this patch can be rewritten to use a library without a conflicting license. PDFBox can be obtained...

    2009-06-16 21:27:46 UTC in OmegaT - multiplatform CAT tool

  • PDF input filter

    The included link points to a patch and accompanying jar files to teach OmegaT to read the contents of PDF files and present this information to the user. I have used PDFBox (http://www.pdfbox.org) to extract the text contents. Since PDF has no concept of paragraphs, only of lines, the filter will try to append lines together to ensure a better segmentation result. The zip archive can be...

    2009-06-09 22:49:06 UTC in OmegaT - multiplatform CAT tool

  • Comment: LaTeX filter support

    texinfo uses different parsing rules from TeX and LaTeX. I have to investigate how much they diverge. In all probability this requires creating a different input filter just for texinfo files.

    2009-06-09 21:56:26 UTC in OmegaT - multiplatform CAT tool

  • Comment: LaTeX filter support

    The filter will work on plain TeX files, but not on texinfo files. The source contains a list of commands to filter, for proper plain TeX support, these lists should be expanded.

    2009-06-04 20:50:41 UTC in OmegaT - multiplatform CAT tool

  • Comment: LaTeX filter support

    I have uploaded a third version of the LaTeX input filter that achieves technical equivalence on the output documents. This version has only a few minor tweaks to the second patch, for example, the new patch eliminates a number of spurious backslashes in the target files.

    2009-05-29 19:41:10 UTC in OmegaT - multiplatform CAT tool

  • Comment: LaTeX filter support

    The second version of the LaTeX filter tries to insulate the translator as much as possible from the underlying TeX system.

    2009-05-29 05:04:26 UTC in OmegaT - multiplatform CAT tool

  • Dutch segmentation rules

    The attached patch teaches OmegaT some Dutch segmentation rules exceptions.

    2009-05-11 22:10:52 UTC in OmegaT - multiplatform CAT tool

  • LaTeX filter support

    The attached patch teaches OmegaT to read LaTeX files. The code tries to read paragraphs, command sequences and comments the same way TeX does. Everything encountered is presented for translation, the only thing not preserved is the line endings within a paragraph (as per TeX). The patch is taken relative to OmegaT trunk. Apply the patch from OmegaT top level directory like so: $ patch...

    2009-05-11 22:02:33 UTC in OmegaT - multiplatform CAT tool

  • Workaround incorrect byte encoding in Active Directory

    Retrieving information from an Active Directory causes problems. Luma 2.3 (problem also persists in version 2.4) complains: File "/usr/lib/luma/base/utils/gui/BrowserWidget.py", line 149, in itemClicked self.emit(PYSIGNAL("ldap_result"), (deepcopy(result),)) File "/usr/lib/luma/base/utils/gui/AdvancedObjectWidget.py", line 136, in initView self.displayValues() File...

    2008-11-07 23:38:05 UTC in Luma

  • UI system (native) look and feel patch

    I have made the patch against SVN trunk r126 apply as follows: cd datavision patch -p1 < /path/to/ui.patch.

    2008-01-16 15:44:52 UTC in DataVision

About Me

  • 2001-10-21 (8 years ago)
  • 355161
  • ape (My Site)
  • Arno Peters

Send me a message