Home

alonso laurent

Libmwaw is a new project for converting many pre-OSX MAC text formats, some graphic formats and some spreadsheet formats.

Contents:

(see http://sourceforge.net/projects/libmwaw/files/)

  1. libmwaw: the sources of the filters,
  2. libmwaw-regression: a list of test scripts and test files,
  3. mwawOSX: a signed application which contains libmwaw and writerperfect for OSX which tries to convert known files in .odg/.odp/.ods/.odt
  4. mwawCSVOSX: a small utility for OSX which tries to convert the recognized spreadsheet documents in CSV (including AppleWorks/ClarisWorks database/spreadsheet documents),
  5. mwawFileOSX: a small utility for OSX which tries to recognize the type of each files in a repository and which stores the result in a text file.

Note: as mwawCSVOSX and mwawFileOSX are not signed, you can take a look at https://support.apple.com/en-us/HT202491 to launch them if Gatekeeper is active...

Online conversion:

Using Emscripten (see http://kripken.github.io/emscripten-site/ ) and FileSaver (see https://github.com/eligrey/FileSaver.js/ ), I try to create a basic conversion tool which

  • asks for a pre-OSX MAC input file,
  • tries to convert it in a odg/odp/ods/odt file,
  • and finally opens a dialog to save the result or choose an application to open the result,

in http://libmwaw.sourceforge.net/convertMWAW.html.

Notes:

  • as the javascript's file created by Emscripten is quite big (~8MB), it is better to have a good bandwidth to test/use it,
  • converting big files can take several minutes,
  • if you use Safari, after converting the original file, Safari may save the resulting odg/odp/ods/odt file in the download repository using Unknown, Unknow-1, ... as filename ; then you must be able to open this file with LibreOffice, OpenOffice, ... :-~

Developers:

(see https://sourceforge.net/p/libmwaw/wiki/Developers/)

Recognized Features:

In general:

  1. Fonts: in general, it only recognizes some Western Fonts and the Symbol encoding. I also add S-JIS encoding in a few parsers, but I am not sure how to recognize the presence of such encoding, so it may not work, ...
  2. Other character properties: mostly recognized excepted right-to-left writing, ruby characters, ...
  3. Paragraph properties: mostly recognized.
  4. Needed more test files: it is better to have many test files in order to improve a filter, recognize more features, ... but for many formats, I clearly have too few files :-~

Word Processors:

Program Version Extended Fonts Table Frame Position Needed more test files Comments
AppleWorks/ClarisWorks all Yes (E1) Yes Yes No (C1)
BeagleWorks/WordPerfect Works v1.0 to v1.2 No N/A Yes Yes (C2)
FullWrite Professional v1.0 to v2.0 No Yes No (F1) Yes
GreatWorks v1.0 to v2.0 No N/A Yes Yes
HanMac Word J v2.0.4 No Yes Yes Yes
HanMac Word K v2.0.5 to v2.0.6 No Yes Yes Yes
LightWayText Mac Classic v4.5 Yes/No (E2) No Yes Yes ( Mac Classic) (C3), no auto indent support
MacWrite all No N/A Yes Yes (v1-v2.2)
MacWrite II, MacWrite Pro all No Yes Yes Yes
Mariner Write Mac Classic v1.6 to v3.5 Yes (E1) Yes No Yes (v1-v3)
Microsoft Word Mac v1.0 to v5.1 No Yes Yes Yes (v1-v3 ) (C4)
Microsoft Works only Mac No Yes Yes Yes (v3-v4)
Mind Write v2.1 N/A No Yes Yes very limited list support
Nisus Writer v3.4 to v6.5 No Yes Yes Yes (C5)
RagTime v2.1 to v3.2(V1) No Yes Yes Yes
WriteNow all No Yes Yes Yes (v1-v2)
WriterPlus unknown (V2) No Yes Yes Yes
Z-Write v1.3 No N/A N/A Yes (C5)

(Yes meaning mostly recognized)

(C1): accepts to convert draw documents, but the result can be junky if a frame appears on multiple pages.
(C2): retrieves pictures only in zipped or binhexed files.
(C3): only recognizes zipped or binhexed files and files saved in basic Mac format.
(C4): remains some formatting issues (mainly in tables), so some part formats may be bad.
(C5): only recognizes zipped or binhexed files.
(E1): S-JIS encoding.
(E2): S-JIS encoding (but I do not know how to detect S-JIS encoding, so probably NOT working).
(V1): there is also a minimal parser for v5.0 to v6.5 files which can retrieve some texts, pictures and spreadsheets (which may be used to retrieve some basic documents if you have no other way to that).
(V2): I have only a few files with unknown origin.

Basic Text Editors or Documentation Applications:

Program Version Extended Fonts Needed more test files Comments
Acta v2, Classic v1 No Yes
DOCMaker v4.0 No Yes (C1)
eDOC v2.1 No Yes (C1)
MacDoc v1.3 No Yes (C1)
MaxWrite v1 No Yes (C1)
More v2-v3 No Yes only retrieves the outline content
MouseWrite v1 No Yes
SimpleText/TeachText v1 Yes(E1) Yes (C2)
Style v1.6-v1.9 No Yes
Tex-Edit v2 Yes(E1) Yes (C1)

(C1): only recognizes zipped or binhexed files.
(C2): only recognizes zipped or binhexed files which contained at least a style or a picture.
(E1): S-JIS encoding if the font identificator is recognized.

Drawing Applications :

Note: converting graphic's files is a new feature, so some conversions need to be improved a lot.

Format:

Program Version Vector graphic Raster graphic Needed more test files Comments
ApplePict all Yes N/A No (C1)
AppleWorks/ClarisWorks all No(V1) Yes Yes (C2)
BeagleWorks/WordPerfect Works v1.0 to v1.2 Yes Yes Yes
ClarisDraw v1.0 Yes N/A Yes (C3)
CricketDraw v1.0 and v1.1 Yes N/A Yes
FreeHand v1 and v2 Yes N/A Yes (C4)
FullPaint v1 N/A Yes Yes
GreatWorks v1.0 to v2.0 Yes Yes Yes (C5)
MacDraft v1.0-1.2 and v4.3-v5.5 Yes N/A Yes (C6)
MacDraw v0.9 to v1.7 Yes N/A Yes
MacDraw II v1.0 to v1.1 Yes N/A Yes
MacDraw Pro v1.0 to v1.5 Yes N/A Yes (C7)
MacPaint all N/A Yes Yes
Microsoft Works only Mac Yes N/A Yes Basic
PixelPaint v1.0 to v2.1 N/A Yes Yes
SuperPaint v1(v1) Yes Yes Yes

(C1) clipping, transfer mode, video are ignored.
(C2) If a document contains some header/footer, it only retrieves the main bitmap and the text which is present in this header/footer.
(C3) Bounding text with shapes is not recognized...
(C4) This code is actually in libmwaw, but it will probably be moved in libfreehand
(C5) Paint v2.0 documents are only recognized if the documents were previously zipped or binhexed ( but as there are normal Apple PCT files, ...).
(C6) only basic retrieving on measure lines, 2D shapes formed by merging curves are not reconstructed.
(C7) only recognizes zipped or binhexed files.
(V1) Vector graphic documents are in fact converted as text document.
(v1) Newest versions of SuperPaint seem to generate classic Apple PCT or MacPaint files, so there is no plan to add specific support.

Presentation Applications :

Note: converting presentation's files is a new feature, so some conversions need to be improved a lot.

Format:

Program Version Needed more test files Comments
AppleWorks v6 Yes(N1)
PowerPoint Mac v1-v4 and Windows v2-v4(V1) Yes

(N1) I have so few real-word presentations, so...
(V1) Suppose that the encoding is Western Europe on Windows

Spreadsheet/Database Applications:

In general:

  1. Fonts: in general, it only recognizes some Western Fonts and the Symbol encoding. I also add S-JIS encoding in a few parsers, but I am not sure how to recognize the presence of such encoding, so it may not work, ...
  2. Other character properties: mostly recognized excepted right-to-left writing, ruby characters, ...
  3. Main content: values and formula, mostly recognized but some problems may remain for some very small numbers and for some functions in formula which does not have equivalent in modern spreadsheet,
  4. Frames, charts... (only for spreadsheet's conversion): very basic reconstruction.

Notes:

  1. Converting database's and spreadsheet's files is a new feature, so some conversions need to be improved a lot.
  2. Database's files are converted in spreadsheet documents.

Format:

Program Version Spreadsheet Database Extended Fonts Chart Frame Position Needed more test files Comments
AppleWorks/ClarisWorks all Yes Yes Yes (E1) No No Yes
BeagleWorks/WordPerfect Works v1.0 to v1.2 Yes Yes No N/A Basic Yes
Claris Resolve Mac v1.1 Yes N/A No No No Yes
GreatWorks v1.0 to v2.0 Yes Yes No N/A Basic Yes
Microsoft Works only Mac Yes Yes No No Basic Yes
Wingz v1.1 Yes N/A No No No Yes

(E1): S-JIS encoding.

Project Admins:


Get latest updates about Open Source Projects, Conferences and News.

Sign up for the SourceForge newsletter:





No, thanks