osra Wiki
Brought to you by:
igor_filippov
InitializeMagick() (required for GraphicsMagick v1.3.8). Added multi-threading support (OpenMP) for processing many-page documents (Linux/Unix only, does not yet function on Windows). Added command-line option for saving output to a file. -u command line option). Added support for OCRAD v0.19. -R (--rotate) command line switch to rotate the image. Modified debug output (-d option) to show the output from superatom dictionary. Made debug output more consistent. -r (--resolution) command line options. You can get higher quality results (at the expense of slower speed) running with the following command line: osra -r 300 -f sdf file.pdf. Also, you can see the page number for structures from PDF documents with -e (--page) option. superatom.txt contains the SMILES strings for each recongized label and spelling.txt contains spelling variants of every label for cases where OCR engine is not reliable. Please note that the dependencies have changed – OCRAD v0.18 is now required and RDKit support is temporarily suspended.