Download Latest Version v6.0.1 source code.tar.gz (19.0 MB)
Email in envelope

Get an email when there's a new version of Tesseract.js

Home / v6.0.0
Name Modified Size InfoDownloads / Week
Parent folder
README.md 2025-01-07 1.9 kB
v6.0.0 source code.tar.gz 2025-01-07 19.0 MB
v6.0.0 source code.zip 2025-01-07 19.0 MB
Totals: 3 Items   38.0 MB 0

What's Changed

  • Fixed memory leaks (#977)
  • This version fixed a long-standing issue where memory would rise over time, eventually leading to a crash.
  • Reduced runtime and memory usage for most users by updating default formats (#916).
  • Fixed compatibility with Electron main process (#925)
  • Fixed bug where user-provided parameters were overwritten by defaults (#975).

Breaking Changes

  1. All outputs formats other than text are now disabled by default.
  2. To re-enable the hocr output (for example), set the following: worker.recognize(image, {}, { hocr: true })
    • See here for a list of possible output formats.
  3. The JavaScript object output format (blocks) was tweaked.
  4. Only the array of blocks (blocks) is returned.
    • Previous versions would automatically generate lists of every unit of text (words, symbols, etc.).
      • If needed, these should now be generated by the user.
  5. Only text-based blocks are reported.
    • Previous versions reported non-text blocks when detected by Tesseract (e.g. line segments).
  6. The shape of some objects were changed.
    • See the type declarations for reference on properties.
    • The main properties--text and bbox--are unchanged.
  7. Various functions and options marked as depreciated previously have been removed.
  8. This includes worker.initialize and worker.loadLanguage, along with several depreciated options from v2.

See [#993] for additional discussion about this release.

New Contributors

Full Changelog: https://github.com/naptha/tesseract.js/compare/v5.1.1...v6.0.0

Source: README.md, updated 2025-01-07