Scribe.js is a JavaScript library that provides Optical Character Recognition (OCR) and text extraction capabilities for both images and PDF documents, aimed at developers who want to build OCR features directly into their applications. The library can take image files (such as PNG or JPEG) and recognize the text they contain, and it can also extract text from PDF files that either already contain text or are image-based scans, using modern web standards and WebAssembly under the hood. In addition to simple text extraction, Scribe.js supports writing or injecting a high-quality invisible text layer back into PDFs, effectively making them searchable and improving usability for indexing or accessibility. It is written in modern ECMAScript Modules (ESM), so it can be imported in both browser and Node.js environments without a build step, though browser usage requires same-origin hosting of the files.

Features

  • Recognize text from images and scanned files
  • Extract or pull existing text from PDF documents
  • Insert searchable invisible text layers into PDF files
  • Supports usage in both browser (ESM) and Node.js environments
  • Example templates for popular frameworks
  • Designed for developer integration

Project Samples

Project Activity

See All Activity >

Categories

OCR

License

Affero GNU Public License

Follow Scribe.js

Scribe.js Web Site

Other Useful Business Software
Fully Managed MySQL, PostgreSQL, and SQL Server Icon
Fully Managed MySQL, PostgreSQL, and SQL Server

Automatic backups, patching, replication, and failover. Focus on your app, not your database.

Cloud SQL handles your database ops end to end, so you can focus on your app.
Try Free
Rate This Project
Login To Rate This Project

User Reviews

Be the first to post a review of Scribe.js!

Additional Project Details

Programming Language

JavaScript

Related Categories

JavaScript OCR Software

Registered

2026-02-13