ocr search image free download

Showing 20 open source projects for "ocr search image"

View related business solutions

Formats and Protocols Clear Filters & Widen Search

Top-Rated Free CRM Software
216,000+ customers in over 135 countries grow their businesses with HubSpot

HubSpot is an AI-powered customer platform with all the software, integrations, and resources you need to connect your marketing, sales, and customer service. HubSpot's connected platform enables you to grow your business faster by focusing on what matters most: your customers.

Get started free
The #1 Embedded Analytics Solution for SaaS Teams.
Qrvey saves engineering teams time and money with a turnkey multi-tenant solution connecting your data warehouse to your SaaS application.

Qrvey’s comprehensive embedded analytics software enables you to design more customizable analytics experiences for your end users.

Try Developer Playground
1

OCRmyPDF

OCRmyPDF adds an OCR text layer to scanned PDF files

OCRmyPDF adds an optical character recognition (OCR) text layer to scanned PDF files, allowing them to be searched. PDF is the best format for storing and exchanging scanned documents. Unfortunately, PDFs can be difficult to modify. OCRmyPDF makes it easy to apply image processing and OCR (recognized, searchable text) to existing PDFs.

Downloads: 36 This Week

Last Update: 6 days ago
See Project
2

Super-PDF-Editor

World's most comprehensive, powerful, process-based PDF editor

World's most comprehensive, powerful, process-based and lighting fast PDF reader, editor and batch processor. PDF editing with 60+ features rich tools and function like OCR pdf and images and produce output like searchable PDF, Text, Hocr, Box, Unlv. Also, improve image enhancement before OCR operation for better OCR performance. pdf Imposition, etc. Super PDF Editor is best for bulk pdf processing, especially for the printing industry. Easy pdf imposition, booklet, n ups pages, and more. OCR...

3 Reviews

Downloads: 44 This Week

Last Update: 2023-02-02
See Project
3

Super-PDF-Editor-Lite

World's most comprehensive, powerful, process-based PDF editor

World's most comprehensive, powerful, process-based and lighting fast PDF reader, editor and batch processor. Includes features like Create PDF from Images, HTML, Text files. Create a processing log file. Extract Page, Split Page, Rotate Page, Merge Page, Duplicate page, Move Page, Printing, and Compress Page. Improve image enhancement before OCR operation for better OCR performance. pdf Imposition, etc. Super PDF Editor is best for bulk pdf processing, especially for the printing industry...

3 Reviews

Downloads: 23 This Week

Last Update: 2023-02-02
See Project
4

docconv

Converts PDF, DOC, DOCX, XML, HTML, RTF, etc to plain text

A Go wrapper library to convert PDF, DOC, DOCX, XML, HTML, RTF, ODT, Pages documents and images (see optional dependencies below) to plain text. See go help install for details on the installation location of the installed docd executable. Make sure that the full path to the executable is in your PATH environment variable. To add image support to the docconv library you first need to install and build gosseract. Now you can add -tags ocr to any go command when building/fetching/testing docconv...

Downloads: 1 This Week

Last Update: 2023-10-30
See Project
Build enterprise-ready GenAI experiences with MongoDB Atlas
Combine the power of Google Cloud's robust infrastructure with the flexibility and scalability of MongoDB Atlas.

MongoDB Atlas is a unified developer platform that enables you to confidently accelerate the deployment of GenAI-powered applications. Additionally, when purchased on Google Cloud Marketplace, you pay for only the resources you use with no upfront commitment.

Get Started
5

mp-html

Small program rich text component, supports rendering and editing html

A powerful applet-rich text component. Small program rich text component supports rendering and editing HTML and supports use on WeChat, QQ, Baidu, Alipay, Toutiao, and uni-app platforms. Displaying dynamic HTML rich text is a necessary requirement for many applications. The applet platform does not support dom operations, making this a problem. The built-in rich-text component supports few tags and blocks all events, making it difficult for practical application. Therefore, there is such a...

Downloads: 1 This Week

Last Update: 2024-04-22
See Project
6

DocArray

The data structure for multimodal data

DocArray is a library for nested, unstructured, multimodal data in transit, including text, image, audio, video, 3D mesh, etc. It allows deep-learning engineers to efficiently process, embed, search, recommend, store, and transfer multimodal data with a Pythonic API. Door to multimodal world: super-expressive data structure for representing complicated/mixed/nested text, image, video, audio, 3D mesh data. The foundation data structure of Jina, CLIP-as-service, DALL·E Flow, DiscoArt etc. Data...

Downloads: 0 This Week

Last Update: 2023-12-22
See Project
7

Responsively Lazy

Lazy load responsive images

.... They make you break your HTML by removing the src attribute, or make you put tiny version there or make you use <noscript> to make your images appear in Google Images. The lazy loading works in browsers supporting the srcset attribute. As of December 2017 that's 86.78%. Unsupported browsers will load the image in the src attribute. That's the image search engines and social networks will find, so it's better to make it high-resolution.

1 Review

Downloads: 0 This Week

Last Update: 2024-04-30
See Project
8

Super PDF Editor Lite

Create, Edit, Delete, Organize , Convert, Export, Secure & Sign.

Super PDF Editor Lite is a robust and versatile PDF management software designed to streamline your document handling needs. Whether you're an individual, student, or professional, this software offers a comprehensive suite of tools to create, edit, and manage your PDFs with ease. Key Features: Extract Page: Easily extract specific pages from a PDF document. Split Page: Divide a single PDF page into multiple smaller pages. Rotate Page: Rotate pages to adjust their orientation. Merge...

6 Reviews

Downloads: 17 This Week

Last Update: 2024-08-11
See Project
9

Discord Bot Client

A patched version of discord, with bot login support

Discord Bot Client allows you to use your bot, just like any other user account, except Friends and Groups. View Guilds (Lazy load them) Manage Guilds (Name, Image, Audit log, Emoji, Webhooks, Invites, Bans, Widget, Moderation, Roles) Manage Channels (Add, Delete, Name, Permissions, Invites, Webhooks, Slowmode, NSFW, Topic) Messages (Send, View History, Embeds, View Reactions, Add/Remove Reactions, Delete, Edit, Pin) Create a Guild (if the bot has fewer than 10 Servers) See Guild members...

Downloads: 36 This Week

Last Update: 2023-05-05
See Project
Bright Data - All in One Platform for Proxies and Web Scraping
Say goodbye to blocks, restrictions, and CAPTCHAs

Bright Data offers the highest quality proxies with automated session management, IP rotation, and advanced web unlocking technology. Enjoy reliable, fast performance with easy integration, a user-friendly dashboard, and enterprise-grade scaling. Powered by ethically-sourced residential IPs for seamless web scraping.

Get Started
10

Merge PDF Files

It is a Windows library that merges standard PDFs into a final PDF

... or asynchronous. We want to give you a benchmark, the library was used to create a PDF from single page(scanned) image by an OCR SDK (it is not included in our library, you can use any on the market): 20,000 Images (the OCR SDK creates single page PDF text searchable, running 50 threads) in 80 minutes. The size of the final PDF searchable was 800Mb. If you download the library, we provide a sample which cover all the scenarios possible (synchronous and asynchronous).

Downloads: 0 This Week

Last Update: 2020-02-12
See Project
11

ftdetector

File type detector library

This project is a tool to detect file types by signatures and mime types. It uses hash tables to make the detection of a file type as fast as possible. The signature and mime types lists are stored at simple user-friendly files. This file type detector supports a lot of formats (image, archive, text, documents, audio, video, fonts and others). It also includes Microsoft OLE compound file types. The detector's algorythm has special features to detect text file types like (HTML, XML, JSON, etc...

Downloads: 0 This Week

Last Update: 2019-04-08
See Project
12

iText®, a JAVA PDF library

PDF Library for Developers

iText is an open-source PDF library available for Java and .NET (C#). iText allows you to effortlessly generate and manipulate standards-compliant PDF documents with a powerful and feature-rich SDK. With iText, you can create archivable and accessible PDFs, split and merge documents, fill and flatten forms, digitally sign documents, and more. iText add-ons enable additional functionality, such as PDF creation from HTML templates, secure redaction, OCR, and much more. The latest versions...

Downloads: 460 This Week

Last Update: 2024-06-01
See Project
13

QBooru

Simple viewer app for most Booru Sites, coded in Qt C++

..., within 3-5 rows and 5-8 columns. The recommended size is 3*5, as it fits perfectly the smallest window size. Search tab example : http://puu.sh/cK0pj.jpg Users can change nearly all the parameters through the Option tab, or by editing the configuration.json file. Boorus can be added, edited, and deleted through the Booru tab in the software, but they can also be edited manually in the configuration file.

Downloads: 16 This Week

Last Update: 2016-10-28
See Project
14

Editor.md

The open source embeddable online markdown editor (component)

The open source embeddable online markdown editor (component), based on CodeMirror & jQuery & Marked. The available dependents are CodeMirror, marked, jQuery, FontAwesome, github-markdown.css, KaTeX, prettify.js, Rephael.js, flowchart.js, sequence-diagram.js, and Prefixes.scss. You can enable HTML tags decode, TeX, Flowchart, sequence diagram, Emoji, FontAwesome, task lists. You can also enable image upload, [TOCM], search replace features, code fold, etc. Combine multiple JS module files used...

Downloads: 1 This Week

Last Update: 2021-05-25
See Project
15

MathOCR

A scientific document recognition system

MathOCR is a printed scientific document recognition system. MathOCR is still in the pre-alpha stage, recognition result may not be good enough for practical purposes. MathOCR is a printed scientific document recognition system written in pure Java. MathOCR has the functionality of image preprocessing, layout analysis and character recognition, especially the ability to recognize mathematical expression. MathOCR can work do not depends on other OCR software.

Downloads: 0 This Week

Last Update: 2024-05-16
See Project
16

XPDFSYNC

Scripts to make emacs, pdfsync and xpdf work together.

Forward/Inverse from tex to pdf never quite seem to work out of the box. These scripts aim to ensure that you will be able to do forward/inverse search from emacs to tex. Pdfsync is used for the synchronization.

Downloads: 0 This Week

Last Update: 2015-11-11
See Project
17

DekarLab Book Shepherd

Book Shepherd helps to manage ebooks, articles, links and other electronic documents.

Downloads: 0 This Week

Last Update: 2015-11-16
See Project
18

LikeGrep

Now you don't need to rack your brains with making a regexp to extract necessary rows from log files. Just run that tool and you'll get the right Regular Expression. Image how to extract rows with the time amoung your log?

Downloads: 0 This Week

Last Update: 2014-07-09
See Project
19

Chamonix

Chamonix is a CHM viewer for Mac OS X 10.4. It is an Objective-C app that uses CHM lib (http://sourceforge.net/projects/chmlib), Cocoa and WebKit. It supports ToC, Indexing, Search and Favourites and multiple CHM documents.

9 Reviews

Downloads: 1 This Week

Last Update: 2015-11-08
See Project
20

Image2DocInfo

Image2DocInfo has been made to quickly tag digital pictures. A GUI allows you to set attributes for an image, and then store them in XML files. Those files follow the Dublin Core naming scheme and are stored in the same directories than the pictures.

Downloads: 0 This Week

Last Update: 2013-03-13
See Project