Showing 20 open source projects for "ocr search image"

View related business solutions
  • Top-Rated Free CRM Software Icon
    Top-Rated Free CRM Software

    216,000+ customers in over 135 countries grow their businesses with HubSpot

    HubSpot is an AI-powered customer platform with all the software, integrations, and resources you need to connect your marketing, sales, and customer service. HubSpot's connected platform enables you to grow your business faster by focusing on what matters most: your customers.
    Get started free
  • The #1 Embedded Analytics Solution for SaaS Teams. Icon
    The #1 Embedded Analytics Solution for SaaS Teams.

    Qrvey saves engineering teams time and money with a turnkey multi-tenant solution connecting your data warehouse to your SaaS application.

    Qrvey’s comprehensive embedded analytics software enables you to design more customizable analytics experiences for your end users.
    Try Developer Playground
  • 1
    OCRmyPDF

    OCRmyPDF

    OCRmyPDF adds an OCR text layer to scanned PDF files

    OCRmyPDF adds an optical character recognition (OCR) text layer to scanned PDF files, allowing them to be searched. PDF is the best format for storing and exchanging scanned documents. Unfortunately, PDFs can be difficult to modify. OCRmyPDF makes it easy to apply image processing and OCR (recognized, searchable text) to existing PDFs.
    Downloads: 36 This Week
    Last Update:
    See Project
  • 2
    Super-PDF-Editor

    Super-PDF-Editor

    World's most comprehensive, powerful, process-based PDF editor

    World's most comprehensive, powerful, process-based and lighting fast PDF reader, editor and batch processor. PDF editing with 60+ features rich tools and function like OCR pdf and images and produce output like searchable PDF, Text, Hocr, Box, Unlv. Also, improve image enhancement before OCR operation for better OCR performance. pdf Imposition, etc. Super PDF Editor is best for bulk pdf processing, especially for the printing industry. Easy pdf imposition, booklet, n ups pages, and more. OCR...
    Downloads: 44 This Week
    Last Update:
    See Project
  • 3
    Super-PDF-Editor-Lite

    Super-PDF-Editor-Lite

    World's most comprehensive, powerful, process-based PDF editor

    World's most comprehensive, powerful, process-based and lighting fast PDF reader, editor and batch processor. Includes features like Create PDF from Images, HTML, Text files. Create a processing log file. Extract Page, Split Page, Rotate Page, Merge Page, Duplicate page, Move Page, Printing, and Compress Page. Improve image enhancement before OCR operation for better OCR performance. pdf Imposition, etc. Super PDF Editor is best for bulk pdf processing, especially for the printing industry...
    Downloads: 23 This Week
    Last Update:
    See Project
  • 4
    docconv

    docconv

    Converts PDF, DOC, DOCX, XML, HTML, RTF, etc to plain text

    A Go wrapper library to convert PDF, DOC, DOCX, XML, HTML, RTF, ODT, Pages documents and images (see optional dependencies below) to plain text. See go help install for details on the installation location of the installed docd executable. Make sure that the full path to the executable is in your PATH environment variable. To add image support to the docconv library you first need to install and build gosseract. Now you can add -tags ocr to any go command when building/fetching/testing docconv...
    Downloads: 1 This Week
    Last Update:
    See Project
  • Build enterprise-ready GenAI experiences with MongoDB Atlas Icon
    Build enterprise-ready GenAI experiences with MongoDB Atlas

    Combine the power of Google Cloud's robust infrastructure with the flexibility and scalability of MongoDB Atlas.

    MongoDB Atlas is a unified developer platform that enables you to confidently accelerate the deployment of GenAI-powered applications. Additionally, when purchased on Google Cloud Marketplace, you pay for only the resources you use with no upfront commitment.
    Get Started
  • 5
    mp-html

    mp-html

    Small program rich text component, supports rendering and editing html

    A powerful applet-rich text component. Small program rich text component supports rendering and editing HTML and supports use on WeChat, QQ, Baidu, Alipay, Toutiao, and uni-app platforms. Displaying dynamic HTML rich text is a necessary requirement for many applications. The applet platform does not support dom operations, making this a problem. The built-in rich-text component supports few tags and blocks all events, making it difficult for practical application. Therefore, there is such a...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 6
    DocArray

    DocArray

    The data structure for multimodal data

    DocArray is a library for nested, unstructured, multimodal data in transit, including text, image, audio, video, 3D mesh, etc. It allows deep-learning engineers to efficiently process, embed, search, recommend, store, and transfer multimodal data with a Pythonic API. Door to multimodal world: super-expressive data structure for representing complicated/mixed/nested text, image, video, audio, 3D mesh data. The foundation data structure of Jina, CLIP-as-service, DALL·E Flow, DiscoArt etc. Data...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7
    Responsively Lazy

    Responsively Lazy

    Lazy load responsive images

    .... They make you break your HTML by removing the src attribute, or make you put tiny version there or make you use <noscript> to make your images appear in Google Images. The lazy loading works in browsers supporting the srcset attribute. As of December 2017 that's 86.78%. Unsupported browsers will load the image in the src attribute. That's the image search engines and social networks will find, so it's better to make it high-resolution.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8
    Super PDF Editor Lite

    Super PDF Editor Lite

    Create, Edit, Delete, Organize , Convert, Export, Secure & Sign.

    Super PDF Editor Lite is a robust and versatile PDF management software designed to streamline your document handling needs. Whether you're an individual, student, or professional, this software offers a comprehensive suite of tools to create, edit, and manage your PDFs with ease. Key Features: Extract Page: Easily extract specific pages from a PDF document. Split Page: Divide a single PDF page into multiple smaller pages. Rotate Page: Rotate pages to adjust their orientation. Merge...
    Leader badge
    Downloads: 17 This Week
    Last Update:
    See Project
  • 9
    Discord Bot Client

    Discord Bot Client

    A patched version of discord, with bot login support

    Discord Bot Client allows you to use your bot, just like any other user account, except Friends and Groups. View Guilds (Lazy load them) Manage Guilds (Name, Image, Audit log, Emoji, Webhooks, Invites, Bans, Widget, Moderation, Roles) Manage Channels (Add, Delete, Name, Permissions, Invites, Webhooks, Slowmode, NSFW, Topic) Messages (Send, View History, Embeds, View Reactions, Add/Remove Reactions, Delete, Edit, Pin) Create a Guild (if the bot has fewer than 10 Servers) See Guild members...
    Downloads: 36 This Week
    Last Update:
    See Project
  • Bright Data - All in One Platform for Proxies and Web Scraping Icon
    Bright Data - All in One Platform for Proxies and Web Scraping

    Say goodbye to blocks, restrictions, and CAPTCHAs

    Bright Data offers the highest quality proxies with automated session management, IP rotation, and advanced web unlocking technology. Enjoy reliable, fast performance with easy integration, a user-friendly dashboard, and enterprise-grade scaling. Powered by ethically-sourced residential IPs for seamless web scraping.
    Get Started
  • 10

    Merge PDF Files

    It is a Windows library that merges standard PDFs into a final PDF

    ... or asynchronous. We want to give you a benchmark, the library was used to create a PDF from single page(scanned) image by an OCR SDK (it is not included in our library, you can use any on the market): 20,000 Images (the OCR SDK creates single page PDF text searchable, running 50 threads) in 80 minutes. The size of the final PDF searchable was 800Mb. If you download the library, we provide a sample which cover all the scenarios possible (synchronous and asynchronous).
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11

    ftdetector

    File type detector library

    This project is a tool to detect file types by signatures and mime types. It uses hash tables to make the detection of a file type as fast as possible. The signature and mime types lists are stored at simple user-friendly files. This file type detector supports a lot of formats (image, archive, text, documents, audio, video, fonts and others). It also includes Microsoft OLE compound file types. The detector's algorythm has special features to detect text file types like (HTML, XML, JSON, etc...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    iText®, a JAVA PDF library

    iText®, a JAVA PDF library

    PDF Library for Developers

    iText is an open-source PDF library available for Java and .NET (C#). iText allows you to effortlessly generate and manipulate standards-compliant PDF documents with a powerful and feature-rich SDK. With iText, you can create archivable and accessible PDFs, split and merge documents, fill and flatten forms, digitally sign documents, and more. iText add-ons enable additional functionality, such as PDF creation from HTML templates, secure redaction, OCR, and much more. The latest versions...
    Leader badge
    Downloads: 460 This Week
    Last Update:
    See Project
  • 13
    QBooru

    QBooru

    Simple viewer app for most Booru Sites, coded in Qt C++

    ..., within 3-5 rows and 5-8 columns. The recommended size is 3*5, as it fits perfectly the smallest window size. Search tab example : http://puu.sh/cK0pj.jpg Users can change nearly all the parameters through the Option tab, or by editing the configuration.json file. Boorus can be added, edited, and deleted through the Booru tab in the software, but they can also be edited manually in the configuration file.
    Downloads: 16 This Week
    Last Update:
    See Project
  • 14
    Editor.md

    Editor.md

    The open source embeddable online markdown editor (component)

    The open source embeddable online markdown editor (component), based on CodeMirror & jQuery & Marked. The available dependents are CodeMirror, marked, jQuery, FontAwesome, github-markdown.css, KaTeX, prettify.js, Rephael.js, flowchart.js, sequence-diagram.js, and Prefixes.scss. You can enable HTML tags decode, TeX, Flowchart, sequence diagram, Emoji, FontAwesome, task lists. You can also enable image upload, [TOCM], search replace features, code fold, etc. Combine multiple JS module files used...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 15
    MathOCR

    MathOCR

    A scientific document recognition system

    MathOCR is a printed scientific document recognition system. MathOCR is still in the pre-alpha stage, recognition result may not be good enough for practical purposes. MathOCR is a printed scientific document recognition system written in pure Java. MathOCR has the functionality of image preprocessing, layout analysis and character recognition, especially the ability to recognize mathematical expression. MathOCR can work do not depends on other OCR software.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16

    XPDFSYNC

    Scripts to make emacs, pdfsync and xpdf work together.

    Forward/Inverse from tex to pdf never quite seem to work out of the box. These scripts aim to ensure that you will be able to do forward/inverse search from emacs to tex. Pdfsync is used for the synchronization.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    Book Shepherd helps to manage ebooks, articles, links and other electronic documents.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    Now you don't need to rack your brains with making a regexp to extract necessary rows from log files. Just run that tool and you'll get the right Regular Expression. Image how to extract rows with the time amoung your log?
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    Chamonix is a CHM viewer for Mac OS X 10.4. It is an Objective-C app that uses CHM lib (http://sourceforge.net/projects/chmlib), Cocoa and WebKit. It supports ToC, Indexing, Search and Favourites and multiple CHM documents.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 20
    Image2DocInfo has been made to quickly tag digital pictures. A GUI allows you to set attributes for an image, and then store them in XML files. Those files follow the Dublin Core naming scheme and are stored in the same directories than the pictures.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • Next