Search Results for "extract text from multiple word files"

Showing 428 open source projects for "extract text from multiple word files"

  • 1
    Extract TOTP/HOTP secrets

    Extract one time password (OTP) secrets from QR codes

    The Python script extract_otp_secrets.py extracts one-time password (OTP) secrets from QR codes exported by two-factor authentication (2FA) apps such as "Google Authenticator".
    Downloads: 0 This Week
  • 2
    LosslessCut

    The swiss army knife of lossless video/audio editing

    LosslessCut aims to be the ultimate cross platform FFmpeg GUI for extremely fast and lossless operations on video, audio, subtitle and other related media files. The main feature is lossless trimming and cutting of video and audio files, which is great for saving space by rough-cutting your large video files taken from a video camera, GoPro, drone, etc. It lets you quickly extract the good parts from your videos and discard many gigabytes of data without doing a slow re-encode and thereby...
    Downloads: 284 This Week
  • 3
    Video-subtitle-extractor

    A GUI tool for extracting hard-coded subtitle (hardsub) from videos

    Video hard subtitle extraction, generate srt file. There is no need to apply for a third-party API, and text recognition can be implemented locally. A deep learning-based video subtitle extraction framework, including subtitle region detection and subtitle content extraction. A GUI tool for extracting hard-coded subtitles (hardsub) from videos and generating srt files. Use local OCR recognition, no need to set up and call any API, and do not need to access online OCR services such as Baidu...
    Downloads: 49 This Week
  • 4
    Joplin

    Open source note taking and to-do app with synchronization

    Joplin is a free and open source note-taking and to-do application that can handle a large number of notes in Markdown format, organize them into notebooks and synchronize them with various cloud services. All notes can also be copied, tagged, searched and modified directly from the app or through your own text editor. Notes that are exported from Evernote can be imported into Joplin, be it formatted content, resources, complete metadata or plain Markdown files. When notes are synchronized...
    Downloads: 41 This Week
  • 5
    Papermerge

    Open Source Document Management System for Digital Archives

    ...-source software which means that transparency is the core value of our software development. Source code can be reviewed and improved by anyone from anywhere. Papermerge supports multiple users. Each user can be assigned different permissions to perform only a specific kind of action, e.g. view only documents from a specific folder. OCR technology is a vital part of Papermerge. It extracts text information from scanned documents and from PDF, JPEG, and TIFF files.
    Downloads: 38 This Week
  • 6
    Tesseract.js

    A pure Javascript Multilingual OCR

    Tesseract.js is a pure Javascript port of the popular Tesseract OCR engine. The Tesseract.js library supports more than 100 languages, automatic text orientation and script detection, and a simple interface for reading paragraph, word, and character bounding boxes. Tesseract.js can run either in a browser or on a server with NodeJS. Tesseract.js is a Javascript library that gets words in almost any spoken language out of images. The main Tesseract.js functions (e.g. recognize, detect) take an image...
    Downloads: 16 This Week
  • 7
    go-i18n

    Translate your Go program into multiple languages

    go-i18n is a Go package and a command that helps you translate Go programs into multiple languages. Supports pluralized strings for all 200+ languages in the Unicode Common Locale Data Repository (CLDR). Code and tests are automatically generated from CLDR data. Supports strings with named variables using text/template syntax. Supports message files of any format (e.g. JSON, TOML, YAML). Use goi18n extract to extract all i18n.Message struct literals in Go source files to a message file...
    Downloads: 8 This Week
  • 8
    SOPS

    Simple and flexible tool for managing secrets

    sops is an editor of encrypted files that supports YAML, JSON, ENV, INI and BINARY formats and encrypts with AWS KMS, GCP KMS, Azure Key Vault, age, and PGP. For the adventurous, unstable features are available in the develop branch, which you can install from source. To use sops as a library, take a look at the decrypt package. We rewrote Sops in Go to solve a number of deployment issues, but the Python branch still exists under python-sops. We will keep maintaining it for a while, and you can...
    Downloads: 24 This Week
  • 9
    Image Toolbox

    Image Toolbox is a powerful picture editor, which can crop

    Image Toolbox is a powerful picture editor, which can crop, apply filters, add some drawings, erase background, edit EXIF, or even create a PDF file.
    Downloads: 11 This Week
  • 10
    syslog-ng

    Log management solution that improves the performance of SIEM

    syslog-ng is the log management solution that improves the performance of your SIEM solution by reducing the amount and improving the quality of data feeding your SIEM. With syslog-ng Store Box, you can find the answer. Search billions of logs in seconds using full text queries with Boolean operators to pinpoint critical logs. syslog-ng Store Box provides secure, tamper-proof storage and custom reporting to demonstrate compliance. syslog-ng can deliver data from a wide variety of sources...
    Downloads: 16 This Week
  • 11
    PaperQA2

    High accuracy RAG for answering questions from scientific documents

    PaperQA2 is a package for doing high-accuracy retrieval augmented generation (RAG) on PDFs or text files, with a focus on the scientific literature. See our recent 2024 paper to see examples of PaperQA2's superhuman performance in scientific tasks like question answering, summarization, and contradiction detection. In this example we take a folder of research paper PDFs, magically get their metadata - including citation counts and a retraction check, then parse and cache PDFs into a full-text...
    Downloads: 10 This Week
  • 12
    Super-PDF-Editor-Lite

    World's most comprehensive, powerful, process-based PDF editor

    World's most comprehensive, powerful, process-based and lightning-fast PDF reader, editor and batch processor. Includes features like Create PDF from Images, HTML, Text files. Create a processing log file. Extract Page, Split Page, Rotate Page, Merge Page, Duplicate Page, Move Page, Printing, and Compress Page. Enhance images before the OCR operation for better OCR performance. PDF imposition, etc. Super PDF Editor is best for bulk PDF processing, especially for the printing industry...
    Downloads: 15 This Week
  • 13
    fzf

    A command-line fuzzy finder

    ... in fullscreen mode, but you can make it start below the cursor with the height option. Unless otherwise specified, fzf starts in "extended-search mode" where you can type in multiple search terms delimited by spaces. Fuzzy completion for files and directories can be triggered if the word before the cursor ends with the trigger sequence, which is by default **. Fuzzy completion for PIDs is provided for the kill command. In this case, there is no trigger sequence.
    Downloads: 6 This Week
  • 14
    micro

    A modern and intuitive terminal-based text editor

    Micro's number one feature is being easy to install (it's just a static binary with no dependencies) and easy to use. Use a simple json format to configure your options and rebind keys to your liking. If you need more power, you can use Lua to configure the editor further. Micro supports over 75 languages and has 7 default colorschemes to choose from. Micro supports 16, 256, and truecolor themes. Syntax files and colorschemes are also very simple to make. Micro has support for Sublime-style...
    Downloads: 7 This Week
  • 15
    PdfPig

    Read and extract text and other content from PDFs in C#

    This project allows users to read and extract text and other content from PDF files. In addition the library can be used to create simple PDF documents containing text and geometrical shapes.
    Downloads: 1 This Week
  • 16
    Markdown Monster

    An extensible Markdown Editor, Viewer and Weblog Publisher for Windows

    ... and word counts keep your content streamlined. You can export Markdown to PDF or HTML on disk or copy Markdown selections as HTML to the clipboard. The HTML preview can display syntax-colored code snippets for most coding languages. Choose from light or dark app themes, and individual and fully customizable preview themes. Use the built-in folder browser to open, manage and drag files into content, use the document outline to quickly jump through content, or use our shell integration.
    Downloads: 4 This Week
  • 17
    dirsearch

    Web path scanner

    An advanced command-line tool designed to brute force directories and files in webservers, AKA web path scanner. Wordlist is a text file, each line is a path. About extensions, unlike other tools, dirsearch only replaces the %EXT% keyword with extensions from -e flag. For wordlists without %EXT% (like SecLists), -f | --force-extensions switch is required to append extensions to every word in wordlist, as well as the /. To use multiple wordlists, you can separate your wordlists with commas...
    Downloads: 12 This Week
  • 18
    Nativefier

    Make any web page a desktop application

    ... nativefier --help to learn about other command-line flags usable to configure the packaged app. For a list of build commands contributed by the nativefier community take a look at the CATALOG.md file. Nativefier is also usable from Docker. You can pass Nativefier flags, and mount volumes to provide local files.
    Downloads: 4 This Week
  • 19
    D2

    D2 is a modern diagram scripting language that turns text to diagrams

    ... beautifully right out of the box. See ./d2themes to browse the available themes and make or contribute your own creation. D2 is designed with language tooling in mind. D2's parser can parse multiple errors from a broken program, has an autoformatter, syntax highlighting, and we have plans for LSP's and more. Good language tooling is necessary for creating and maintaining large diagrams. D2 is designed to be extensible and composable.
    Downloads: 4 This Week
  • 20
    DocTR

    Library for OCR-related tasks powered by Deep Learning

    DocTR provides an easy and powerful way to extract valuable information from your documents. Seamlessly process documents for Natural Language Understanding tasks: we provide OCR predictors to parse textual information (localize and identify each word) from your documents. Robust 2-stage (detection + recognition) OCR predictors with pretrained parameters. User-friendly, 3 lines of code to load a document and extract text with a predictor. State-of-the-art performances on public document...
    Downloads: 1 This Week
  • 21
    Onefetch

    Git repository summary on your terminal

    ... and provides the user with valuable information like code distribution, pending changes, number of dependencies (by package manager), top contributors (by number of commits), the size on disk, creation date, LOC (lines of code), etc. Onefetch can be configured via command-line flags to display exactly what you want, the way you want it to: you can customize ASCII/Text formatting, disable info lines, ignore files & directories, and output in multiple formats (JSON, Yaml), etc.
    Downloads: 3 This Week
  • 22
    Cloud Commander

    Cloud Commander file manager for the web with console and editor

    Cloud Commander is a file manager for the web. It includes a command-line console and a text editor. Cloud Commander helps you manage your server and work with files, directories and programs in a web browser from any computer, mobile or tablet. Can be used locally or remotely. Adapts to screen size. 3 built-in editors with support for syntax highlighting: Dword, Edward and Deepword. Console with support for the default OS command line. Written in JavaScript/Node.js. Built-in archives pack: zip...
    Downloads: 1 This Week
  • 23
    Conversations

    The very last word in instant messaging

    Conversations is an open source Jabber/XMPP client for Android, designed to provide a unique and more freeing mobile messaging experience. XMPP is a federated protocol that can work with every XMPP server out there, giving you the freedom to choose a trustworthy server for yourself while still being able to chat with contacts from other servers. All communications are TLS encrypted and Conversations offers to enable one of two state of the art end-to-end encryption mechanisms, so you can...
    Downloads: 2 This Week
  • 24
    Carvel ytt

    YAML templating tool that works on YAML structure instead of text

    Template and patch as needed to easily make your configuration reusable and extensible. Works with your own and third-party YAML configuration. ytt templates are plain YAML documents made from nodes such as maps and arrays. ytt allows you to set values and attach statements (such as if and for loops) on those nodes. This eliminates text insertion concerns like manual escaping and allows for easy structure reuse, thus, increasing readability. Weave ytt templating into your own configuration...
    Downloads: 1 This Week
  • 25
    cheat

    Create and view interactive cheatsheets on the command-line

    .... Cheatsheet text may optionally be preceded by a YAML frontmatter header that assigns tags and specifies syntax. Cheatsheets are stored on "cheatpaths", which are directories that contain cheatsheets. Cheatpaths are specified in the conf.yml file. It can be useful to configure cheat against multiple cheatpaths. A common pattern is to store cheatsheets from multiple repositories on individual cheatpaths.
    Downloads: 1 This Week