Showing 262 open source projects for "text extract"

View related business solutions
  • Our Free Plans just got better! | Auth0 by Okta Icon
    Our Free Plans just got better! | Auth0 by Okta

    With up to 25k MAUs and unlimited Okta connections, our Free Plan lets you focus on what you do best—building great apps.

    You asked, we delivered! Auth0 is excited to expand our Free and Paid plans to include more options so you can focus on building, deploying, and scaling applications without having to worry about your secuirty. Auth0 now, thank yourself later.
    Try free now
  • Bright Data - All in One Platform for Proxies and Web Scraping Icon
    Bright Data - All in One Platform for Proxies and Web Scraping

    Say goodbye to blocks, restrictions, and CAPTCHAs

    Bright Data offers the highest quality proxies with automated session management, IP rotation, and advanced web unlocking technology. Enjoy reliable, fast performance with easy integration, a user-friendly dashboard, and enterprise-grade scaling. Powered by ethically-sourced residential IPs for seamless web scraping.
    Get Started
  • 1
    Code7248.word_reader

    Code7248.word_reader

    A C# .NET Library for reading Word Documents (doc and docx)

    A simple .NET Library compatible with .NET 2.0, 3.0, 3.5 and 4.0. It can currently extract only the raw text from a .doc or .docx file. Use a RichTextBox instead of a normal TextBox to retain formatting, if you want to display it in your form.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 2

    Open Extract Processor

    Database ETL utility

    The Open Extract Processor (OEP) is a database ETL utility capable of merging/sorting/aggregating large partitioned database extracts. Native data files can be created from text files or from the accompanying MYSQL and Oracle import/export programs.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 3

    AADRTE

    Automatic Arabic Domain-Relevant Term Extraction

    In this research we propose a model for automatic domain-relevant term extraction from Arabic text corpus. The proposed model uses a hybrid approach composed of linguistic and statistical methods to extract terms relevant to specific domains depending on prevalence and tendency term ranking mechanism. This increases precision and recall as a measures of relevancy of extracted terms to a specific domain.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 4
    BiblePaste

    BiblePaste

    Paste Bible verses to your note in a fast and effective way

    BiblePaste is a free software. With it you can copy to the clipboard from the entire Bible and paste to any text editor in one button press. Through the settings you can customize the way and format of the insertion, so studying the Bible or taking notes is much easier. The software doesn't require installation. Just download, extract and run it. Even on USB flash drives.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Free CRM Software With Something for Everyone Icon
    Free CRM Software With Something for Everyone

    216,000+ customers in over 135 countries grow their businesses with HubSpot

    Think CRM software is just about contact management? Think again. HubSpot CRM has free tools for everyone on your team, and it’s 100% free. Here’s how our free CRM solution makes your job easier.
    Get free CRM
  • 5

    tmx2text

    Extract text data from tmx files

    Tmx2text provides a simple interface to extract text data from tmx translation memories. It is written in Python (requires Python3 or higher) and uses PyQt (Qt 4) and is released under the GPL. Although it was created for Linux it should work on other platforms where Python3 and PyQt4 are installed.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 6
    iCalDoit

    iCalDoit

    automating iCal...

    iCalDoit is a collection of software elements, that aims to be the "missing link" between iCal on the one side and Mac OSX workflow programming (Automator, services) on the other side. Updated nightly builds to bundle 0.5a2 for Snow Leopard... License: GNU GPL v.3
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7
    Automatic categorization of texts based on supplied controlled vocabularies. Is a php tool to extract terms from a text and use it to obtain keywords from a specific controlled vocabulary. Use the terminological web services provided by TemaTres.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8
    This is a library to extract raw unicode text from any written documents (office documents such as PDF, Word, OpenOffice, ...). It should be useful to developpers of search engine, text processing, corpus analysis, ....
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9

    PDFTools

    Basic PDF tools.

    Project contains two applications. PdfTextExtractor - extract text from pdf with text layer. PdfShapeDrawer - draw shapes on pdf files. Project use iTextSharp library (http://itextpdf.com/). This program is free software; you can redistribute it and/or modify it under the terms of the GNU Affero General Public License version 3 as published by the Free Software Foundation
    Downloads: 0 This Week
    Last Update:
    See Project
  • Deliver secure remote access with OpenVPN. Icon
    Deliver secure remote access with OpenVPN.

    Trusted by nearly 20,000 customers worldwide, and all major cloud providers.

    OpenVPN's products provide scalable, secure remote access — giving complete freedom to your employees to work outside the office while securely accessing SaaS, the internet, and company resources.
    Get started — no credit card required.
  • 10

    PhenoProcess

    A MATLAB tool to process canopy digital repeat photography

    This MATLAB tool / GUI allows you to process time series (digital repeat photography) of vegetation and extract a smoothed time series of the greenness of this vegetation across a growing season. The software allows you to select multiple regions of interest (save and load these) and output the resulting smoothed data series to text file for further processing and statistics. Compiled standalone versions of the MATLAB code will be provided soon.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11

    XVim

    XCode plugin for Vim keybindings

    This project has moved to https://github.com/JugglerShu/XVim This plugin enables you to control XCode source editor as if you are using Vim inside there. ### How to Install the Plugin ### Download the zip file from above and extract the file into "$(HOME)/Library/Application Support/Developer/Shared/Xcode/Plug-ins" directory. Make the directory if there is not. ### For XVim lite users ### Delete XVim_lite.xcplugin directory after or before installing XVim. Any...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    Lioness (Languages Interop Framework)
    Framework for making Windows applications that are one .exe file in AutoHotKey_L,C++,C#, VB.NET,Java,Groovy,Common Lisp,Nemerle,Ruby,Python,PHP,Lua,Tcl,Perl,Jint,S#,WSH VBScript,HTML/JavaScript/CSS,COM, PowerShell without compiling . For .NET 4.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 13
    The Hironico Db Tool is a graphical database client that can run on all major platforms today. It provides a powerfull, feature rich and user friendly set of tools to work with databases of any vendor using Java drivers while being fast & light.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    SNote is small program writen in c++ with QT. With this program you can to save your text and when you start program again thet text will be showed To install program just extract the archive and go to snote directory and run install.sh as root
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    SEMANTIXS is a semantic information extraction system that can extract, represent and visualize domain-specific information from free-text in the form of complex (and simple) relationships. Refer - http://www.cs.iastate.edu/~semantix/ for more info.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    Computer-Processable Versions of Engineering Standards To start an XML meta-language will be developed to embed in the marked up text of engineering codes. The goal is mining of specifications to automatically extract engineering requirements.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    New Project to Convert the Apache Tika text extraction tool to the .net platform. .NET library designed to extract text and from multiple document types most notably various office suites and multimedia types.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    BatchConvert PDF2Text
    This application enables you to batch extract text from single and multipage PDF documents - which were not originated from scanning - into text files. Options are available to select the text format and add page header and footer.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    ConvertPDFsToText Service
    This service will be available in Finder's Services menu and contextual menu if PDF files are selected. It enables you to batch extract text from single and multipage PDF documents - which were not originated from scanning - into text files.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    CD & DVD Autorun Kit
    This Autorun kit is 1,2,3 simple. Extract to the root folder, Edit the Autorun.info file and you will have an autorun menu for your CD, DVD, or USB Pen Distribution.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    Autshumato PTE (PDF Text Extractor) is a utility application which extracts the text from PDF documents with the aim of making it translatable. It is also able to extract the pages of the PDF document as PNG images.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    The Textract Project consists of C++ source code to extract text from a growing assortment of file formats. Output is indexing-ready. The Textract Project is intended as a foundation to support research-quality search engines.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    Create or parse ANY Mark-up Language (HTML XML X3D VRML MathML XAML XDP CDA SCORM COLLADA XBRL) file or string into a simple and versatile MLDocument, MLElement, MLParameter hierarchical object model, written in VB 6 (Win32). Alternative to using DOM.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    Simple and intuitive image uploader for TinyMCE wysiwyg editor. I've created this uploader/imagemanager because it is hard to find a decent for free out there.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    Extracts images(if any) from an .odt(Open Office Document Text) file. It takes command line arguments and saves the images at path given by the user itself. Runs only on GNU/Linux Systems.
    Downloads: 0 This Week
    Last Update:
    See Project