Showing 262 open source projects for "text extract"

View related business solutions
  • Nectar: Employee Recognition Software to Build Great Culture Icon
    Nectar: Employee Recognition Software to Build Great Culture

    Nectar is an employee recognition software built for the modern workforce.

    Our 360 recognition & rewards platform enables everyone (peer to peer & manager to employees alike) to send meaningful recognition rooted in core values. Nectar has the most extensive rewards catalog so users can choose from company branded swag, Amazon products, gift cards or custom reward types. Integrate with your other tools like Slack and Teams to make sending recognition easy. We support top organizations like MLB, SHRM, Redfin, Heineken and more.
  • Let your volunteer coordinators do their best work. Icon
    Let your volunteer coordinators do their best work.

    For non-profit organizations requiring a software solution to keep track of volunteers

    Stop messing with tools that aren’t designed to amplify volunteer programs. With VolunteerMatters, it’s a delight to manage everything in one place.
  • 1

    Personalized Search Engine

    Personalized Search Engine for Your Files

    ... also extract text content from files of many wildly used file types such as pdf, doc, ppt, and mp3 to improve the index quality.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 2

    eLibrary

    Personalized Search Engine for Commonly Used Files

    eLibrary (electric library) is a Java software to search files and folders in an OS file system. It differs from general OS file search engines in that it personalizes the indexing setup so that users can choose which directories to index or remove from an existing index and it can also suggest queries just like Google's "Did you mean" feature. The customization of indexing and query suggestion greatly improves search speed and make user experience more comfortable. eLibrary can also extract...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 3

    AutomaticParser

    Automatic Parser

    Automatic Parser of Text is based on supervised learning. It can extract entities from your document without any coding and investigation of features. You should only mark entities in some documents to use machine learning algorithm.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 4

    Pdf Text Extractor

    A Java Application that extracts text from pdf files.

    A Java Application that extracts text from pdf files. User can select different areas on the pdf file and can extract text from those areas.Extraction of text can be done for single or multiple pages. Generate Bookmarks on the basis of Font Heights entered by the user.
    Downloads: 0 This Week
    Last Update:
    See Project
  • High-performance Open Source API Gateway Icon
    High-performance Open Source API Gateway

    KrakenD is a stateless, distributed, high-performance API Gateway that helps you effortlessly adopt microservices

    KrakenD is a high-performance API Gateway optimized for resource efficiency, capable of managing 70,000 requests per second on a single instance. The stateless architecture allows for straightforward, linear scalability, eliminating the need for complex coordination or database maintenance.
  • 5
    ePUBator

    ePUBator

    Minimal offline PDF to ePUB converter for Android

    Minimal offline PDF to ePUB converter for Android - ©2011 Ezio Querini ePUBator extract text from a PDF file and put it in a well formed (epubcheck compliant) ePUB file. PDF extraction based on iText library <http://itextpdf.com/> released under the AGPL license. - ePUBator IS THINKED FOR BOOKS (NOT FOR EVERY TYPE OF PDF), BUT IF YOU NEED A BETTER RESULT TRY SOMETHING ELSE LIKE CALIBRE. - ePUBator doesn't need internet connection (doesn't send your docs somewhere on the net...
    Leader badge
    Downloads: 36 This Week
    Last Update:
    See Project
  • 6
    Jar Ajar is a JAR-based self-extractor for zip files. Zip up files and package them with descriptive images and text using Jar Ajar's graphical interface. When recipients launch the resulting JAR, Jar Ajar guides users through the unzip process.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7
    Newspaper3k

    Newspaper3k

    News, full-text, and article metadata extraction in Python 3

    .... Although installing newspaper is simple with pip, you will run into fixable issues if you are trying to install on ubuntu. Source objects are an abstraction of online news media websites like CNN or ESPN. You can initialize them in two different ways. Building a Source will extract its categories, feeds, articles, brand, and description for you. You may also provide configuration parameters like language, browser_user_agent, and etc seamlessly.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 8
    Chameleon

    Chameleon

    Color framework for Swift and Objective-C

    Chameleon is a lightweight, yet powerful, color framework for iOS (Objective-C & Swift). It is built on the idea that software applications should function effortlessly while simultaneously maintaining their beautiful interfaces. With Chameleon, you can easily stop tinkering with RGB values, wasting hours figuring out the right color combinations to use in your app, and worrying about whether your text will be readable on the various background colors of your app. With a plethora of color...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9
    Products of the project: Java HTMLParser - VietSpider Web Data Extractor - Extractor VietSpider News. Click on "Show project details" to see more feature about each product.
    Downloads: 0 This Week
    Last Update:
    See Project
  • The Voice API that just works | Twilio Icon
    The Voice API that just works | Twilio

    Build a scalable voice experience with the API that's connecting millions around the world.

    With Twilio Voice, you can build unique phone call experiences with one API, to create, receive, control and monitor calls with just a few lines of code. Create an engaging voice experience that you can quickly scale and modify with a wide array of customization options and resources.
  • 10
    textextract

    textextract

    برنامج تفريغ الدروس والمحاضرات

    extract text from mdia يتيح لك هذا البرنامج تفريغ الدروس والمحاضرات الصوتية. يمكن حفظ الجلسة ثم استعادنها. يمكن الحفظ لعة تنسيقات مثل html pdf. يمكنك الحصول على الدعم من هذا المقع h
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11
    Corrupt PPTX Salvager

    Corrupt PPTX Salvager

    A corruption ignoring unzipper & regular expressions extract PPTX text

    The biggest cause of corruption of PPTX corruption appears to be zip problems. This GUI uses a somewhat corruption immune unzipper, 7zip. 7zip sometimes succeeds in extracting the slide xml files that contain the text from corrupt pptx files where PowerPoint 2007 - 2013 fail with their built in unzipper. Furthermore Corrupt PPTX Salvager uses regular expressions to extract the text from these slide XML files rather than getting hung up on correct XML structure as PowerPoint seems to do...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    Corrupt Office File Salvager

    Corrupt Office File Salvager

    Extract text/data from corrupt MS Office and Open Office files.

    This program will extract the text from some corrupted or all healthy Microsoft Office and Open Office files with the extensions .doc, docx, xls, xlsx, ppt, pptx, odt, ods and odp. It may succeed at doing so where MS Office and Open Office fail to salvage text/data. It can also attempt to recover formatting in the form of a full Open Office file with a regular, odt, ods or odp extension At this time there is no facility for recovering anything but basic formatting for Microsoft Office files...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 13

    PDF Comaprision JINI

    This project is forged to compare two PDFs

    This project is forged to compare two PDFs . IT uses following approach in compression 1 . Extract All text of both pdfs and compare them Page by Page 2. Extract all images from both PDF and save in folders and then compare them one by one and save difference in Difference Folder 3. Convert PDF 1 and 2 pages to JPG and compare them one by one
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    UTX Converter

    UTX Converter

    Format Converter of UTX Format File

    UTX Converter is a format converter of UTX format file. UTX (Universal Terminology eXchange) format is a standard format of glossary. See http://www.aamt.info/english/utx/ or http://www.aamt.info/japanese/utx/ for more information. UTX Converter provides the following functions: - UTX   - To verify UTX file format   - To extract forbbidden words   - To extract pairs of forbidden word and approved word   - To extract pairs of non-standard word and approved word - Conversion...
    Downloads: 5 This Week
    Last Update:
    See Project
  • 15
    Corrupt Extractor for Microsoft Office

    Corrupt Extractor for Microsoft Office

    Extracts text/data from corrupt MS Office 2007-13 format files.

    Corrupt Office 2007 Extractor will extract the text/data from corrupt docx, xlsx, and pptx files where the respective MS Office files error out and refuse to open. In advanced mode the program can fix the zip structure of "Office Open XML" format files, a step which I now recommend despite our dissuasive blurb which comes up when you start that function. Advanced mode also allows recovering images and includes is a basic editor for editing the corrupt XML subfiles. Additionally I...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16

    isbntools

    A command line tool to extract, transform and get metadata for ISBNs

    As of 2015-06-02, this project is no longer under active development.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    Corrupt DOCX Salvager

    Corrupt DOCX Salvager

    Extract text from corrupt DOCX files where Word itself fails.

    Previously known as Damaged DOCX2TXT, this GUI program will extract text from damaged/corrupted Word 2007 - 2013 DOCX format documents. DOCX files are actually zipped collections of mostly XML files. The main text in docx files is found in document.xml file in the collection. Corrupt DOCX Salvager uses 7Zip, an unzipper that sometimes unzips partially corrupt document.xml files despite reporting an error. XML as a format is unforgiving of data corruption but Corrupt DOCX Salvager uses...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18

    MusicTagInfo

    Extract info from music files (single or playlist) to results.ini

    ... (when called by an external program) -c (beta) convert text for output to codepage 850 (west European) for console Data Exchanging > output.ini for save file (default DOS function) Based on BASS library by un4seen.com, url: www.un4seen.com/bass.html Refer to its license. I developed this program as part of my Music_Browser: http://sf.net/p/vespadjapps
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    ArabicDiacritizer

    ArabicDiacritizer

    An automatic restoration of Arabic diacritic marks

    ... *************** - Extract the archive "ArabicDiacritizer Setup.rar". - Install the application using "Setup.exe". - Put an Arabic text in the Text Box. - Start the diacritization process. If the following problem occured: <Access to the path '..\ArabicDiacritizer v1.0\text.data' is denied> - Access to the path "Program Files\ArabicDiacritizer\ArabicDiacritizer v1.0\", - Right click on "ArabicDiacritizer" - Choose "Run as administrator" For further information, please contact: rebai_ily
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20

    Blender Auto Update Bash Script

    Update your Blender on Linux to the latest version - automatically

    Tested on Ubuntu this script goes out and grabs you a new blender build and puts in in $HOME/Blender Builds/Bleeding Edge. Defaults to gooseberry branch. Look at the defines at the top for tweekability. IMPORTANT: You must have wget installed first To install, in a terminal type: sudo apt-get install wget To use: extract: Upgrade Blender.sh to a directory open up the file in your text editor to tweak options. See inside the file for more info on that. Then run: Most distros...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21

    TextHunter

    User friendly toolkit to extract structured data from free text.

    TextHunter is being developed to support researchers who need to extract information from large volumes of free text. For example, in epidemiology, clinical researchers often need to review large numbers of electronic medical records to identify variables that describe the patient (symptomatology, medication status etc). This often a tedious process, increasing the difficulty and cost of research. To address this, TextHunter provides two key tools: - An efficient annotation interface...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22

    reconfig

    This program can manage input/output database of text files data

    This package can get configuration from files, pages and databases. The main class can get input data from files, remote site pages, a database or the result of execution of given PHP code. The class can extract values from the input data using regular expressions or custom PHP code and saves it to files or a database. Currently it supports MySQL, PostgreSQL and Microsoft SQL server databases.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    iGi Reporter

    iGi Reporter

    Write Report,Read Multiple Notepad,pdf,Word,Design Logo and picture

    ... Write Report With High Speed ------------------------------------------------------------------- Fourth Problem:- problem is,if You Want To Read Pdf , extract Text From Pdf , And extract text From Word Documents. in this Program You Can Read:-Word-PDF-TXT And You Can Edit in it. -------------------------------------------------------------------- You Can Use Some Operation im Mathmatical
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    AnalysesOracle Performance Doctor is a tool which helps you to understand the behavior of your application on Oracle database level. The tool does: 1. Extract execution plan for all SQL’s executed by specified db user and print it to file on server side. 2. For all executed SQL statements the tool will provide more performance version of sql text by using built-in module dbms_sqltune.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    docx2txt

    docx2txt

    Perl based utility to extract formatted text content from MS Docx file

    Docx2txt is a Perl based command-line utility to convert (even corrupted) Microsoft docx documents to reasonably formatted text files, along with appropriate character conversions. Apart from Perl it also requires a command line unzipping program like unzip/7z/pkzipc/wzunzip.
    Leader badge
    Downloads: 43 This Week
    Last Update:
    See Project