Showing 218 open source projects for "text extract"

  • 1

    Convert HTML to PDF in .NET with C#

    Convert HTML to PDF in .NET with C# using EVO HTML to PDF for .NET

    EVO HTML to PDF Converter for .NET is a library that can be easily integrated and distributed in your ASP.NET and MVC web sites, desktop applications, Windows services and Azure cloud services to convert web pages, HTML strings and streams to PDF, to images or to SVG and to create nicely formatted and easily maintainable PDF reports and documents. The converter has full support for HTML5, CSS3, SVG, Canvas, Web Fonts and JavaScript. Does not require installation or any third party tools. ...
    Downloads: 1 This Week
  • 2
    Php Email Extractor
    Extracts emails from text sources, removes duplicate emails, and removes unwanted words from emails.
    Downloads: 0 This Week
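The extract-and-deduplicate idea in the Php Email Extractor entry can be sketched as follows. This is a minimal Python illustration, not the project's PHP code: the function name `extract_emails`, the simplified address pattern, and the reading of "removes unwanted words" as "drop addresses containing a blocklisted word" are all assumptions.

```python
import re

# Deliberately simple pattern (an assumption); real address syntax is broader.
EMAIL_RE = re.compile(r"[A-Za-z0-9._%+-]+@[A-Za-z0-9.-]+\.[A-Za-z]{2,}")


def extract_emails(text, unwanted=()):
    """Return unique addresses in first-seen order.

    Addresses containing any word from `unwanted` are skipped
    (hypothetical interpretation of the listing's description).
    """
    seen = set()
    result = []
    for match in EMAIL_RE.findall(text):
        email = match.lower()  # compare case-insensitively
        if email in seen:
            continue
        if any(word in email for word in unwanted):
            continue
        seen.add(email)
        result.append(email)
    return result
```

Calling `extract_emails(page_text, unwanted=("noreply",))` would return each remaining address once, in the order it first appeared.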
  • 3
    DoAllWithPDF_servicemenu

    KDE servicemenu for pdf

    Allows KDE users to do many things with a right click on a PDF file.
    Downloads: 1 This Week
  • 4
    Savvy DOCX Recovery

    Open corrupt Word DOCX files and possibly recover formatting too.

    ... subfiles of the DOCX. If this doesn't work a second attempt is made where the corrupt XML subfiles are truncated at the first error, and the correct ending tags are again added with xmllint. If all else fails, SilverCoder's DocToText is used to extract text. Try also http://wordcorruptdocchecker.codeplex.com/ and https://support.microsoft.com/en-us/kb/2528942 and my other SF projects: Corrupt Extractor for Microsoft Office, Corrupt DOCX Salvager, S2 Recovery Tools for Microsoft Word.
    Downloads: 287 This Week
  • 5
    ZORE is a syntax-based Chinese (Zh) ORE system, which can extract relations and semantic patterns from Chinese text. ZORE identifies relation candidates from automatically parsed dependency trees, and then extracts relations with their semantic patterns iteratively through a novel double propagation algorithm. Empirical results on two data sets show the effectiveness of the proposed system. This software source is under GPL (v.3), and a separate commercial license issued by the authors for non...
    Downloads: 0 This Week
  • 6
    Blogspot To Docx

    Blogger To Docx

    Created your blog online, or on your phone? Use BlogspotToDocx.exe to reverse engineer your blog to your original transcript/text. With no images and hyperlinks! In docx format! It uses your Blogger RSS feed to extract your text. Blogger to docx. Made according to Google’s Blogger Data API. It is written in Java, so you need Java on your system; if you do not have it, download the Java JRE from http://java.com/download. Unzip it and put it on your desktop. RaY
    Downloads: 0 This Week
  • 7

    Personalized Search Engine

    Personalized Search Engine for Your Files

    ... also extract text content from files of many widely used file types such as pdf, doc, ppt, and mp3 to improve the index quality.
    Downloads: 0 This Week
  • 8

    eLibrary

    Personalized Search Engine for Commonly Used Files

    eLibrary (electric library) is Java software for searching files and folders in an OS file system. It differs from general OS file search engines in that it personalizes the indexing setup, so users can choose which directories to index or remove from an existing index, and it can also suggest queries just like Google's "Did you mean" feature. The customization of indexing and query suggestion greatly improves search speed and makes the user experience more comfortable. eLibrary can also extract...
    Downloads: 0 This Week
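A "Did you mean"-style query suggestion like the one the eLibrary entry mentions could be approximated with fuzzy string matching. The sketch below is an assumption about the general technique, not eLibrary's Java implementation; `suggest_query` and the `cutoff` value are hypothetical, and it relies only on `difflib.get_close_matches` from the Python standard library.

```python
import difflib


def suggest_query(query, indexed_terms, cutoff=0.6):
    """Suggest the closest indexed term for a query that is not in the index.

    Returns None when the query is already indexed or nothing is similar
    enough. `cutoff` is a similarity threshold between 0 and 1.
    """
    if query in indexed_terms:
        return None
    matches = difflib.get_close_matches(query, indexed_terms, n=1, cutoff=cutoff)
    return matches[0] if matches else None
```

A search front end could call `suggest_query(user_input, index_vocabulary)` and show the result as a suggestion only when it is not `None`.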
  • 9

    Pdf Text Extractor

    A Java Application that extracts text from pdf files.

    A Java application that extracts text from PDF files. The user can select different areas of the PDF file and extract text from those areas. Text can be extracted from single or multiple pages. Bookmarks can be generated on the basis of font heights entered by the user.
    Downloads: 0 This Week
  • 10
    Jar Ajar is a JAR-based self-extractor for zip files. Zip up files and package them with descriptive images and text using Jar Ajar's graphical interface. When recipients launch the resulting JAR, Jar Ajar guides users through the unzip process.
    Downloads: 0 This Week
  • 11
    Products of the project: Java HTMLParser - VietSpider Web Data Extractor - Extractor VietSpider News. Click on "Show project details" to see more features of each product.
    Downloads: 0 This Week
  • 12
    textextract

    A program for transcribing lessons and lectures

    Extract text from media. This program lets you transcribe audio lessons and lectures. You can save a session and restore it later. You can save to several formats, such as HTML and PDF. You can get support from this site: h
    Downloads: 0 This Week
  • 13
    Corrupt PPTX Salvager

    A corruption-ignoring unzipper and regular expressions extract PPTX text

    The biggest cause of PPTX corruption appears to be zip problems. This GUI uses a somewhat corruption-immune unzipper, 7zip. 7zip sometimes succeeds in extracting the slide XML files that contain the text from corrupt PPTX files where PowerPoint 2007 - 2013 fail with their built-in unzipper. Furthermore, Corrupt PPTX Salvager uses regular expressions to extract the text from these slide XML files rather than getting hung up on correct XML structure as PowerPoint seems to do...
    Downloads: 0 This Week
  • 14
    Corrupt Office File Salvager

    Extract text/data from corrupt MS Office and Open Office files.

    This program will extract the text from some corrupted or all healthy Microsoft Office and Open Office files with the extensions doc, docx, xls, xlsx, ppt, pptx, odt, ods and odp. It may succeed at doing so where MS Office and Open Office fail to salvage text/data. It can also attempt to recover formatting in the form of a full Open Office file with a regular odt, ods or odp extension. At this time there is no facility for recovering anything but basic formatting for Microsoft Office files...
    Downloads: 1 This Week
  • 15

    PDF Comaprision JINI

    This project is forged to compare two PDFs

    This project is forged to compare two PDFs. It uses the following approach in comparison:
    1. Extract all text from both PDFs and compare it page by page.
    2. Extract all images from both PDFs, save them in folders, then compare them one by one and save the differences in the Difference folder.
    3. Convert the pages of PDF 1 and PDF 2 to JPG and compare them one by one.
    Downloads: 0 This Week
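The first step of that approach, comparing extracted text page by page, can be sketched as below. This is a hedged illustration only: it assumes the text of each page has already been extracted into a list of strings by some PDF library (not shown), and `compare_pages` is a hypothetical helper, not the project's code.

```python
import difflib


def compare_pages(pages_a, pages_b):
    """Return {page_number: unified-diff lines} for pages whose text differs.

    `pages_a` and `pages_b` are lists of page texts (assumed to be extracted
    beforehand). A page missing from one document is treated as empty text.
    """
    differences = {}
    total = max(len(pages_a), len(pages_b))
    for i in range(total):
        text_a = pages_a[i] if i < len(pages_a) else ""
        text_b = pages_b[i] if i < len(pages_b) else ""
        if text_a != text_b:
            diff = difflib.unified_diff(
                text_a.splitlines(),
                text_b.splitlines(),
                fromfile=f"a/page{i + 1}",
                tofile=f"b/page{i + 1}",
                lineterm="",
            )
            differences[i + 1] = list(diff)
    return differences
```

Pages are numbered from 1 in the result, so an empty dictionary means the two documents have identical text on every page.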
  • 16
    UTX Converter

    Format Converter of UTX Format File

    UTX Converter is a format converter for UTX format files. UTX (Universal Terminology eXchange) is a standard glossary format. See http://www.aamt.info/english/utx/ or http://www.aamt.info/japanese/utx/ for more information. UTX Converter provides the following functions:
    - UTX
      - To verify UTX file format
      - To extract forbidden words
      - To extract pairs of forbidden word and approved word
      - To extract pairs of non-standard word and approved word
    - Conversion...
    Downloads: 5 This Week
  • 17
    Corrupt Extractor for Microsoft Office

    Extracts text/data from corrupt MS Office 2007-13 format files.

    Corrupt Office 2007 Extractor will extract the text/data from corrupt docx, xlsx, and pptx files where the respective MS Office programs error out and refuse to open them. In advanced mode the program can fix the zip structure of "Office Open XML" format files, a step which I now recommend despite our dissuasive blurb which comes up when you start that function. Advanced mode also allows recovering images and includes a basic editor for editing the corrupt XML subfiles. Additionally I...
    Downloads: 0 This Week
  • 18

    isbntools

    A command line tool to extract, transform and get metadata for ISBNs

    As of 2015-06-02, this project is no longer under active development.
    Downloads: 0 This Week
  • 19
    Corrupt DOCX Salvager

    Extract text from corrupt DOCX files where Word itself fails.

    Previously known as Damaged DOCX2TXT, this GUI program will extract text from damaged/corrupted Word 2007 - 2013 DOCX format documents. DOCX files are actually zipped collections of mostly XML files. The main text in DOCX files is found in the document.xml file in the collection. Corrupt DOCX Salvager uses 7Zip, an unzipper that sometimes unzips partially corrupt document.xml files despite reporting an error. XML as a format is unforgiving of data corruption but Corrupt DOCX Salvager uses...
    Downloads: 0 This Week
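The structure described in the Corrupt DOCX Salvager entry (a zip archive whose main text lives in document.xml) can be illustrated with a short sketch. It is not the project's implementation: it uses Python's standard `zipfile` module rather than 7Zip, so it will not tolerate damaged archives the way the listing describes, and the function name `docx_text` and the tag-stripping regular expressions are assumptions. Stripping tags with a regular expression, instead of parsing, means broken XML structure does not stop the extraction.

```python
import html
import io
import re
import zipfile

PARAGRAPH_END = re.compile(r"</w:p>")  # end of a Word paragraph element
TAG = re.compile(r"<[^>]+>")           # any complete XML tag


def docx_text(source):
    """Extract plain text from word/document.xml without parsing the XML.

    `source` is a path or a binary file-like object holding a DOCX file.
    """
    with zipfile.ZipFile(source) as archive:
        raw = archive.read("word/document.xml")
    xml = raw.decode("utf-8", errors="replace")
    xml = PARAGRAPH_END.sub("\n", xml)  # keep paragraph boundaries as newlines
    text = TAG.sub("", xml)             # drop tags, valid structure or not
    return html.unescape(text).strip()  # turn &amp; and similar back into text
```

For a quick check without a real document, an in-memory archive containing only `word/document.xml` can be built with `io.BytesIO` and `zipfile.ZipFile(..., "w")` and passed to `docx_text`.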
  • 20

    MusicTagInfo

    Extract info from music files (single or playlist) to results.ini

    ... (when called by an external program) -c (beta) convert text for output to codepage 850 (west European) for console Data Exchanging > output.ini for save file (default DOS function) Based on BASS library by un4seen.com, url: www.un4seen.com/bass.html Refer to its license. I developed this program as part of my Music_Browser: http://sf.net/p/vespadjapps
    Downloads: 0 This Week
  • 21
    ArabicDiacritizer

    An automatic restoration of Arabic diacritic marks

    ...
    - Extract the archive "ArabicDiacritizer Setup.rar".
    - Install the application using "Setup.exe".
    - Put an Arabic text in the Text Box.
    - Start the diacritization process.
    If the following problem occurs: <Access to the path '..\ArabicDiacritizer v1.0\text.data' is denied>
    - Go to the path "Program Files\ArabicDiacritizer\ArabicDiacritizer v1.0\".
    - Right click on "ArabicDiacritizer".
    - Choose "Run as administrator".
    For further information, please contact: rebai_ily
    Downloads: 0 This Week
  • 22

    TextHunter

    User friendly toolkit to extract structured data from free text.

    TextHunter is being developed to support researchers who need to extract information from large volumes of free text. For example, in epidemiology, clinical researchers often need to review large numbers of electronic medical records to identify variables that describe the patient (symptomatology, medication status etc). This is often a tedious process, increasing the difficulty and cost of research. To address this, TextHunter provides two key tools: - An efficient annotation interface...
    Downloads: 0 This Week
  • 23

    reconfig

    This program can manage input/output database of text files data

    This package can get configuration from files, pages and databases. The main class can get input data from files, remote site pages, a database or the result of executing given PHP code. The class can extract values from the input data using regular expressions or custom PHP code and save them to files or a database. Currently it supports MySQL, PostgreSQL and Microsoft SQL Server databases.
    Downloads: 0 This Week
  • 24
    iGi Reporter

    Write reports; read multiple Notepad, PDF and Word files; design logos and pictures

    ... Write reports at high speed. Fourth problem: you want to read a PDF, extract text from a PDF, and extract text from Word documents. In this program you can read Word, PDF and TXT files, and you can edit them. You can also use some mathematical operations
    Downloads: 0 This Week
  • 25
    AnalysesOracle Performance Doctor is a tool which helps you to understand the behavior of your application at the Oracle database level. The tool does the following: 1. Extracts the execution plan for all SQL statements executed by a specified db user and prints it to a file on the server side. 2. For all executed SQL statements, provides a better-performing version of the SQL text by using the built-in module dbms_sqltune.
    Downloads: 0 This Week