LF Aligner helps translators create translation memories from texts and their translations. It relies on Hunalign for automatic sentence pairing. Input: txt, doc, docx, rtf, pdf, html. Output: tab-delimited txt, TMX and xls. With web features. My email address is listed in readme.txt; for support, use the forum here. My personal website: www.farkastranslations.com.
Provides OCR solutions for Nepali, based on Tesseract 4.0.
NeOCR is free software based on Tesseract (Open Source OCR Engine) for the Windows operating system. It provides a user-friendly interface for recognizing text in images and PDF documents and converting it to editable text formats (.txt, .doc, .docx). The product is accessible to blind and visually impaired people (tested with NVDA and Narrator).
Note: for binaries and installation instructions, please visit the official website: http://ictfax.org/ ICTFAX is a multi-user, web-based business solution with advanced billing capabilities, featuring duration-based as well as per-unit billing. ICTFAX features email to fax, web to fax and fax to email, and supports G.711, PSTN and T.38 origination and termination.
Free office suite for working with text, spreadsheets and presentations
ONLYOFFICE Desktop Editors is an open source and 100% free office suite, combining text, spreadsheet and presentation editors for working on documents offline. The application features all types of formatting options and allows users to edit complex documents. Collaboration features such as reviewing and real-time co-editing are available as well. The editors offer 100% compatibility with MS Office and support other popular document formats including OpenDocument. The application also...
An open source system for Arabic corpora processing
.... Accept TXT, DOC, DOCX, RTF and HTML formats h. Export the processing results in CSV file format
AnyCount counts characters, words, lines, and pages in 37 file formats
AnyCount: the most accurate word-count software and the industry standard, compatible with 37 file formats. Microsoft Office: DOC, DOCX, RTF, XLS, XLSX, PPT, PPS, PPTX, PPSX, PUB, VSD, VSDX. Images and PDF: GIF, PNG, BMP, JPG, PDF. Open Office and Text files: ODT, SXW, SDW, ODS, SXC, SDC, ODP, SXI, SDD, TXT, CSV. HTML+, Help and other: HTML, XML, HLP, CHM, WPD, SLP, MIF, ZIP, RAR. The tool counts words not only in saved documents but also directly from websites. The 3D version allows users to impose...
Nextgen word app. Word Docs made easy!
Word Doctor is a word editor and writer's aid, designed to analyze writing "Content" and "Style". Inspire your creative process and get to work fast using dictation (Speech to Text), or use the Ink-Blot test to inspire creativity. Analyze what you already have and identify imagery, weak writing structures, and more. Content is king, and Word Doctor can certainly help with that!
IFile, a PHP-based framework for indexing and searching documents
Index documents using the Lucene Search Engine or MySQL Full-Text. IFile supports many types of documents: Rich Text Format (.rtf); Moving Picture Experts Group-1/2 Audio Layer 3 (.mp3); Joint Photographic Experts Group (.jpg - .jpeg); Tagged Image File Format (.tiff); Microsoft Word 97-2000 (.doc); Microsoft Word 2003-2007 (.docx); Microsoft Excel 97-2000 (.xls); Microsoft Excel 2003-2007 (.xlsx); Microsoft PowerPoint 2003-2007 (.pptx); OpenOffice.org Writer (.odt); OpenOffice.org Calc...
Cleaning and preparation of interview transcription corpora
Scripts developed as part of the SUMTEC project to prepare transcription corpora for use in RQDA and IRAMUTEQ. http://www.msh-lorraine.fr/index.php?id=623 The project contains 3 Perl programs. The goal is to take unstructured interview transcriptions and structure them as an XML tree. The benefit is, ultimately, being able to identify speaking turns and to separate the speech of the interviewees from that of the interviewers.
A free C# Word API to process Word documents generated by DocX
As a free C# Word API, it enables developers to edit, copy, create, print and convert Word documents (.docx, .doc) that were generated by DocX. By using DocX and Free Spire.Doc together, developers can work with Word documents in many more scenarios. Main features: Convert Word to RTF; Convert Word to PDF; Convert Word to Image; Convert Word to txt; Convert .docx to .doc; Insert Watermark; Insert Subscript and Superscript; Remove macros; Insert footnote; Insert endnote; Set highlight; Encrypt the word...
Group file share with advanced text parsing capability for easy search
Originally created as a church resource sharing system, phpShare&Search allows users to create accounts, share documents, search documents, and like or report documents. phpShare&Search's power comes from its advanced document parser, which extracts text from .PDF, .TXT, .DOC, and .DOCX files, and from its community features for liking resources and reporting them as inappropriate or spam. Users can also subscribe to weekly updates of new content. Users may choose to download and host/install/configure...
Hiring made easy
... resumes; Get great leads from Employee referrals; Source Resumes from print media & advertisements; Supports multiple formats (docx, doc, pdf, txt); Content parsing with keyword search; View applicant details - with scribd; Aggregate Applicants from various Sources; Track applicant state in tree format; Prevent applicant duplication; Enrich Job Requisitions; Candidate Pre-Screening - Exam. Assumption: apache-tomcat-6.0.24, mysql 5.1 and jdk1.6.0_14 are installed on your system.
Text To Speech converter
This application converts the given text into speech. The speech may be saved as a separate audio file for future use. It accepts .txt, .doc and .docx text files as input and converts the text into an audible .wav file.
A document clustering system with search & report generation features
... and gives similar documents to these. The 2nd feature returns each sentence containing the search term from the documents found. The report generation feature, intended specifically for audit companies, takes an audit report as input and outputs an insight log and a draft management letter with insights pulled from the report. This feature can be customised to suit a company's requirements. This software works with pdf, docx, txt and csv files, and the zip file must be saved in "My Documents".
This tool converts the ANSI representation of Tifinaghe to its Unicode representation, either for the content of a file in one of the following formats: .txt, .rtf, .doc, and .docx, or for the content of a text area.
A tool that can hide information in a Word 2007 file
The app uses the idea of text segments to hide information in a Word 2007 (.docx) file. Hiding the information does not affect normal use of the file. To hide information in a .docx file, use the app as follows: 1. Select the operation: "Embed" or "Extract". 2. Select a .docx file. 3. Select the file (.txt) that holds the secret information. 4. Click the "Run" button. To extract information embedded in a .docx file with the app, proceed as follows: 1. Select...
Rym-rebooks provides the following features: Stores users' book content in another form through indexing steps; the output is called indexed books. Supported file types: doc, docx, pdf, chm, htm, html, txt. Search function based on the indexed books. Suggestion function based on the indexed books. The search and suggestion functions may be used by developers to build a customized search engine over customized book content, using the export API to search. The goal of Rym...
Harmoni is a search engine application written in Python
Harmoni is a search engine application written in Python. It is focused not only on searching for files, but also on searching for keywords inside document files. Harmoni is easy to use and finds keywords quickly. Its "home" screen has a simple interface. Users can search for any string in several types of document (*.txt, *.html, *.docx, *.pptx, *.xlsx, *.pdf). There are two search methods that can be used to search for a keyword; fragment word...
Harmoni is a Python-based application for searching for files on your PC. It can also search for keywords in several document formats, such as .txt, html, docx, xlsx, pptx, and pdf. Harmoni is a fast search engine that also comes with some tools: multi-file deleting, renaming, moving, and so forth. It is just like Google on your computer. It is recommended for anyone who works with office documents.
LemonFlex, what is it? It is a content management system (CMS) that lets a webmaster run a fully dynamic site without needing to know how to program, since the site can be set up through simple configuration.
N-Pad is a lightweight text editor for Mac. It has many features that come in handy, for example when researching a project, such as the built-in web browser.
Expressub is software for creating and timing subtitles. Supported formats: audio: all formats recognized by mplayer (avi, mp4, wav, mp3 ...); video: VFW codecs compatible with the video are required; script: ass, txt, rtf, doc, docx
vxworks notepad docx
Notepad for vxWorks. Features include: 1. Edit .txt and .docx files. 2. Open/Save as .docx, .pdf, .html, .exlx formats. 3. Import from other formats. 4. As simple as possible. 5. Code size is less than 2M. If you are interested, please join us!
Java software that imports data from Microsoft Office (docx, pptx, xlsx), PDF, HTML, txt, image and other files into a DBMS (DB2, Oracle, MySQL, MSSQL, PostgreSQL)
Free reader for any kind of document or source data, written entirely in Java. Read using a single interface: - MS WORD 97-2003 .doc files - MS EXCEL 97-2003 .xls files - TEXT .txt files. Future versions: docx, odt, pdf and more. This project is part of the REQCHECKER software: http://sites.google.com/site/reqchecker