Showing 8 open source projects for "mining pdf"

  • 1
    Text Analysis Markup System
    Text Analysis Markup System (TAMS) is both a system of marking documents for qualitative analysis and a series of tools for mining information based on that syntax.
    Downloads: 26 This Week
  • 2
    TEXminer

    Text Mining Classification for Texts in ASCII, Unicode and PDF Format.

    TEXminer uses generic Text Mining Methods to analyze Unicode Files as plain Text or PDF. The Text Database can be saved in XML, where the original Text, the Sentence and Word Lists and additional Parameters (e.g. Abbreviations) are stored. TEXminer allows Language Detection by Letter Frequency Analysis, finding important Words by Cooccurrence Analysis, Determination of Central Expressions, Thematic Text Classification (also Semantic Groups) and Fingerprint Comparison. Because TEXminer...
    Downloads: 6 This Week
  • 3
    DocWire SDK

    Award-winning modern data processing in C++17/20

    DocWire SDK, a C++17/20 data processing tool, has received an award from SourceForge and strong backing from Microsoft. It handles nearly 100 file types and supports text extraction, web data extraction, and document analysis. The upcoming integration of C++17 and C++20 will bring advanced functionalities, particularly in areas like HTTP capabilities and web data extraction. For businesses, the shift to DocWire SDK signifies a leap forward. It promises comprehensive document...
    Downloads: 13 This Week
  • 4
    Decaleon

    Multilingual Esperanto Translator, Word Dictionary, Vocabulary Trainer

    ... Characters; Export into Text, HTML, TeX, PDF Files; Text files may be imported into other Vocabulary Training Software. Version 6.0 adds a large set of standard words and supports another 23 Languages, with a small Vocabulary for: Albanian, Bulgarian, Czech, Dutch, Finnish, Hungarian, Norwegian, Romanian, Serbian, Slovak, Slovene, Turkish, Ukrainian, Interslavic, Arabic and Asian Languages. The SourceForge Project TEXminer uses the same XML Database for Text Mining. Cooccurrences in development.
    Downloads: 0 This Week
  • 5

    FastaTools

    Performs several operations on Fasta protein databases

    FastaTools performs several operations on Fasta protein databases. For more information, see the README.md file in the source code tree: https://sourceforge.net/p/lp-csic-uab/fastatools/code/ci/default/tree/README.md Or you can download the Documentation and Tutorial PDF file in the Files section: https://sourceforge.net/projects/fastatools.lp-csic-uab.p/files/FastaTools%20Documentation%20and%20Tutorials.pdf - Gallardo, Ó., Ovelleiro, D., Gay, M., Carrascal, M...
    Downloads: 0 This Week
  • 6

    JSentiWordNet

    A wrapper for SentiWordNet, a lexical resource for opinion mining

    This project aims to provide a wrapper around SentiWordNet, a lexical resource for opinion mining. As defined by the authors: SentiWordNet assigns to each synset of WordNet three sentiment scores: positivity, negativity, objectivity. You can find additional information about the creation of SentiWordNet here: http://nmis.isti.cnr.it/sebastiani/Publications/LREC06.pdf SentiWordNet (available here: https://drive.google.com/open?id=0B0ChLbwT19XcOVZFdm5wNXA5ODg) is a text file...
    Downloads: 0 This Week
  • 7
    freeDatamap

    FreeDatamap spatializes the map of your organization’s data.

    ... navigate and drill down from macro data to micro. FreeDatamap delivers a fast and visually attractive user interface that runs on any device: computers, tablets or phones. Feature list: • Unlimited users: full web 2.0 application • Data visualization: one centralized trusted map for all your data • Workflow and business process visualization • Search capabilities • Report creation • Advanced analytics • Data mining • Dashboard, gauges, alerts
    Downloads: 0 This Week
  • 8
    pydocrawl automatically downloads PDF, PS and DOC files from web sites. An initial URL and a wordlist must be given. It is a multithreaded information mining (harvesting) tool written entirely in Python. Version 0.1 runs successfully on Linux and Cygwin.
    Downloads: 0 This Week