VecText is an application that converts raw text into a structured format suitable for various data mining software. The application is written in Perl, an interpreted programming language. Part of its functionality is provided by external modules (e.g., Lingua::Stem::Snowball for stemming). The graphical user interface, implemented in Perl/Tk, makes the software usable without specialized technical skills or knowledge of a particular programming language, its libraries, or their functions. All preprocessing actions are specified using common graphical elements organized into logically related blocks. In command-line mode, all options must be specified through command-line parameters. This non-interactive mode makes it possible to incorporate the application into a larger data mining process that integrates several software packages, or to perform multiple conversions in a batch.

Features

  • document vector representation
  • natural language processing
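
To illustrate what a document vector representation looks like, here is a minimal sketch of a bag-of-words conversion with raw term-frequency weights. It is not VecText's own code (VecText is written in Perl); Python is used only for illustration. The function names `tokenize` and `build_vectors` are hypothetical, and the simple tokenization rule (lowercase, ASCII letters only, no stemming or stop-word removal) is an assumption made for brevity.

```python
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and return its runs of ASCII letters as tokens."""
    return re.findall(r"[a-z]+", text.lower())


def build_vectors(documents):
    """Convert raw documents into term-frequency vectors.

    Returns (vocabulary, vectors): vocabulary is a sorted list of all
    distinct tokens, and vectors holds one list of counts per document,
    aligned with the vocabulary order.
    """
    counts = [Counter(tokenize(doc)) for doc in documents]
    vocabulary = sorted(set().union(*counts))
    vectors = [[c[term] for term in vocabulary] for c in counts]
    return vocabulary, vectors


if __name__ == "__main__":
    vocab, vecs = build_vectors(["The cat sat.", "The cat saw the dog."])
    print(vocab)
    for vec in vecs:
        print(vec)
```

Each row is one document and each column one vocabulary term, which is the kind of structured input that data mining tools typically expect. A real preprocessing pipeline would usually add steps such as stemming and alternative term weighting before producing the vectors.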




Additional Project Details

Languages

English

Intended Audience

Science/Research

User Interface

Command-line, Tk

Programming Language

Perl

Related Categories

Perl Information Analysis Software, Perl Machine Learning Software, Perl Natural Language Processing (NLP) Tool

Registered

2016-04-07