python data analysis free download

iramuteq

IRAMUTEQ : Interface de R pour les Analyses Multidimensionnelles de Textes et de Questionnaires. Logiciel de traitement de données pour des corpus texte ou de type individus/caractères. Permet notamment de réaliser des analyses de type "ALCESTE"

Downloads: 1,105 This Week

Last Update: 2024-11-03

See Project

modnlp-plugins

External plugins for modnlp/teccli

This is a general project for modnlp/teccli plugins, with focus on text visualizaton.

Downloads: 0 This Week

Last Update: 2023-05-06

See Project

Apertium: Machine Translation Toolbox

The free and open-source rule-based machine translation platform

Apertium is a toolbox to build open-source shallow-transfer machine translation systems, especially suitable for related language pairs: it includes the engine, maintenance tools, and open linguistic data for several language pairs.

17 Reviews

Downloads: 9 This Week

Last Update: 2021-04-16

See Project

UnsupervisedMT

Phrase-Based & Neural Unsupervised Machine Translation

Unsupervised Machine Translation is a research repository that implements both phrase-based SMT and neural MT approaches for translation without parallel corpora. The neural component supports multiple architectures—seq2seq, biLSTM with attention, and Transformer—and allows extensive parameter sharing across languages to improve data efficiency. Training relies on denoising auto-encoding and back-translation, with on-the-fly, multithreaded generation of synthetic parallel data to continually...

Downloads: 0 This Week

Last Update: 8 hours ago

See Project

concordia

Powerful search library, best suited for computer-aided translation

Concordia - Roman goddess of agreement. Concordance searcher - tool for translators who need their translations to "agree" with one standard. Concordia is a C++ library for fast text lookup in large corpora. It uses a RAM stored index, which takes up approximately 600MB of memory for a corpus of 2 million sentences. It is based on the idea of a suffix array, enhanced by the presence of other auxiliary data structures. The effects are stunning - Concordia is able to do simple substring...

Downloads: 0 This Week

Last Update: 2019-02-28

See Project

ACOPOST - a collection of POS taggers

Part-of-speech tagging is the task of assigning symbols from a particular set to words in a natural language text. ACOPOST implements and extends well-known machine learning techniques and provides a uniform environment for testing.

1 Review

Downloads: 0 This Week

Last Update: 2016-02-26

See Project

Automatic Compound Processing (AuCoPro)

Automatic compound splitting and semantic analysis of compounds

The central problem to be addressed in this project concerns a multidisciplinary (linguistics and computational linguistics) investigation into sharing of knowledge and resources between closely-related languages, specifically relating to the automatic processing of compounds. Specifically, we will explore the possibility to create new knowledge about closely-related languages, and efficiently develop additional, more advanced resources for (a) compound segmentation; and (b) the semantic...

Downloads: 0 This Week

Last Update: 2015-07-28

See Project

EyeMap - Eye Movement Data Analyzer

EyeMap is a visualization and analysis tool for text reading eye movement data. It can process Unicode, proportion/non-proportion and spaced/unspaced reading materials, which supports various languages and experiment methods.

1 Review

Downloads: 2 This Week

Last Update: 2013-08-10

See Project

CRFSharp

CRFSharp is a .NET(C#) implementation of Conditional Random Field

CRFSharp(aka CRF#) is a .NET(C#) implementation of Conditional Random Fields, an machine learning algorithm for learning from labeled sequences of examples. It is widely used in Natural Language Process (NLP) tasks, for example: word breaker, postagging, named entity recognized, query chunking and so on. CRF#'s mainly algorithm is the same as CRF++ written by Taku Kudo. It encodes model parameters by L-BFGS. Moreover, it has many significant improvement than CRF++, such as totally...

Downloads: 0 This Week

Last Update: 2015-08-03

See Project

eurown

A Python module for EuroWordNet files and data.

Downloads: 0 This Week

Last Update: 2013-05-30

See Project

ELIA(eye-tracking for psycholinguistics)

ELIA(Eyegaze Language Integration Analysis) supports the analysis of eye-tracking data for studies in language processing. ELIA eases early analysis of data to enable iterative development of experiments in response to spoken language.

1 Review

Downloads: 0 This Week

Last Update: 2013-04-24

See Project

PyAnnotation

PyAnnotation is a Python Library to access and manipulate linguistically annotated corpus files. Supported file formats are Kura XML, Elan XML and Toolbox files. A Corpus Reader API is provided to support statistical analysis within the NLTK.

Downloads: 0 This Week

Last Update: 2013-04-29

See Project

KAF2Tiger2

KAF2Tiger2 is a KAF (KYOTO annotation format) to <tiger2/> (Tiger2 XML) converter.

Downloads: 0 This Week

Last Update: 2013-04-19

See Project

WordNetLMF

WordNetLMF converts WordNet (http://wordnet.princeton.edu/) lexicographer files into KYOTO-LMF, the LMF dialect used in the KYOTO project (http://www.kyoto-project.eu/).

Downloads: 0 This Week

Last Update: 2013-04-11

See Project

SWIPE' pitch extractor

This is a fast C implementation of Arturo Camacho's SWIPE' pitch extraction algorithm. See the project homepage for more about the advantages of the SWIPE' algorithm. swipe-1.0.tar.gz contains the current source, which should compile quite neatly.

Downloads: 0 This Week

Last Update: 2013-04-11

See Project

Varro

The Varro toolkit is a system for identifying and frequently recurring unordered subtrees in semi-structured data. It is mostly for linguistics but has applications in semi-structured data mining too.

Downloads: 0 This Week

Last Update: 2015-06-04

See Project

Scheme Natural Language Toolkit

The Scheme Natural Language Toolkit (S-NLTK) is a Scheme R6RS library for language and text processing, and various tasks related to symbolic and statistical analysis of language data.

Downloads: 0 This Week

Last Update: 2015-09-01

See Project

Wordcorr

Data management for comparative linguistics

Wordcorr automates the tedious and risky process of tabulating and managing the sound correspondences used in working out the historical development of natural languages. Initial support was from NSF.

4 Reviews

Downloads: 11 This Week

Last Update: 2013-01-05

See Project

Search Results for "python data analysis"

18 projects for "python data analysis" with 2 filters applied:

iramuteq

modnlp-plugins

Apertium: Machine Translation Toolbox

UnsupervisedMT

concordia

ACOPOST - a collection of POS taggers

Automatic Compound Processing (AuCoPro)

EyeMap - Eye Movement Data Analyzer

CRFSharp

eurown

ELIA(eye-tracking for psycholinguistics)

PyAnnotation

KAF2Tiger2

WordNetLMF

SWIPE' pitch extractor

Varro

Scheme Natural Language Toolkit

Wordcorr

Search Results for "python data analysis"

18 projects for "python data analysis" with 2 filters applied:

iramuteq

modnlp-plugins

Apertium: Machine Translation Toolbox

UnsupervisedMT

concordia

ACOPOST - a collection of POS taggers

Automatic Compound Processing (AuCoPro)

EyeMap - Eye Movement Data Analyzer

CRFSharp

eurown

ELIA(eye-tracking for psycholinguistics)

PyAnnotation

KAF2Tiger2

WordNetLMF

SWIPE' pitch extractor

Varro

Scheme Natural Language Toolkit

Wordcorr

Related Searches

Related Categories