irit_diff_sequences is a Python tool that produces lifespan sequences from an edit history.
The tool was first developed for the Wikipedia edit history but can easily be adapted to other applications. From a database containing the list of revisions for each article, it produces one CSV file per article containing the authored sequences and their lifespans.
Output format:
i,j,lifespan,author
with
- i : beginning of the character sequence
- j : end of the character sequence
- lifespan : number of edits the sequence has survived up to the latest revision
- author : author id of the sequence.
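
To make the output format concrete, here is a minimal sketch of how such rows could be derived from a list of revisions. It is not the tool's actual implementation, which is not described here. It assumes character-level diffs computed with the standard library's difflib, that j is an inclusive end index, and that lifespan counts the revisions made after the one that introduced the sequence. The function names lifespan_sequences and rows_to_csv are hypothetical.

```python
import csv
import io
from difflib import SequenceMatcher
from itertools import groupby


def lifespan_sequences(revisions):
    """Return (i, j, lifespan, author) rows for the latest revision.

    revisions: list of (author_id, text) pairs, oldest first (assumed input shape).
    """
    prev_text = ""
    prev_tags = []  # one (author, birth_index) tag per character of prev_text
    for index, (author, text) in enumerate(revisions):
        tags = []
        matcher = SequenceMatcher(None, prev_text, text, autojunk=False)
        for op, a0, a1, b0, b1 in matcher.get_opcodes():
            if op == "equal":
                # Surviving characters keep their original author and birth index.
                tags.extend(prev_tags[a0:a1])
            else:
                # "insert" and "replace" add b1 - b0 new characters; "delete" adds none.
                tags.extend([(author, index)] * (b1 - b0))
        prev_text, prev_tags = text, tags

    last = len(revisions) - 1
    rows = []
    pos = 0
    # Merge consecutive characters sharing the same author and birth revision.
    for (author, born), group in groupby(prev_tags):
        length = sum(1 for _ in group)
        rows.append((pos, pos + length - 1, last - born, author))
        pos += length
    return rows


def rows_to_csv(rows):
    """Serialize rows using the i,j,lifespan,author column order."""
    buffer = io.StringIO()
    writer = csv.writer(buffer, lineterminator="\n")
    writer.writerow(["i", "j", "lifespan", "author"])
    writer.writerows(rows)
    return buffer.getvalue()


if __name__ == "__main__":
    history = [(10, "abcdef"), (20, "abcXYdef"), (30, "abcXYdefZ")]
    print(rows_to_csv(lifespan_sequences(history)))
```

With this sample history, the text from author 10 is split into two sequences by the insertion from author 20, and the final character added by author 30 has a lifespan of 0 because no later edit exists.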

Additional Project Details
- Languages : English
- Intended Audience : Developers
- User Interface : Console/Terminal
- Programming Language : Python
- Database Environment : SQLite
- Related Categories : Python Information Analysis Software
- Registered : 2015-04-20