A Python tool that produces lifespan sequences from an edit history.
The tool was first developed for the Wikipedia edit history but can easily be adapted to other applications. From a database containing the list of revisions for each article, it produces one CSV file per article containing authored sequences and their lifespans.
Output format:
i,j,lifespan,author
with
- i : beginning of the character sequence
- j : end of the character sequence
- lifespan : number of edits the sequence has survived up to the latest revision
- author : author id of the sequence.
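The following is a minimal sketch of how such rows could be derived, not the tool's actual implementation. It assumes that revisions are given as (author id, text) pairs ordered oldest first, that j is an exclusive end offset, that only sequences still present in the latest revision are reported, and that a lifespan is the number of revisions made after the one that introduced the sequence. The function name lifespan_sequences is hypothetical, and the character diff uses Python's standard difflib rather than whatever diff the tool itself relies on.

```python
import csv
import io
from difflib import SequenceMatcher
from itertools import groupby


def lifespan_sequences(revisions):
    """Return (i, j, lifespan, author) rows for the latest revision.

    revisions: list of (author_id, text) pairs, oldest first (assumed layout).
    """
    text = ""
    origin = []  # origin[k] = (author, revision index) that introduced character k
    for rev_index, (author, new_text) in enumerate(revisions):
        new_origin = []
        matcher = SequenceMatcher(None, text, new_text, autojunk=False)
        for tag, i1, i2, j1, j2 in matcher.get_opcodes():
            if tag == "equal":
                # Surviving characters keep their original author and birth revision.
                new_origin.extend(origin[i1:i2])
            else:
                # Inserted or replaced characters belong to the current author;
                # for a pure deletion j2 - j1 is 0, so nothing is added.
                new_origin.extend([(author, rev_index)] * (j2 - j1))
        text, origin = new_text, new_origin

    last = len(revisions) - 1
    rows, start = [], 0
    # Merge consecutive characters sharing the same author and birth revision.
    for (author, born), run in groupby(origin):
        length = len(list(run))
        rows.append((start, start + length, last - born, author))
        start += length
    return rows


# Illustrative data only: three revisions by two made-up author ids.
revisions = [(101, "Hello"), (102, "Hello world"), (101, "Hello world!")]
rows = lifespan_sequences(revisions)

# Serialize in the i,j,lifespan,author format described above.
buffer = io.StringIO()
csv.writer(buffer).writerows(rows)
print(buffer.getvalue(), end="")
```

Tracking one (author, revision) pair per character keeps the sketch short; for long articles, storing runs instead of individual characters would use far less memory.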
Categories
Information Analysis