Read Me
The project is aiming to do two things: First, track the total transit time it takes a news story to
travel from site A to site Z across the internet (total transit time = t3, get it? :P)
Additionally, we'll be indexing the content of the article at each hop and tracking the morphing of
the text of the article (or summaries) as it traverses the web. The goal here is to gather some
interesting data sets relating to how quickly news flows across the net, and how it changes based on
the genre of news site it hits (or if it changes at all).
The technologies used are as follows:
Main parsing system: PHP-CLI interfacing with TokyoCabinet / TokyoTyrant Database system
Main Front End (web): PHP
Controller scripts (add/remove/edit news sites we watch): PHP-CLI / Perl
If you are interested in helping the project out with code, database design, or anything that may
be of use, please visit our sourceforge page at: http://www.sourceforge.net/projects/t3study and
get in touch!
- Mike <mb1689@gmail.com>