A tool for discovering external dependencies in Office files.
The Dependency Discovery Tool searches through binary office files (.doc, .xls and .ppt) and tries to find any documents or files that are linked to the document.
It is written in java, using the Apache POI libraries ( http://poi.apache.org )
This project was part of a summer scholarship from the School of Engineering and Computer Science at Victoria University, Wellington (http://ecs.vuw.ac.nz) in conjunction with Archives New Zealand ( http://archives.govt.nz ).
[2014-10-31] This project is obsolete, for latest version (6.1.3) see GitHub https://github.com/digital-preservation/droid (source) and http://www.nationalarchives.gov.uk/information-management/manage-information/preserving-digital-records/droid/ (binary)
DROID (Digital Record Object Identification) is an automatic file format identification tool. It is the first in a planned series of tools developed by The National Archives under the umbrella of its PRONOM technical registry service.
[2013-01-24] The binary download of the latest version of DROID has now been moved to The National Archives website: http://www.nationalarchives.gov.uk/information-management/projects-and-work/droid.htm
The source code for the latest version of DROID remains available via Github: http://digital-preservation.github.com/droid/
[2012-09-07] DROID 6.1 has been released. ...
This project defines the Simple API for Binary REpresentations (SABRE) for processing hierarchically structured, binary-oriented documents, comparable to the Simple API for XML (SAX). The library is e.g. used in the Java ISO Image Creator (JIIC).