I've completed the development of a working backend. It is very much at a development stage with lots of room for improvements. It's undocumented and crude but it works. A list of UNC paths are generated with a command file creating a list of indexes of the files of interest. This list is then stripped and uploaded into a database, verifying access to the files, extraacting size and date info and inserted into the database of file names. The list of files are then converted to text files using a file to text conversion utility (various options available here) and the text file is parsed and stripped ito a list of meaningfull words using a PHP scripts. The list of words are then sorted, uploaded firstly into a common wordlist and secondly into a document refrence list mapping words to a specific document reference.... read more