Should now be able to extract words from all files in subdirectories. Performance problem with lists => will have to use lists for non-volatile storage and dictionaries for use in memory.
Created a project structure and architecture base frame.