I'm working on a project similar to what you describe. It extracts words from the Icelandic Wiktionary project and merges with another wordlist. Other languages and sources, e.g. wikipedia can be added with ease. Here is a link to the project: http://launchpad.net/hunspell-is .