We can find another source of documentation in cvs/svn/git code repository commit messages. Assessing this data differs in interesting ways from source code comments, but it offers another public data source that is easy to get our hands on. Also provides a nice domain for applying more NLP-y style assessment.