#7 Pre- and post-stopwords

open
nobody
None
5
2008-09-07
2008-09-07
Samuel Murray
No

In many languages, certain words are always associated with certain other words. For example, in English the words "the" and "a" are associated with the words that follow them, and certain prepositions are associated with the words that precede them.

It would be great if a user could specify two lists of stopwords for their language: one with words that should be displayed together with the word that follows them, and one with words that should be displayed together with the word that precedes them. These lists should also be able to contain multi-word stopwords.

So if "I", "we", "he" and "she" were in the pre-stopword list, and "the" and "this" were in the post-stopword list, the following sentence would be displayed as follows:

Yesterday I saw the man who read this newspaper.

Yesterday
I saw
the man
who
read
this newspaper
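
The grouping could be sketched roughly as follows. This is only an illustration of the idea, not code from the program: the function name chunk_words and its parameters are hypothetical, matching is case-insensitive and ignores surrounding punctuation, and multi-word stopwords are not handled.

```python
def chunk_words(text, pre_stopwords=(), post_stopwords=()):
    """Split text into display chunks (hypothetical helper).

    pre_stopwords:  words shown together with the word that follows them.
    post_stopwords: words shown together with the word that precedes them.
    """
    pre = {w.lower() for w in pre_stopwords}
    post = {w.lower() for w in post_stopwords}

    def key(token):
        # Compare words without surrounding punctuation or case.
        return token.strip(".,;:!?\"'()").lower()

    chunks = []
    attach_next = False  # True when the previous token was a pre-stopword
    for token in text.split():
        k = key(token)
        if chunks and (attach_next or k in post):
            # Glue this token onto the chunk currently being built.
            chunks[-1] += " " + token
        else:
            chunks.append(token)
        attach_next = k in pre
    return chunks


sentence = "Yesterday I saw the man who read this newspaper."
for chunk in chunk_words(sentence, pre_stopwords=["I", "the", "this"]):
    print(chunk)
```

With "I", "the" and "this" all treated as words that attach to the following word, this prints the six chunks shown above (the last one keeps its full stop, since punctuation is left untouched). A word in post_stopwords, such as "up" in "She looked it up", would instead be glued onto the chunk before it.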

This can be useful for dealing with very short words, but it may also enable users to customise their reading experience for their language or their text's subject field.

Discussion