Menu

#5 Sampling for limiting the retrieval of very common words

1.0
open
nobody
Feature (7)
9
2017-03-05
2017-03-05
S Luz
No

Words that are too common and might slow the browser down too much. The proposed solution is to sample such words randomly from a collection of files up to a certain limit of
concordances. This is not an issue with the current size of the corpora currently handled by modnlp (in fact, it may never become an issue) but it may be useful to implement this feature in future.

Discussion

  • S Luz

    S Luz - 2017-03-05
    • Priotity: --> 10
     
  • S Luz

    S Luz - 2017-03-05
    • Priotity: 10 --> 9
     

Log in to post a comment.

Want the latest updates on software, tech news, and AI?
Get latest updates about software, tech news, and AI from SourceForge directly in your inbox once a month.