Bullet Points in Mosaic Removed and Resulting Display
Modular Suite of NLP Tools
Brought to you by:
luzs
Working with Mosaic, one of the things one can do is get rid of some noise in Mosaic by removing collocates that are meaningless, such as square bullet points. The Java Installer version uses a tokeniser that correctly identifies these as punctuation marks and removes them for display purposes. But webcli doesn't.
Another bug that is specific to webcli is that something seems to be worong with the resulting display (after the square bullet points are removed in Mosaic): the concordance then displays some empty lines, and lines where the name of file is broken over two lines.
punctuationregex has been modified in webcli/js/main.js to remove the bullet points. this change should appear in webcli-dev by tomorrow