Activity for craig

  • craig craig posted a comment on discussion Features

    These would need proper tests added. Ideally test macro files containing macro features (I don't know details of this format) before I include them. If you would like to help, please add a PR with macro versions of, say, core/src/test/resources/ms files 'excel.xls' and 'word.docx' which actually include macros (?) so that I can include in tests and do a rebuild. (The build is tricky to complete cleanly with all the env setup required, easier if I do it).

  • craig craig posted a comment on discussion Announce

    This is an across the board uplift of DB drivers, tools and other library dependencies. This release was built and tested with Java 19. The models (NLP/mood and OCR) should also be improved. The main feature add is -e/--regex for file search. This interprets the 'search pattern' as a regular expression instead of the default wildcard. DB search is based on LIKE queries so this remains wildcard only (if -d is specified then -e is ignored, with warning). I removed Neo4J at the same time, the uplift...

  • craig craig posted a comment on discussion General Discussion

    Yes! Beta testing for me would be muich appreciated. On any platform, but my testing has been on Windows. So mac, linux testing needs some help

  • craig craig posted a comment on discussion Features

    This requires regex support which I expect to include in the upcoming 1.0.6 release. Stay tuned

  • craig craig modified a comment on discussion General Discussion

    Hi Darren, thanks for your post. Sorry but notifications of new posts were lost somehow, I only just noticed this one! The future is good, I've finally found time to do a major uplift and in the processes of completing all the integration tests which are the hardest to setup. I expect to release 1.0.6 in the near future with some initial feature tweaks. The main one being regex support for the search pattern. CRGREP comes with a lot of docs but looking again you're right, the 'search-pattern' is...

  • craig craig posted a comment on discussion General Discussion

    Hi Darren, thanks for your post. Sorry but notifications of new posts were lost somehow, I only just noticed this one! The future is good, I've finally found time to do a major uplift and in the processes of completing all the integration tests which are the hardest to setup. I expect to release 1.0.6 in the near future with some initial feature tweaks. The main one being regex support for the search pattern. CRGREP comes with a lot of docs but looking again you're right, the 'search-pattern' is...

  • craig craig posted a comment on discussion General Discussion

    Hi Culverine, it's not meant to be bundled or packaged with standalone external tools included. The idea is that all these third party packages I make use of (NLP, OCR etc) need to be installed independently of crgrep because their datasets and software are typically quite large while the default crgrep distro is small given that it's a simple CLI tool and users can decide what extra tools they wish to install and use with it. I've made every attempt to provide complete documentation, have you tried...

  • craig craig posted a comment on discussion General Discussion

    Hi Darren, I could look at handling nbsp the same as space. Leave that with me. In the meantime, I created a Word doc with nbsp line (nb space char between 34 and mm) and another line using a normal space character and the crgrep call below matched both for me using a wildcard in the pattern: Data: (the first line will display as '35mm' on the command line) 34 mm nbsp 34 mm space $ crgrep '34*mm' nbspace.docx nbspace.docx:P:34mm nbsp nbspace.docx:P:34 mm space Hope that helps as a workaround.

  • craig craig posted a comment on discussion General Discussion

    in 1.0.5 I've just discovered an issue in crgrep.bat file which I will fix in the...

  • craig craig posted a comment on discussion Announce

    Uses Natural Language Processing (NLP) and exposes a --mood option to filter matching...

  • craig craig posted a comment on discussion General Discussion

    Thanks gymit (very annoying I didn't see auto-notification you had posted a question,...

  • craig craig modified a comment on discussion Features

    I started to look into the possibility of running crgrep on a URL pointing to a remote...

  • craig craig posted a comment on discussion Features

    I started to look into the possibility of running crgrep on a URL pointing to a remote...

  • craig craig posted a comment on a wiki page

    One of the documents I ship is a ROADMAP.txt with various notes I keep on feature...

  • craig craig posted a comment on discussion General Discussion

    Quite a few releases have gone out up until now and I've had some feedback through...

  • craig craig posted a comment on discussion Announce

    Added MP3 file support to search through music libraries. Addressed some issues with...

  • craig craig posted a comment on discussion Announce

    Added color highlighting of matched text in this release with the --color (or --colour)...

  • craig craig posted a comment on discussion General Discussion

    An issue was reported which occurs when Java (and JAVA_HOME setting) points to a...

  • craig craig posted a comment on discussion Announce

    The main feature implemented in this release is support for Microsoft Office document...

  • craig craig modified a comment on discussion Announce

    Quite a bit of refactoring in this release to implement proper Java 7 Paths for file...

  • craig craig posted a comment on discussion Announce

    Quite a bit of refactoring in this release to implement proper Java 7 Paths for file...

  • craig craig posted a comment on discussion Announce

    Updated the 1.0.0 distro with missing docs directory

  • craig craig posted a comment on discussion General Discussion

    Part of the latest (1.0.0) release you'll see at the bottom of the CHANGELOG.txt...

  • craig craig posted a comment on discussion Announce

    The first major release following alpha releases 0.1 to 0.5 Added support for SQLServer,...

  • craig craig posted a comment on discussion General Discussion

    Example from README.txt is missing '-l' option, without this option the column data...

  • craig craig modified a wiki page

    Home

  • craig craig posted a comment on discussion Announce

    This release was mostly focused on improved quality. Substantial improvements in...

  • craig craig modified a comment on discussion General Discussion

    I have found some significant issues in 0.4 which may cause Db search failures, stack...

  • craig craig posted a comment on discussion General Discussion

    I have found some significant issues in 0.4 which may cause Db search failures, stack...

  • craig craig posted a comment on discussion Announce

    Discovered a significant issue in 0.4 so have removed the distro temporarily, I will...

  • craig craig posted a comment on discussion Announce

    The most significant update with this release is support for Optical Character Recognition...

1
MongoDB Logo MongoDB