Hi. I just installed 1.0.6. Looks good. I recently ran it and got this: ERROR StatusLogger Log4j2 could not find a logging implementation. Please add log4j-core to the classpath. Using SimpleLogger to log to the console... I went to https://logging.apache.org/log4j/2.x/download.html and made sure I had both log4j-api-2.20.0.jar and log4j-core-2.20.0.jar in the crgrep lib folder, and the warning went away. Just FYI. Darren
I do not have a Mac, though I have colleagues who do and I might be able to test on Mac. I regularly use Linux, and I have used the program on Linux with success. Happy to try some things out on Linux, and I could probably put together a BSD (NetBSD or FreeBSD etc.) VM in VirtualBox, say, to see how it goes. I am pretty busy right now, but feel free to ask -- I just cannot guarantee fast turnaround! Happy also to test on Windows, but it sounds like that is covered.
These would need proper tests added. Ideally I would test with files that actually contain macro features (I don't know the details of this format) before I include them. If you would like to help, please add a PR with macro versions of, say, the core/src/test/resources/ms files 'excel.xls' and 'word.docx' which actually include macros (?) so that I can include them in tests and do a rebuild. (The build is tricky to complete cleanly with all the env setup required, so it is easier if I do it.)
This is an across-the-board uplift of DB drivers, tools and other library dependencies. This release was built and tested with Java 19. The models (NLP/mood and OCR) should also be improved. The main feature addition is -e/--regex for file search, which interprets the 'search pattern' as a regular expression instead of the default wildcard. DB search is based on LIKE queries so it remains wildcard only (if -d is specified then -e is ignored, with a warning). I removed Neo4J at the same time, the uplift...
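To illustrate the difference between the default wildcard interpretation and -e/--regex, here is a minimal Python sketch. It is not crgrep's code: the function names `wildcard_to_regex` and `line_matches` are hypothetical, and it assumes that only `*` and `?` act as wildcards, which may not match crgrep's actual rules.

```python
import re


def wildcard_to_regex(pattern: str) -> str:
    """Translate a simple wildcard pattern into a regular expression.

    Assumption (not taken from crgrep): '*' matches any run of characters,
    '?' matches exactly one character, everything else is literal.
    """
    parts = []
    for ch in pattern:
        if ch == "*":
            parts.append(".*")
        elif ch == "?":
            parts.append(".")
        else:
            # Escape so characters such as '[' or '.' are matched literally.
            parts.append(re.escape(ch))
    return "".join(parts)


def line_matches(pattern: str, line: str, use_regex: bool = False) -> bool:
    """Return True if the line contains a match for the pattern."""
    regex = pattern if use_regex else wildcard_to_regex(pattern)
    return re.search(regex, line) is not None


if __name__ == "__main__":
    print(line_matches("34*mm", "34 mm space"))             # wildcard mode
    print(line_matches("[abcd]", "cat"))                    # wildcard mode, literal brackets
    print(line_matches("[abcd]", "cat", use_regex=True))    # regex mode, character class
```

Under these assumptions, a pattern such as `[abcd]` is matched literally in wildcard mode and only behaves as a character class once regex mode is switched on.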
Yes! Beta testing would be much appreciated, on any platform. My own testing has been on Windows, so Mac and Linux testing needs some help.
Hi, yeah, that's me. I am really glad to hear the project is going to a new version. Let me know if there's some testing I can do.
This requires regex support which I expect to include in the upcoming 1.0.6 release. Stay tuned
Hi Darren, thanks for your post. Sorry, but notifications of new posts were lost somehow and I only just noticed this one! The future is good: I've finally found time to do a major uplift and am in the process of completing all the integration tests, which are the hardest to set up. I expect to release 1.0.6 in the near future with some initial feature tweaks, the main one being regex support for the search pattern. CRGREP comes with a lot of docs but looking again you're right, the 'search-pattern' is...
I see you commented on something like this at https://bitbucket.org/cryanfuse/crgrep/issues/19/feature-request-regular-expression; sorry to bother you.
Hi. I use crgrep all the time and talk about it to people. It must be a major effort to maintain, and I am wondering what the future holds. Also, I was wondering where the format for the <search pattern> is documented. Some grep notation, like [abcd] to match one of a, b, c and d, does not seem to work for me, but maybe I am doing it wrong. I looked in the docs and I'm not sure where the syntax is explained. Thanks for crgrep.
I'd love to see a tail-like feature. In other words, the ability to get n lines before or after a match.
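For anyone picturing what this request means, GNU grep offers it through -B, -A and -C. The Python sketch below shows the idea only; `grep_with_context` is a hypothetical name, it does plain substring matching, and nothing here reflects an existing or planned crgrep option.

```python
from typing import List


def grep_with_context(lines: List[str], needle: str,
                      before: int = 0, after: int = 0) -> List[str]:
    """Return each line containing `needle` plus its neighbouring lines.

    `before` and `after` are the number of context lines to keep on each
    side of a match. Overlapping context is emitted once, in file order.
    """
    keep = set()
    for i, line in enumerate(lines):
        if needle in line:
            start = max(0, i - before)
            end = min(len(lines), i + after + 1)
            keep.update(range(start, end))
    return [lines[i] for i in sorted(keep)]


if __name__ == "__main__":
    sample = ["alpha", "beta", "hit here", "gamma", "delta"]
    for line in grep_with_context(sample, "hit", before=1, after=1):
        print(line)
```

Collecting line indexes in a set keeps the output free of duplicates when two matches sit close together.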
Hi Culverine, it's not meant to be bundled or packaged with standalone external tools included. The idea is that all the third-party packages I make use of (NLP, OCR etc.) need to be installed independently of crgrep, because their datasets and software are typically quite large, while the default crgrep distro is small given that it's a simple CLI tool. Users can decide what extra tools they wish to install and use with it. I've made every attempt to provide complete documentation, have you tried...
Hello Craig, would it be possible to have a deb package for crgrep? I have installed Tesseract but I'm quite confused about what should be done next to install on Ubuntu. Regards,
Yep, that's great in some circumstances and I'll use it. I use crgrep on an almost daily basis. Thanks again.
Hi Darren, I could look at handling nbsp the same as space. Leave that with me. In the meantime, I created a Word doc with one line containing an nbsp (non-breaking space character between 34 and mm) and another line using a normal space character, and the crgrep call below matched both for me using a wildcard in the pattern:
Data: (the first line will display as '34mm' on the command line)
34 mm nbsp
34 mm space
$ crgrep '34*mm' nbspace.docx
nbspace.docx:P:34mm nbsp
nbspace.docx:P:34 mm space
Hope that helps as a workaround.
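The idea of handling nbsp the same as space can be sketched in a few lines of Python. This is only an illustration, not crgrep's implementation: `normalize_spaces` and `contains` are hypothetical names, and the sketch assumes the text extracted from the document still carries the U+00A0 character.

```python
NBSP = "\u00a0"  # Unicode NO-BREAK SPACE


def normalize_spaces(text: str) -> str:
    """Replace every non-breaking space with an ordinary space."""
    return text.replace(NBSP, " ")


def contains(needle: str, text: str) -> bool:
    """Substring search that treats nbsp and space as the same character."""
    return normalize_spaces(needle) in normalize_spaces(text)


if __name__ == "__main__":
    doc_lines = ["34\u00a0mm nbsp", "34 mm space"]
    for line in doc_lines:
        # Both lines match a pattern typed with a normal space.
        print(contains("34 mm", line), repr(line))
```

Normalising both the pattern and the text means a pattern pasted from Word with an nbsp in it would match as well.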
Hi Craig, crgrep is really handy for looking at Word files, which sadly I have to do a lot. But I notice that it seems to treat a non-breaking space (ctrl-alt-spacebar in Word) as if it was not there, which causes words to run together and then not show up in the results. For example, if I write 34 mm (as in millimetres), I don't want the mm separated from the 34, so I put an nbsp in between; but if I grep for "34 mm" on the command line, with a normal space typed in as part of the expression, I get...
In 1.0.5 I've just discovered an issue in the crgrep.bat file which I will fix in the...
Uses Natural Language Processing (NLP) and exposes a --mood option to filter matching...
Thanks gymit (very annoying I didn't see auto-notification you had posted a question,...
Hi, very good tool. Does it work with regex (not only wildcards)? For example if...
I started to look into the possibility of running crgrep on a URL pointing to a remote...
One of the documents I ship is a ROADMAP.txt with various notes I keep on feature...
Quite a few releases have gone out up until now and I've had some feedback through...
Added MP3 file support to search through music libraries. Addressed some issues with...
Added color highlighting of matched text in this release with the --color (or --colour)...
An issue was reported which occurs when Java (and JAVA_HOME setting) points to a...
The main feature implemented in this release is support for Microsoft Office document...
Quite a bit of refactoring in this release to implement proper Java 7 Paths for file...
Updated the 1.0.0 distro with missing docs directory
Part of the latest (1.0.0) release you'll see at the bottom of the CHANGELOG.txt...
The first major release following alpha releases 0.1 to 0.5 Added support for SQLServer,...
Example from README.txt is missing '-l' option, without this option the column data...
This release was mostly focused on improved quality. Substantial improvements in...
I have found some significant issues in 0.4 which may cause Db search failures, stack...
Discovered a significant issue in 0.4 so have removed the distro temporarily, I will...
The most significant update with this release is support for Optical Character Recognition...