-
schtepf committed revision 59 to the IMS Open Corpus Workbench SVN repository, changing 1 files.
2009-11-11 17:35:59 UTC in IMS Open Corpus Workbench
-
schtepf committed revision 56 to the IMS Open Corpus Workbench SVN repository, changing 9 files.
2009-11-07 22:41:30 UTC in IMS Open Corpus Workbench
-
schtepf committed revision 55 to the IMS Open Corpus Workbench SVN repository, changing 6 files.
2009-11-04 15:10:27 UTC in IMS Open Corpus Workbench
-
Fixed in version 2.2.b101.
The header line is now optional if the undump is loaded from a regular file. CQP will automatically detect the new format and read the undump file in two passes (first to determine number of lines, then to read actual data). The new format cannot be used when reading from a pipe or from standard input (because pipes cannot be re-read).
2009-11-04 15:00:23 UTC in IMS Open Corpus Workbench
-
The "undump" command in CQP requires an explicit line count header in the first line of the undump file, so that arrays can be pre-allocated. This is a major hassle for exchanging data with spreadsheets, SQL database engines, R, and other software that would otherwise work quite well with the TAB-delimited format of dump/undump files. Without this restriction, it would also be possible to use...
2009-11-04 14:57:52 UTC in IMS Open Corpus Workbench
-
Changing the registry is less important: it's easy to just copy over registry files to CQPweb's separate registry, as long as they can share the original data files. In fact, it may be best to require a single registry directory for CQPweb.
2009-10-29 10:55:49 UTC in IMS Open Corpus Workbench
-
From our experience working interactively in CQP, it's even more useful to be able to run subqueries, i.e. filter query results either by collocates (tokens with a certain property within a specified range, e.g. a finite verb within 3 words) or by another CQP query. This could easily be implemented using "set keyword" and subqueries in CQP, but the results would have to be stored as saved...
2009-10-25 17:27:05 UTC in IMS Open Corpus Workbench
-
If a corpus is properly encoded with the ##::charset property set, then the CQP interface might be able to figure out the corpus encoding by itself. Basically, it would have to watch for a corpus activation command and then access the corpus properties. Currently, this can be done with the "info;" command if one doesn't mind the overhead of reading and printing the entire .info file. But it...
2009-10-25 17:19:41 UTC in IMS Open Corpus Workbench
-
schtepf committed revision 51 to the IMS Open Corpus Workbench SVN repository, changing 1 files.
2009-10-02 10:02:15 UTC in IMS Open Corpus Workbench
-
schtepf committed revision 50 to the IMS Open Corpus Workbench SVN repository, changing 1 files.
2009-10-01 12:59:27 UTC in IMS Open Corpus Workbench