schtepf Activity

Activity for Stephanie Evert

2 days ago
Stephanie Evert committed [r1884] on Code

Enhance CQP with custom SpheroscopeDebug option (experimental)
3 months ago
Stephanie Evert committed [r1883] on Code

Make sure cwb-encode aborts with meaningful error message if there are unparseable arguments
5 months ago
Stephanie Evert committed [r1881] on Code

Encourage users to upgrade to v3.5.1
9 months ago
Stephanie Evert committed [r1877] on Code

Fix bug in cl_string_canoncial(), which would fail on diacritic-folding of long strings (where NFC exceeds CL_MAX_LINE_LENGTH)
10 months ago
Stephanie Evert posted a comment on ticket #72

A kwic concordance (where left and right context might not even contain complete tokens!) is very different from a list of sentences with pre-determined annotation as in the FreeLing output. I don't think we can learn much from it to help us address the challenges of kwic XML output. SGML print mode is really badly broken if you display s-attributes in the concordance. It also includes them (and any p-attributes) as plain text in the tokens rather than in a way that allows them to be processed e.g....
10 months ago
Stephanie Evert posted a comment on ticket #7

You seem to have forgotten to activate the corpus: info PRUEBA; but PRUEBA; show cd;
10 months ago
Stephanie Evert posted a comment on ticket #72

Note to those not familiar with CQP print modes: Their implementation is a horrible mess, so we are reluctant to add extensions and very limited in what can be achieved. Moreover, the print modes only affect some CQP output (kwic concordances, frequency tables from group) but by far not all.
10 months ago
Stephanie Evert modified ticket #72

XML output mode for CQP
10 months ago
Stephanie Evert posted a comment on ticket #7

Because that's how the orginal developer decided to do things in 1994. It's a quirk that we live with for the sake of backwards compatibility. Note that the filename of the registry file also has to be in lowercase, while corpus IDs are to be specified in all caps everywhere else. You can get the list of attributes with show cd or using cwb-describe-corpus -s on the command line. Canonical attribute names (both positional and structural) should be all lowercase and only use ASCII characters. While...
10 months ago
Stephanie Evert modified ticket #80

SGML invalid structure
10 months ago
Stephanie Evert posted a comment on ticket #80

PS: If you want to pursue this, please add a feature request “XML output mode for CQP” for CWB v3.6.
10 months ago
Stephanie Evert posted a comment on ticket #80

Not a bug: SGML allows omission of closing tags – you just have to assume a suitable DTD for the output produced by CQP. Note that your second suggestion isn't valid SGML and would have to be written <attribute// instead. If your SGML output also included the kwic line with some s-attributes shown, you'd get many more validation errors (because nothing guarantees that open/close tags match up within a kwic line, and they can also overlap between context and match). If we ever find the nerves to implement...
11 months ago
Stephanie Evert committed [r1871] on Code

cwb-scan-corpus now also reports type count before applying frequency filter
11 months ago
Stephanie Evert committed [r1870] on Code

Ziggurat design: even more B-tree algorithms
11 months ago
Stephanie Evert committed [r1869] on Code

Ziggurat design: added binsearch benchmark observations with Rust on MacOS
11 months ago
Stephanie Evert committed [r1866] on Code

Ziggurat design: try yet another binary search algo (unsuccessfully)
11 months ago
Stephanie Evert committed [r1863] on Code

Ziggurat design: fix uint typedef conflict (now zuint)
11 months ago
Stephanie Evert committed [r1860] on Code

Added binary search benchmarks for random walks and exponential search algorithm
11 months ago
Stephanie Evert committed [r1859] on Code

Convert README to Markdown + HTML for easier reading
11 months ago
Stephanie Evert committed [r1858] on Code

Ziggurat design: estimate disk size of compressed sparse inverted index
11 months ago
Stephanie Evert committed [r1857] on Code

Add HTML version of Markdown README for convenience
11 months ago
Stephanie Evert committed [r1856] on Code

Ziggurat design: benchmark results for binary search in sort index, with thorough discussion
11 months ago
Stephanie Evert committed [r1855] on Code

Ziggurat design: benchmark binary lookup in large tables vs. b-tree
11 months ago
Stephanie Evert committed [r1854] on Code

Ziggurat design: estimate size of sparse inverted index
11 months ago
Stephanie Evert committed [r1853] on Code

Fix extremely embarrassing as well as catastrophic bug in cwb-scan-corpus introduced by r1851
11 months ago
Stephanie Evert committed [r1852] on Code

minor display fix in cwb-scan-corpus
11 months ago
Stephanie Evert committed [r1851] on Code

cwb-scan-corpus now obtains total token/document counts if no regular keys are specified
1 year ago
Stephanie Evert posted a comment on ticket #78

Re. 3: If you installed this particular corpus in CQPweb, you must have ignored all the error messages that it shot at you. It should have outright refused to install the corpus, but perhaps it went far enough to get its database into an inconsistent state that causes the lock-up.
1 year ago
Stephanie Evert modified a comment on ticket #78

(a) Does this happen for a specific query, or for every query that returns some matches? In the latter case there' s probably sth in the corpus that confuses CQPweb. (b) Are there also problems if you run the query directly in CQP? (c) You can't possibly have installed this corpus in CQPweb because it's lacking the mandatory <text id="..."> elements!
1 year ago
Stephanie Evert modified a comment on ticket #78

(a) Does this happen for a specific query, or for every query that returns some matches? In the latter case there' s probably sth in the corpus that confuses CQPweb. (b) Are there also problems if you run the query directly in CQP? (c) You can't possibly have installed this corpus in CQPweb because it's lacking the mandatory <text id="..."> ... </text> elements!
1 year ago
Stephanie Evert modified a comment on ticket #78

(a) Does this happen for a specific query, or for every query that returns some matches? In the latter case there' s probably sth in the corpus that confuses CQPweb. (b) Are there also problems if you run the query directly in CQP? (c) You can't possibly have installed this corpus in CQPweb because it's lacking the mandatory <text id="..."> elements!</text>
1 year ago
Stephanie Evert posted a comment on ticket #78

(a) Does this happen for a specific query, or for every query that returns some matches? In the latter case there' s probably sth in the corpus that confuses CQPweb. (b) Are there also problems if you run the query directly in CQP?
1 year ago
Stephanie Evert posted a comment on ticket #6

cwb-ccc should give you most (if not all) of what you can get from the CQPweb API, and it will be faster (as everything is directly in Python and doesn't need to be serialised and de-serialised) and more directly under your control. I think CQPweb is useful for your use case only if you need its GUI.
1 year ago
Stephanie Evert modified ticket #5

Best approach for interfacing CQP with other software
1 year ago
Stephanie Evert posted a comment on ticket #5

We don't currently have plans to continue the development of CQi, though cqpserver and the Perl client are still fully functional. CQi doesn't wrap all relevant CQP functionality yet and has a few suboptimal design choices, and currently there doesn't seem to be much demand for a client-server API of this type. Direct APIs such as the ones available for Python, Perl and R are more efficient and convenient to use. As a point of interest, the R API (RCppCWB) is modeled after CQi, but accesses the internal...
1 year ago
Stephanie Evert posted a comment on ticket #5

Since you're working in Python, you should use the native API provided by cwb-ccc package. It is well-maintained and up to date, but doesn't support Windows platforms yet (help wanted!).
1 year ago
Stephanie Evert committed [r1840] on Code

Merge bugfix r1839 into branches/3.5
1 year ago
Stephanie Evert committed [r1839] on Code

Fix undefined behaviour / segmentation fault of cwb-scan-corpus if more than 32 keys are specified.
1 year ago
Stephanie Evert committed [r1838] on Code

Document handling of empty element tags in Corpus Encoding Manual
1 year ago
Stephanie Evert committed [r1837] on Code

Fix bug in r1836
1 year ago
Stephanie Evert committed [r1836] on Code

Make cwb-encode silently discard empty element tags for s-attributes with nesting specifiers
1 year ago
Stephanie Evert committed [r1835] on Code

Merge bugfix r1826 into branches/3.5
1 year ago
Stephanie Evert committed [r1834] on Code

Create CWB release/3.5.0 as stable archive w/o any bug fixes added to branches/3.5 after release.
1 year ago
Stephanie Evert committed [r1832] on Code

Downgrade CWB trunk to v3.5.1 rather than v3.6
1 year ago
Stephanie Evert committed [r1829] on Code

Varint benchmark for batch processing (inlined loop) + results for M1Max
1 year ago
Stephanie Evert committed [r1828] on Code

Update varint encoding benchmarks:
2 years ago
Stephanie Evert committed [r1826]

First bug fix after release of v3.5.0 :-}
2 years ago
Stephanie Evert committed [r1823]

Repair last changes to CWB::CL configuration script
2 years ago
Stephanie Evert committed [r1820]

Improve CWB::CL makefile
2 years ago
Stephanie Evert committed [r1819]

A few final bugfixes
2 years ago
Stephanie Evert committed [r1817]

Correct typo in CWB/Perl test
2 years ago
Stephanie Evert committed [r1809]

Platform configuration for MacOS binary release based on HomeBrew
2 years ago
Stephanie Evert committed [r1808]

Typo in config/platform/darwin-release
2 years ago
Stephanie Evert committed [r1807]

Update INSTALL and INSTALL-MACOS with instructiosn for building self-contained binary releases.
2 years ago
Stephanie Evert committed [r1806]

More work on self-contained binary release package for MacOS
2 years ago
Stephanie Evert committed [r1805]

New cwb-config Perl script for self-contained binary packages; remove install-cwb.sh
2 years ago
Stephanie Evert committed [r1804]

First attempt to build self-contained package for MacOS (work in progress)
2 years ago
Stephanie Evert committed [r1803]

Same for hombrew-formula configuration
2 years ago
Stephanie Evert committed [r1802]

Change linker flags to -Xlinker notation, which seems most widely supported, and remove the special case on MacOS
2 years ago
Stephanie Evert committed [r1801]

Use correct synatx for -Wl,-rpath flag (stupid thinko)
2 years ago
Stephanie Evert committed [r1797]

Minor fix to uninstall-cwb.perl
2 years ago
Stephanie Evert committed [r1796]

Uninstall script for CWB (e.g. to switch from locally installed version to Debian package)
2 years ago
Stephanie Evert committed [r1795]

Proper bugfix for CL regexp optimizer, so grains aren't incorrectly anchored at end of string
2 years ago
Stephanie Evert committed [r1794]

Explain in README how to test CQP with set Optimize on; or other options
2 years ago
Stephanie Evert committed [r1793]

Some more regression tests for regexp optimizer
2 years ago
Stephanie Evert committed [r1791]

Ignore some local files in CWB source tree (svn propedit svn:ignore .)
2 years ago
Stephanie Evert committed [r1790]

Enable dynamic linking on MacOS by default
2 years ago
Stephanie Evert committed [r1789]

Some further cleanup of config files
2 years ago
Stephanie Evert committed [r1788]

Amend previous checkin
2 years ago
Stephanie Evert committed [r1787]

Improve dynamic linking in homebrew formula (experimental)
2 years ago
Stephanie Evert committed [r1786]

Fix rpath problem in homebrew on MacOS (but may persist on Linux?)
2 years ago
Stephanie Evert committed [r1785]

Enable shared linking in homebrew formula (experimental)
2 years ago
Stephanie Evert committed [r1784]

Get dynamic linking to work properly on MacOS
2 years ago
Stephanie Evert committed [r1783]

Improve Makefile for automated Perl module testing
2 years ago
Stephanie Evert committed [r1782]

Make CWB-Web package available on CPAN (needs dummy CWB::Web module)
2 years ago
Stephanie Evert committed [r1779]

Clean up beta-install config; enable optional "local_config.mk" for developers
2 years ago
Stephanie Evert committed [r1751]

Fix .dylib building on MacOS
2 years ago
Stephanie Evert committed [r1732]

Forgot to save a file before last check-in
2 years ago
Stephanie Evert committed [r1731]

Re-enable regular expression optimizer with cl_optimize=1;, which had been disabled due to a stupid bug.
2 years ago
Stephanie Evert committed [r1730]

Fix segfault in cwb-s-encode (using corpus handle to get charset when not adding to existing corpus)
2 years ago
Stephanie Evert committed [r1726]

Remove a few left-over references to editline/ from config
2 years ago
Stephanie Evert committed [r1721]

Finally switch config.mk to default to standard installation rather than separate beta tree
2 years ago
Stephanie Evert modified ticket #76

"set RightContext n character" is off by 1
2 years ago
Stephanie Evert modified ticket #73

CQPweb: Collocation analysis with POS filter is inconsistent
2 years ago
Stephanie Evert modified ticket #73

CQPweb: Collocation analysis with POS filter is inconsistent
2 years ago
Stephanie Evert committed [r1710]

Fix bug #76: token separator next to match included in RightContext character count
2 years ago
Stephanie Evert posted a comment on ticket #76

Reason: the token separator between match and co-text is included in the character count for the right context size, but excluded from the count for the left context size. Seems wrong to me.
2 years ago
Stephanie Evert modified ticket #75

CQP can save a named query results file larger than 2 GiB but cannot read it
2 years ago
Stephanie Evert modified ticket #75

CQP can save a named query results file larger than 2 GiB but cannot read it
2 years ago
Stephanie Evert posted a comment on ticket #75

Thanks for the report and thorough diagnosis. Should be fixed in r1708.
2 years ago
Stephanie Evert committed [r1708]

Bug fix: CQP fails to load saved query results if file size is > 2 GiB
2 years ago
Stephanie Evert modified ticket #74

Inconsistency in CQP regexp matching
2 years ago
Stephanie Evert posted a comment on ticket #74

Fixed in r1705. The macro CL_REGEX_METACHARACTERS is defined in cl/cl.h (and documentation explains that it's not a list of all "unsafe" characters despite the name).
2 years ago
Stephanie Evert committed [r1705]

Make sure that CQP always matches search strings that contain ^ or $ as regular expressions
2 years ago
Stephanie Evert created ticket #74

Inconsistency in CQP regexp matching
3 years ago
Stephanie Evert committed [r1699]

Perl/CWB-CQi: v3.0.0 for CPAN release
3 years ago
Stephanie Evert committed [r1698]

(amend previous check-in: forgot one file)
3 years ago
Stephanie Evert committed [r1697]

Perl/CWB-Web: v3.4.0 for CPAN release
3 years ago
Stephanie Evert committed [r1696]

Perl/CWB-CL: v3.4.33 for CPAN release; update author name
3 years ago
Stephanie Evert committed [r1695]

(amend previous check-in: forgot one file)

1 >

Stephanie Evert Activity

Activity for Stephanie Evert

Stephanie Evert committed [r1884] on Code

Stephanie Evert committed [r1883] on Code

Stephanie Evert committed [r1881] on Code

Stephanie Evert committed [r1877] on Code

Stephanie Evert posted a comment on ticket #72

Stephanie Evert posted a comment on ticket #7

Stephanie Evert posted a comment on ticket #72

Stephanie Evert modified ticket #72

Stephanie Evert posted a comment on ticket #7

Stephanie Evert modified ticket #80

Stephanie Evert posted a comment on ticket #80

Stephanie Evert posted a comment on ticket #80

Stephanie Evert committed [r1871] on Code

Stephanie Evert committed [r1870] on Code

Stephanie Evert committed [r1869] on Code

Stephanie Evert committed [r1866] on Code

Stephanie Evert committed [r1863] on Code

Stephanie Evert committed [r1860] on Code

Stephanie Evert committed [r1859] on Code

Stephanie Evert committed [r1858] on Code

Stephanie Evert committed [r1857] on Code

Stephanie Evert committed [r1856] on Code

Stephanie Evert committed [r1855] on Code

Stephanie Evert committed [r1854] on Code

Stephanie Evert committed [r1853] on Code

Stephanie Evert committed [r1852] on Code

Stephanie Evert committed [r1851] on Code

Stephanie Evert posted a comment on ticket #78

Stephanie Evert modified a comment on ticket #78

Stephanie Evert modified a comment on ticket #78

Stephanie Evert modified a comment on ticket #78

Stephanie Evert posted a comment on ticket #78

Stephanie Evert posted a comment on ticket #6

Stephanie Evert modified ticket #5

Stephanie Evert posted a comment on ticket #5

Stephanie Evert posted a comment on ticket #5

Stephanie Evert committed [r1840] on Code

Stephanie Evert committed [r1839] on Code

Stephanie Evert committed [r1838] on Code

Stephanie Evert committed [r1837] on Code

Stephanie Evert committed [r1836] on Code

Stephanie Evert committed [r1835] on Code

Stephanie Evert committed [r1834] on Code

Stephanie Evert committed [r1832] on Code

Stephanie Evert committed [r1829] on Code

Stephanie Evert committed [r1828] on Code

Stephanie Evert committed [r1826]

Stephanie Evert committed [r1823]

Stephanie Evert committed [r1820]

Stephanie Evert committed [r1819]

Stephanie Evert committed [r1817]

Stephanie Evert committed [r1809]

Stephanie Evert committed [r1808]

Stephanie Evert committed [r1807]

Stephanie Evert committed [r1806]

Stephanie Evert committed [r1805]

Stephanie Evert committed [r1804]

Stephanie Evert committed [r1803]

Stephanie Evert committed [r1802]

Stephanie Evert committed [r1801]

Stephanie Evert committed [r1797]

Stephanie Evert committed [r1796]

Stephanie Evert committed [r1795]

Stephanie Evert committed [r1794]

Stephanie Evert committed [r1793]

Stephanie Evert committed [r1791]

Stephanie Evert committed [r1790]

Stephanie Evert committed [r1789]

Stephanie Evert committed [r1788]

Stephanie Evert committed [r1787]

Stephanie Evert committed [r1786]

Stephanie Evert committed [r1785]

Stephanie Evert committed [r1784]

Stephanie Evert committed [r1783]

Stephanie Evert committed [r1782]

Stephanie Evert committed [r1779]

Stephanie Evert committed [r1751]

Stephanie Evert committed [r1732]