Thread: [octc-svn] SF.net SVN: octc:[197] trunk/tools/xsl/tokenizer.xsl
Brought to you by:
bansp
From: <ba...@us...> - 2010-05-29 01:05:03
|
Revision: 197 http://octc.svn.sourceforge.net/octc/?rev=197&view=rev Author: bansp Date: 2010-05-29 01:04:57 +0000 (Sat, 29 May 2010) Log Message: ----------- a very preliminary version Added Paths: ----------- trunk/tools/xsl/tokenizer.xsl This was sent by the SourceForge.net collaborative development platform, the world's largest Open Source development site. |
From: <ba...@us...> - 2010-05-29 18:54:43
|
Revision: 201 http://octc.svn.sourceforge.net/octc/?rev=201&view=rev Author: bansp Date: 2010-05-29 18:54:34 +0000 (Sat, 29 May 2010) Log Message: ----------- segmentation done, need to calculate the offsets now Modified Paths: -------------- trunk/tools/xsl/tokenizer.xsl This was sent by the SourceForge.net collaborative development platform, the world's largest Open Source development site. |
From: <ba...@us...> - 2010-05-30 09:47:00
|
Revision: 202 http://octc.svn.sourceforge.net/octc/?rev=202&view=rev Author: bansp Date: 2010-05-30 09:46:53 +0000 (Sun, 30 May 2010) Log Message: ----------- hopefully closer now Modified Paths: -------------- trunk/tools/xsl/tokenizer.xsl This was sent by the SourceForge.net collaborative development platform, the world's largest Open Source development site. |
From: <ba...@us...> - 2010-05-31 10:07:04
|
Revision: 203 http://octc.svn.sourceforge.net/octc/?rev=203&view=rev Author: bansp Date: 2010-05-31 10:06:55 +0000 (Mon, 31 May 2010) Log Message: ----------- + option to dump debug info Modified Paths: -------------- trunk/tools/xsl/tokenizer.xsl This was sent by the SourceForge.net collaborative development platform, the world's largest Open Source development site. |
From: <ba...@us...> - 2010-06-01 22:03:52
|
Revision: 204 http://octc.svn.sourceforge.net/octc/?rev=204&view=rev Author: bansp Date: 2010-06-01 22:03:46 +0000 (Tue, 01 Jun 2010) Log Message: ----------- working version that produces proper segmentation that is improperly parsed by the buggy xmllint (libxml2-2.7.6); bugs reported Modified Paths: -------------- trunk/tools/xsl/tokenizer.xsl This was sent by the SourceForge.net collaborative development platform, the world's largest Open Source development site. |
From: <ba...@us...> - 2010-06-01 22:08:46
|
Revision: 205 http://octc.svn.sourceforge.net/octc/?rev=205&view=rev Author: bansp Date: 2010-06-01 22:08:40 +0000 (Tue, 01 Jun 2010) Log Message: ----------- typo Modified Paths: -------------- trunk/tools/xsl/tokenizer.xsl This was sent by the SourceForge.net collaborative development platform, the world's largest Open Source development site. |
From: <ba...@us...> - 2010-06-01 22:34:13
|
Revision: 207 http://octc.svn.sourceforge.net/octc/?rev=207&view=rev Author: bansp Date: 2010-06-01 22:34:07 +0000 (Tue, 01 Jun 2010) Log Message: ----------- switched on <head> tokenization (for the Swahili UDHR file as a demo), switched off debugging Modified Paths: -------------- trunk/tools/xsl/tokenizer.xsl This was sent by the SourceForge.net collaborative development platform, the world's largest Open Source development site. |
From: <ba...@us...> - 2010-06-03 19:05:23
|
Revision: 216 http://octc.svn.sourceforge.net/octc/?rev=216&view=rev Author: bansp Date: 2010-06-03 19:05:17 +0000 (Thu, 03 Jun 2010) Log Message: ----------- default values: conserve space; <include> elements get the prefix instead of namespace specification (the previous version was needed for DTDs, but we forget about DTDs anyway) Modified Paths: -------------- trunk/tools/xsl/tokenizer.xsl This was sent by the SourceForge.net collaborative development platform, the world's largest Open Source development site. |
From: <ba...@us...> - 2010-06-03 21:35:04
|
Revision: 219 http://octc.svn.sourceforge.net/octc/?rev=219&view=rev Author: bansp Date: 2010-06-03 21:34:55 +0000 (Thu, 03 Jun 2010) Log Message: ----------- + something to remember about Modified Paths: -------------- trunk/tools/xsl/tokenizer.xsl This was sent by the SourceForge.net collaborative development platform, the world's largest Open Source development site. |
From: <jim...@us...> - 2010-06-12 14:15:26
|
Revision: 240 http://octc.svn.sourceforge.net/octc/?rev=240&view=rev Author: jimregan Date: 2010-06-12 14:15:20 +0000 (Sat, 12 Jun 2010) Log Message: ----------- small fix to get to the next error message with Saxon :) Modified Paths: -------------- trunk/tools/xsl/tokenizer.xsl This was sent by the SourceForge.net collaborative development platform, the world's largest Open Source development site. |
From: <ba...@us...> - 2010-08-05 20:24:13
|
Revision: 313 http://octc.svn.sourceforge.net/octc/?rev=313&view=rev Author: bansp Date: 2010-08-05 20:24:06 +0000 (Thu, 05 Aug 2010) Log Message: ----------- fixy fix of something embarrassing Modified Paths: -------------- trunk/tools/xsl/tokenizer.xsl This was sent by the SourceForge.net collaborative development platform, the world's largest Open Source development site. |
From: <ba...@us...> - 2010-08-06 01:10:29
|
Revision: 314 http://octc.svn.sourceforge.net/octc/?rev=314&view=rev Author: bansp Date: 2010-08-06 01:10:23 +0000 (Fri, 06 Aug 2010) Log Message: ----------- partial commit, more typing, more params, more vars Modified Paths: -------------- trunk/tools/xsl/tokenizer.xsl This was sent by the SourceForge.net collaborative development platform, the world's largest Open Source development site. |
From: <ba...@us...> - 2010-08-06 01:38:06
|
Revision: 315 http://octc.svn.sourceforge.net/octc/?rev=315&view=rev Author: bansp Date: 2010-08-06 01:37:59 +0000 (Fri, 06 Aug 2010) Log Message: ----------- kludge for one of xmllint bugs, the edge problem: https://bugzilla.gnome.org/show_bug.cgi?id=620190 Unfortunately, the problem of embedded markup is still there (https://bugzilla.gnome.org/show_bug.cgi?id=620195) Modified Paths: -------------- trunk/tools/xsl/tokenizer.xsl This was sent by the SourceForge.net collaborative development platform, the world's largest Open Source development site. |
From: <ba...@us...> - 2010-08-06 20:02:20
|
Revision: 317 http://octc.svn.sourceforge.net/octc/?rev=317&view=rev Author: bansp Date: 2010-08-06 20:02:13 +0000 (Fri, 06 Aug 2010) Log Message: ----------- preparing to put in a kludge for another xmllint bug (that prevents me from using string-value of nodes if they embed others) Modified Paths: -------------- trunk/tools/xsl/tokenizer.xsl This was sent by the SourceForge.net collaborative development platform, the world's largest Open Source development site. |
From: <ba...@us...> - 2010-08-06 20:29:14
|
Revision: 318 http://octc.svn.sourceforge.net/octc/?rev=318&view=rev Author: bansp Date: 2010-08-06 20:29:08 +0000 (Fri, 06 Aug 2010) Log Message: ----------- cosmetics Modified Paths: -------------- trunk/tools/xsl/tokenizer.xsl This was sent by the SourceForge.net collaborative development platform, the world's largest Open Source development site. |
From: <ba...@us...> - 2010-08-07 03:54:45
|
Revision: 320 http://octc.svn.sourceforge.net/octc/?rev=320&view=rev Author: bansp Date: 2010-08-07 03:54:38 +0000 (Sat, 07 Aug 2010) Log Message: ----------- + second kludge, around the second outstanding bug (https://bugzilla.gnome.org/show_bug.cgi?id=620195) -- it *really* works now; this might be close to the first time ever that a multilayer stand-off corpus started to work with *generic* XML tools. We should be opening some bottles now. Modified Paths: -------------- trunk/tools/xsl/tokenizer.xsl This was sent by the SourceForge.net collaborative development platform, the world's largest Open Source development site. |
From: <ba...@us...> - 2010-08-07 04:31:23
|
Revision: 323 http://octc.svn.sourceforge.net/octc/?rev=323&view=rev Author: bansp Date: 2010-08-07 04:31:17 +0000 (Sat, 07 Aug 2010) Log Message: ----------- cosmetics Modified Paths: -------------- trunk/tools/xsl/tokenizer.xsl This was sent by the SourceForge.net collaborative development platform, the world's largest Open Source development site. |
From: <ba...@us...> - 2010-08-08 12:59:24
|
Revision: 324 http://octc.svn.sourceforge.net/octc/?rev=324&view=rev Author: bansp Date: 2010-08-08 12:59:17 +0000 (Sun, 08 Aug 2010) Log Message: ----------- treat embedded elements with text content just like non-embedded ones (use the id) Modified Paths: -------------- trunk/tools/xsl/tokenizer.xsl This was sent by the SourceForge.net collaborative development platform, the world's largest Open Source development site. |
From: <ba...@us...> - 2010-12-18 23:23:02
|
Revision: 420 http://octc.svn.sourceforge.net/octc/?rev=420&view=rev Author: bansp Date: 2010-12-18 23:01:47 +0000 (Sat, 18 Dec 2010) Log Message: ----------- keep head elements distinct Modified Paths: -------------- trunk/tools/xsl/tokenizer.xsl This was sent by the SourceForge.net collaborative development platform, the world's largest Open Source development site. |