From: Christian S. <si...@mi...> - 2004-04-21 12:15:50
|
On Wed, 21 Apr 2004, Christian Siefkes wrote: > About the tokenization: The pattern I called "simplified tokenization" was > as follows: > > - In CRM syntax: > [[:graph:]][-[:alnum:]]*[[:graph:]]? > > - In Java syntax (Unicode-based): > [^\p{Z}\p{C}][-\p{L}\p{M}\p{N}]*[^\p{Z}\p{C}] Sorry, the trailing '?' got lost. This must read: [^\p{Z}\p{C}][-\p{L}\p{M}\p{N}]*[^\p{Z}\p{C}]? Bye Christian ------------ Christian Siefkes ----------------------------------------- | Email: chr...@si... | Web: http://www.siefkes.net/ | Graduate School in Distributed IS: http://www.wiwi.hu-berlin.de/gkvi/ -------------------- Offline P2P: http://www.leihnetzwerk.de/ ---------- All I want is more than I deserve. |