Re: [Pyparsing] To parse any language character set
From: Eike W. <eik...@gm...> - 2008-10-27 15:02:06
On Monday 27 October 2008, Ujjaval Suthar wrote:
> Hi everyone,
>
> I need to parse strings with mix of English and any other unicode
> characters from any Asian or European languages.
>
> The format of strings is like following:
>
> ABC|[any unicode character]|[any unicode character]|XYZ

Hello Ujjaval!

If I understand your question right, CharsNotIn is the parser you are looking for. I don't see any general problem with Unicode.

As you seem somewhat knowledgeable about the requirements of Asian languages, you could maybe propose a parser for words in Asian languages (or even post a patch).

Here is an example for CharsNotIn:
http://pastebin.com/f7d6a3331

I hope this helped you.

Kind regards,
Eike.
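
For readers who cannot reach the pastebin link, here is a minimal sketch of how CharsNotIn might be applied to the format in the question. It is not the code from the link. It assumes Python 3 (where every str is Unicode), that "|" is the only separator, and that the record always starts with ABC and ends with XYZ. The names field, sep, record and parse_record are made up for illustration.

```python
from pyparsing import CharsNotIn, Literal, Suppress

# A field is any run of characters other than the "|" separator,
# so non-ASCII text needs no special handling.
field = CharsNotIn("|")

# The separator is matched but dropped from the results.
sep = Suppress("|")

# Assumed record layout, taken from the question:
#   ABC|<field>|<field>|XYZ
record = Literal("ABC") + sep + field + sep + field + sep + Literal("XYZ")


def parse_record(text):
    """Return the four parts of one record as a list of strings."""
    return record.parseString(text, parseAll=True).asList()


print(parse_record("ABC|日本語|Grüße|XYZ"))
```

Note that CharsNotIn requires at least one character by default, so a record with an empty field such as "ABC||x|XYZ" would be rejected by this sketch.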