Re: [Pyparsing] To parse any language character set

SourceForge Headquarters 225 Broadway Suite 1600 San Diego, CA 92101 +1 (858) 422-6466

On Monday 27 October 2008, Ujjaval Suthar wrote:
> Hi everyone,
>
> I need to parse strings with mix of English and any other unicode
> characters from any Asian or European languages.
>
> The format of strings is like following:
>
> ABC|[any unicode character]|[any unicode character]|XYZ

Hello Ujjaval!

If I understand your question right, CharsNotIn is the parser you are 
looking for. I don't see any general problem with Unicode. As you 
seem somewhat knowledgeable about the requirements of Asian 
languages, you could maybe propose a parser for words in Asian 
languages (or even post a patch).

Here is an example for CharsNotIn:
  http://pastebin.com/f7d6a3331

I hope this helped you.

Kind regards,
Eike.