Thread: [Pyparsing] To parse any language character set

Hi everyone,

I need to parse strings that mix English with arbitrary Unicode characters
from any Asian or European language.

The strings have the following format:

ABC|[any unicode character]|[any unicode character]|XYZ

In the string above, ABC and XYZ are literals that mark the start and end of
the string, and '|' is the delimiter that separates the content in between.

How can I use pyparsing to parse this kind of string? The result should be a
list of the Unicode strings that appear between ABC and XYZ, which are
separated from each other by '|'.

Thanks,

Ujjaval
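
For illustration, here is a minimal sketch of one way a grammar for the format
described above might look. It is not a confirmed answer from the list. It
assumes Python 3 (where every str is already Unicode), that fields are
non-empty, and that a field never contains '|'. The names parse_record, field
and grammar are made up for this example. Because pyparsing does not
backtrack, the sketch uses a negative lookahead so that the closing XYZ is not
swallowed as an ordinary field.

```python
from pyparsing import CharsNotIn, Literal, StringEnd, Suppress, ZeroOrMore

# Start/end markers and the delimiter are matched but dropped from the results.
start = Suppress(Literal("ABC"))
end = Suppress(Literal("XYZ"))
bar = Suppress(Literal("|"))

# A field is any run of characters other than '|' (assumption: non-empty).
# The lookahead refuses a trailing "XYZ" at end of input, so the closing
# marker is left for `end` to match.
field = ~(Literal("XYZ") + StringEnd()) + CharsNotIn("|")

grammar = start + bar + field + ZeroOrMore(bar + field) + bar + end


def parse_record(text):
    """Return the list of fields found between ABC and XYZ (hypothetical helper)."""
    return list(grammar.parseString(text, parseAll=True))


print(parse_record("ABC|日本語|héllo|XYZ"))  # ['日本語', 'héllo']
```

Input that does not follow the format raises pyparsing.ParseException. If
empty fields or other delimiters are needed, the field expression would have
to change accordingly.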

pyparsing-users