From: Giuseppe C. <g.c...@gm...> - 2006-03-30 14:05:56
|
I have an off-topic question to propose to this list. I'm working with eXist to store terminology records that contain terms in Arabic and Chinese, among other languages. With UTF-8 encoding I have no problem in the XML files, but I have to export these records to text files in order to import them into another, proprietary application. I do this with an XSLT transformation, but when I import the text files the Arabic and Chinese terms are all transformed into unrecognized characters and rendered as a series of ?. I know that a text file is not encoded in UTF-8 but in a machine-dependent encoding, but how can I maintain the original character sequence even for the Arabic and Chinese terms? This is my XSLT:

<?xml version="1.0" encoding="UTF-8"?>
<xsl:stylesheet xmlns:xsl="http://www.w3.org/1999/XSL/Transform" version="2.0">
    <xsl:output method="text" version="2.0" indent="no" encoding="UTF-8"/>
    <xsl:strip-space elements="*"/>
    <xsl:variable name="newline">
        <xsl:text>
</xsl:text>
    </xsl:variable>
    <xsl:template match="/items">
        <xsl:for-each select="termEntry">
            <xsl:apply-templates select="."/>
        </xsl:for-each>
    </xsl:template>
    <xsl:template match="termEntry">
        <xsl:text>**</xsl:text>
        <xsl:for-each select="descrip">
            <xsl:apply-templates select="."/>
        </xsl:for-each>
        <xsl:for-each select="langSet">
            <xsl:sort select="@xml:lang"/>
            <xsl:apply-templates select="."/>
        </xsl:for-each>
        <xsl:text>**</xsl:text>
        <xsl:value-of select="$newline"/>
    </xsl:template>
    <xsl:template match="descrip">
        <xsl:if test="node()!=''">
            <xsl:text>&lt;</xsl:text>
            <xsl:value-of select="@type"/>
            <xsl:text>&gt;</xsl:text>
            <xsl:apply-templates/>
            <xsl:value-of select="$newline"/>
        </xsl:if>
    </xsl:template>
    <xsl:template match="langSet">
        <xsl:for-each select="tig">
            <xsl:sort select="term"/>
            <xsl:apply-templates select="term">
                <xsl:with-param name="lang" select="parent::langSet/@xml:lang"/>
            </xsl:apply-templates>
            <xsl:apply-templates select="descrip"/>
        </xsl:for-each>
    </xsl:template>
    <xsl:template match="term">
        <xsl:param name="lang"/>
        <xsl:choose>
            [ ... ]
            <xsl:when test="$lang = 'en'">
                <xsl:text>&lt;English&gt;</xsl:text>
            </xsl:when>
            <xsl:when test="$lang = 'ar'">
                <xsl:text>&lt;Arabic&gt;</xsl:text>
            </xsl:when>
            <xsl:when test="$lang = 'zh'">
                <xsl:text>&lt;Chinese&gt;</xsl:text>
            </xsl:when>
        </xsl:choose>
        <xsl:apply-templates/>
        <xsl:value-of select="$newline"/>
    </xsl:template>
</xsl:stylesheet>

Thanks for any help, Giuseppe Corrarello. |
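[Editorial note: the behaviour Giuseppe describes can be reproduced outside XSLT. A minimal Python sketch, with sample strings of my own choosing rather than his data, showing that a plain text file can perfectly well be UTF-8 and that the '?' characters appear only when text is re-encoded into a charset lacking the needed characters:]

```python
# Sketch: UTF-8 round-trips Arabic and Chinese losslessly, while forcing
# the text into a single legacy codepage destroys one script or the other.
arabic = 'زراعة'       # Arabic for "agriculture" (sample, not from the thread)
chinese = '精准农业'    # Chinese for "precision agriculture" (sample)

# A UTF-8 "text file" carries both scripts without loss.
blob = (arabic + '\n' + chinese).encode('utf-8')
assert blob.decode('utf-8') == arabic + '\n' + chinese

# Re-encoding into cp1256 (Arabic Windows) keeps the Arabic but has no
# Chinese at all: every ideograph is replaced by a literal '?'.
assert chinese.encode('cp1256', errors='replace') == b'????'
```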
From: Gary L. <gar...@en...> - 2009-04-14 14:03:41
|
Hi,

I'm not sure how I ended up with non-UTF-8 characters in a query result: ? (0x96), ë (0xEB). Is there a default XQuery encoding, or does it need to be defined explicitly?

declare option exist:serialize "encoding=UTF-8";

Thanks for any help; I have no Dutch examples to test with right now. If it's not that, then it must be occurring during the load (using the XML:DB API).

gary |
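[Editorial note: the two byte values Gary quotes are themselves a strong hint. Neither 0x96 nor 0xEB is valid as a standalone byte in UTF-8, but both are meaningful in Windows-1252. A small Python check, my own sketch rather than anything from the thread:]

```python
# Sketch: 0x96 and 0xEB cannot stand alone in UTF-8 (both are
# continuation bytes there), but in Windows-1252 they are an en dash
# and 'ë' respectively - suggesting the data is cp1252, not UTF-8.
raw = bytes([0x96, 0xEB])

# Decoding as UTF-8 fails outright.
try:
    raw.decode('utf-8')
    assert False, 'should not decode as UTF-8'
except UnicodeDecodeError:
    pass

# Decoding as Windows-1252 yields sensible characters.
assert raw.decode('cp1252') == '\u2013ë'   # en dash followed by 'ë'
```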
From: Gary L. <gar...@en...> - 2009-04-14 16:10:49
|
This is most likely happening somewhere in Cocoon. I'm creating dynamic CForms from the query results, and it looks like the encoding is being dropped after the query, though I don't know where yet.

Sorry, gary

_____
From: Gary Larsen [mailto:gar...@en...]
Sent: Tuesday, April 14, 2009 10:05 AM
To: exi...@li...
Subject: [Exist-open] character encoding |
From: Pierrick B. <pie...@cu...> - 2006-03-30 14:19:41
|
Hi,

Giuseppe Corrarello wrote:
> I have an off topic question to propose at this list.

Indeed.

> I know that a text file is not encoded in UTF-8

Why not? A text file can be encoded in UTF-8, of course. Isn't your problem rather related to the text editor you use to check your text file? Please make sure before we investigate further.

Cheers,

--
Pierrick Brihaye, informaticien
Service régional de l'Inventaire / DRAC Bretagne
mailto:pie...@cu... / tél : +33 (0)2 99 29 67 78
Have you read http://usenet-fr.news.eu.org/fr-chartes/rfc1855.html ? |
From: Giuseppe C. <g.c...@gm...> - 2006-03-30 14:44:15
|
Hi Pierrick, Andy and all,

I know that in a text editor (maybe not all of them) I can't "see" the Arabic and Chinese characters, but the proprietary application imports text files and can show all of these characters. This application reads only text files encoded in ANSI or ASCII format. I have tried to set the xsl:output element's encoding attribute to US-ASCII, and in this way I lose all the characters like à, è etc. With the UTF-8 attribute I get the correct visualization of these characters, but still not of the Chinese and Arabic ones.

Thanks for your help, Giuseppe. |
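[Editorial note: the US-ASCII behaviour Giuseppe reports is expected, since ASCII simply has no code points for à or è; a serializer forced to ASCII can only drop, replace, or escape them. A quick Python illustration, mine rather than the thread's:]

```python
# Sketch: US-ASCII cannot represent à or è at all, so forcing ASCII
# output loses them, exactly as Giuseppe observed.
assert 'à'.encode('ascii', errors='replace') == b'?'
assert 'è'.encode('ascii', errors='replace') == b'?'

# UTF-8, by contrast, represents each of them as a two-byte sequence.
assert 'à'.encode('utf-8') == b'\xc3\xa0'
assert 'è'.encode('utf-8') == b'\xc3\xa8'
```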
From: Pierrick B. <pie...@cu...> - 2006-03-30 14:44:15
|
Hi,

Giuseppe Corrarello wrote:
> This application reads only text files encoded in ANSI or ASCII format,
> but I have tried to set the xsl:output element's encoding attribute to
> US-ASCII and in this way I have lost all the characters like à, è etc.

Sure.

> With the UTF-8 attribute I have the correct visualization of these
> characters but not yet for the Chinese and Arabic.

IMHO, the character encoding is correct, but your text editor seems to be unable to represent the non-Latin characters.

Maybe an output encoded in ISO-8859-6 would give some results with Arabic? Find some test documents there ;-)

http://cvs.savannah.nongnu.org/viewcvs/aramorph/src/java/gpl/pierrick/brihaye/aramorph/test/testdocs/?root=aramorph

Cheers,

--
Pierrick Brihaye, informaticien
Service régional de l'Inventaire / DRAC Bretagne
mailto:pie...@cu... / tél : +33 (0)2 99 29 67 78
Have you read http://usenet-fr.news.eu.org/fr-chartes/rfc1855.html ? |
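[Editorial note: Pierrick's ISO-8859-6 suggestion is easy to try programmatically, but it only helps if the importing application expects that exact codepage. A hedged Python sketch with a sample word of my choosing, showing that ISO-8859-6 and the Windows Arabic codepage (cp1256) produce different bytes for the same text:]

```python
# Sketch: transcoding UTF-8 Arabic into ISO-8859-6 works for the basic
# letters, but the bytes differ from cp1256 (Windows Arabic), so the
# target application must expect the right one of the two.
word = 'زراعة'  # "agriculture" (sample word, not from Giuseppe's data)

iso = word.encode('iso-8859-6')
cp = word.encode('cp1256')

# Each encoding round-trips its own bytes...
assert iso.decode('iso-8859-6') == word
assert cp.decode('cp1256') == word

# ...but the byte sequences are not identical, e.g. the letter ع sits
# at a different position in the two codepages.
assert iso != cp
```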
From: Giuseppe C. <g.c...@gm...> - 2006-03-30 15:03:21
|
> > With the UTF-8 attribute I have the correct visualization of these
> > characters but not yet for the Chinese and Arabic.
>
> IMHO, the character encoding is correct, but your text editor seems to
> be unable to represent the non-Latin characters.

I too think that this is the correct encoding, but in fact it isn't. Anyway, I don't open my result file in a text editor; I use it for importing the terminology records into an old version of Multiterm by Trados.

> Maybe an output encoded in ISO-8859-6 would give some results with Arabic?

I have tried to set this encoding, but nothing changed; I still can't see the Arabic characters.

> Find some test documents there ;-)
> http://cvs.savannah.nongnu.org/viewcvs/aramorph/src/java/gpl/pierrick/brihaye/aramorph/test/testdocs/?root=aramorph

Thanks again, Giuseppe. |
From: Thomas Z. <tho...@un...> - 2006-03-31 09:49:55
|
Giuseppe Corrarello wrote:
> I think too that this is the correct encoding, but in fact isn't;
> anyway I don't open my result file in a text editor, I use it for
> importing the terminology records into an old version of Multiterm
> by Trados software.
>
> I have tried to set this encoding but nothing is changed, I can't
> see the Arabic characters.

Arabic *and* Chinese characters in one file..?? Puh, I don't know if there is any text editor or text component which is able to display something like that correctly... Am I right that Arabic is read from right to left and Chinese from top to bottom...?

But perhaps some useful hints:

Did you try to open your file in a web browser, like Firefox, setting the encoding manually via "View / Character Encoding"? Try to use a font which really contains all Unicode characters:

http://scripts.sil.org/cms/scripts/page.php?site_id=nrsi&id=encore-ipa

Greetings,

Tom

--
Thomas Zastrow
Seminar fuer Sprachwissenschaft
Universitaet Tuebingen
http://www.sfs.uni-tuebingen.de/dialectometry/
Wilhelm Str. 19
D-72074 Tübingen
Tel.: 07071/29-73968
Fax: 07071/29-5214 |
From: Giuseppe C. <g.c...@gm...> - 2006-03-31 11:17:02
|
Hi Thomas,

> Arabic *and* Chinese characters in one file..?? Puh, I don't know if
> there is any text editor or text component which is able to display
> something like that correctly... Am I right that Arabic is read from
> right to left and Chinese from top to bottom...?

Yes, you are right, but this is not my problem. Let me try to explain my problem better. I have an application (Multiterm) that stores terminology records in a proprietary format, in all languages, even Chinese, Arabic, Russian etc. This application can export and import terminology records *only* in text files. When I export a record from Multiterm, this is the result:

**
<Entry Number>59877
<Status>Temporary
<Reliability>1
<Source Language>en
<Category>Terminology
<Subject>AGRICULTURE
<English>site-specific management
<Remarks>Short denomination.
<English>SSCM
<Form>Abbreviation
<French>agriculture de précision
<Remarks>Dénomination alternative.
<Source>^INRA^, France, 2005.
<French>aménagement spécifique à un site
<Remarks>Dénomination alternative.
<Source>^INRA^, France, 2005.
<Spanish>manejo sitio específico
<Source>Sitio Web de la Agricultura de precisión, 2005 (http://www.agriculturadeprecision.org/presfut/GlosarioAgPrec.htm)
<Spanish>ordenación específica para cada lugar
<Source>Unasylva No. 177, FAO, 1994 (T2230)
<Spanish>agricultura de precisión
<Arabic>ÒÑÇÚÉ ãõÍúßãÉ
<Arabic Source>International Centre for Agricultural Research in the Dry Areas, ICARDA, FAO Terminology Project, 2003.
<Chinese>¾«×¼Å©Òµ
<Chinese>¾«È·Å©Òµ
<Italian>agricoltura di precisione
<Source>Istituto sperimentale per la meccanizzazione agricola (ISMA), Italia, 2005 (http://www.ingegneriaagraria.it/home/index.php?interno=ricerca.php&idricerca=19); GB, FAO, 2005.
<Italian>coltivazione in funzione della superficie
<Remarks>Traduzione.
<Source>GB, FAO, 2005.
**

In this case the Arabic and the Chinese entries are *not* visible in a text editor; I see strange characters, because the encoding of the text editor is wrong.

Well, now I have a terminology database stored in XML format and encoded in UTF-8. I have defined my XSLT transformation to import my XML records into Multiterm, but after the transformation the record that I see in a text editor, and in Multiterm too, is in this form:

**
<Entry Number>59877
<Status>Temporary
<Reliability>1
<Source Language>en
<Category>Terminology
<Subject>AGRICULTURE
<English>site-specific management
<Remarks>Short denomination.
<English>SSCM
<Form>Abbreviation
<French>agriculture de précision
<Remarks>Dénomination alternative.
<Source>^INRA^, France, 2005.
<French>aménagement spécifique à un site
<Remarks>Dénomination alternative.
<Source>^INRA^, France, 2005.
<Spanish>manejo sitio específico
<Source>Sitio Web de la Agricultura de precisión, 2005 (http://www.agriculturadeprecision.org/presfut/GlosarioAgPrec.htm)
<Spanish>ordenación específica para cada lugar
<Source>Unasylva No. 177, FAO, 1994 (T2230)
<Spanish>agricultura de precisión
<Arabic>????????????
<Arabic Source>International Centre for Agricultural Research in the Dry Areas, ICARDA, FAO Terminology Project, 2003.
<Chinese>????????
<Chinese>?????????
<Italian>agricoltura di precisione
<Source>Istituto sperimentale per la meccanizzazione agricola (ISMA), Italia, 2005 (http://www.ingegneriaagraria.it/home/index.php?interno=ricerca.php&idricerca=19); GB, FAO, 2005.
<Italian>coltivazione in funzione della superficie
<Remarks>Traduzione.
<Source>GB, FAO, 2005.
**

I know that the question marks are not *really* question marks; anyway this record is different from the first one, and the difference is noticed by Multiterm, which displays all the Arabic and Chinese entries as ??????.

Maybe now my problem is clearer. Thanks a lot, Giuseppe. |
From: Pierrick B. <pie...@cu...> - 2006-03-31 10:24:27
|
Hi,

Giuseppe Corrarello wrote:
> > Arabic *and* Chinese characters in one file..?? Puh, I don't know if
> > there is any text editor or text component which is able to display
> > something like that correctly... Am I right that Arabic is read from
> > right to left and Chinese from top to bottom...?
>
> yes, you are right but this is not my problem.

We're definitely getting off-topic....

> When I export a record from Multiterm this is the result:
> <Arabic>ÒÑÇÚÉ ãõÍúßãÉ
> <Arabic Source>International Centre for Agricultural Research in the Dry

This one is OK in CP-1256 (not in ISO-8859-6).

> Well, now I have a terminology database stored in XML format and encoded
> in UTF-8. I have defined my XSLT transformation to import my XML records
> into Multiterm, but after the transformation the record that I see in a
> text editor and in Multiterm too is in this form:
> <Arabic>????????????

No way.

> I know that the question marks are not *really* question marks

I'm afraid they are. For some reason, some software seems to have replaced "unknown" characters with question marks. A hexadecimal dump (for example) would tell.

Cheers,

--
Pierrick Brihaye, informaticien
Service régional de l'Inventaire / DRAC Bretagne
mailto:pie...@cu... / tél : +33 (0)2 99 29 67 78
Have you read http://usenet-fr.news.eu.org/fr-chartes/rfc1855.html ? |
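[Editorial note: Pierrick's hex-dump suggestion settles the question quickly: if the file really contains 0x3F bytes, the characters were replaced before the file was ever written. A possible Python sketch, with an invented sample line:]

```python
# Sketch: a hex dump distinguishes real '?' characters (byte 0x3F)
# from mojibake (which would show other, high byte values).
line = b'<Arabic>????'   # invented sample of the suspect output

dump = ' '.join(f'{b:02x}' for b in line)
# Literal question marks show up as 0x3F bytes in the dump:
assert dump.endswith('3f 3f 3f 3f')
```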
From: <sju...@ko...> - 2006-03-31 12:25:56
|
[To read this e-mail correctly, your e-mail client must be able to handle and display Unicode (UTF-8) correctly]

On 31 March 2006, at 13:16, Giuseppe Corrarello wrote:

> Hi Thomas,
>
> Arabic *and* Chinese characters in one file..?? Puh, I don't know if
> there is any text editor or text component which is able to display
> something like that correctly... Am I right that Arabic is read from
> right to left and Chinese from top to bottom...?

Not correctly in the strict sense, obeying all typographical features of the script, but the characters represented by the corresponding codepoints? That shouldn't be a problem for any application that can deal with Unicode. Seeing the characters in the character stream as such is not the same as displaying them correctly, I know, but it's still a lot better than a row of question marks :-) (the output below was given by a standard Mac OS X editor)

> <Arabic>ÒÑÇÚÉ ãõÍúßãÉ
> <Chinese>¾«×¼Å©Òµ

By taking these strings as Windows Latin 1 representations of the byte sequences making up the original string, I asked my editor to reinterpret the byte sequence as Arabic (Windows) and Chinese (GB 18030) respectively. The result is the following:

زراعة مُحْكمة
精准农业

I am not able to judge whether this is correct Arabic or Chinese, but it looks pretty close to me :-) (and what you see depends on your e-mail client - cross your fingers!)

(I don't know what Arabic (Windows) means - there's no further info in my editor - but you can probably give it an educated guess?)

That is: Multiterm is mixing encodings in the same (exported) document, and you need to convert the encoding of each element separately into UTF-8.

By the way: the data you quoted isn't XML, but you're probably aware of that.

Regards,
Sjur |
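[Editorial note: Sjur's manual reinterpretation can be scripted. The codepage names below are my reading of his description, taking "Arabic (Windows)" as cp1256 and the Chinese as GB 18030; a hedged Python sketch:]

```python
# Sketch of Sjur's reinterpretation: the mojibake strings are the
# original cp1256 / GB 18030 bytes misread as Windows Latin 1 (cp1252),
# so encoding them back to cp1252 and re-decoding with the right
# codepage recovers the original text.
def fix(mojibake: str, real_encoding: str) -> str:
    return mojibake.encode('cp1252').decode(real_encoding)

assert fix('ÒÑÇÚÉ', 'cp1256') == 'زراعة'        # Arabic "agriculture"
assert fix('¾«×¼Å©Òµ', 'gb18030') == '精准农业'  # "precision agriculture"
```

This matches Sjur's conclusion: each field would need its own conversion, since the exported file mixes at least two legacy encodings in one document.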