We are generating RDF from our XMLDB and the exported XML gives a few parsing problems while importing into our RDF store:
rapper: Parsing URI http://qa.data.nxp.com/dataset/data_sheet.rdf with parser rdfxml
rapper: Serializing with serializer ntriples
rapper: Warning - URI http://qa.data.nxp.com/dataset/data_sheet.rdf:49486 - Property element 'title' has a string not in Unicode Normal Form C: PSMN1R0-30YLC - N-channel 30 V 1.15 mΩ logic level MOSFET in LFPAK using NextPower technology
rapper: Warning - URI http://qa.data.nxp.com/dataset/data_sheet.rdf:49502 - Property element 'title' has a string not in Unicode Normal Form C: PSMN4R5-30YLC - N-channel 30 V 4.8 mΩ logic level MOSFET in LFPAK using NextPower technology
They are due to Normalization of Unicode characters. I was checking Sedna documentation
http://www.sedna.org/progguide/ProgGuidesu5.html#x9-300002.2.1
and noticed I can't use the normalization-form parameter:
http://www.w3.org/TR/xslt-xquery-serialization/#XML_NORMALIZATION-FORM
Anonymous