From: Erich B. <er...@eb...> - 2018-03-30 20:35:04
|
The fault was mine. I forgot to specify the "UTF-8" encoding in the file writer: writer = new OutputStreamWriter(new GZIPOutputStream(output)); to writer = new OutputStreamWriter(new GZIPOutputStream(output), "UTF-8"); Although the file looks near identical and even looks correct in some programs, it wasn't until the encoding is added that the resultant file is correctly uploaded into Virtuoso. - Erich On Tue, Mar 27, 2018 at 7:29 PM, Hugh Williams <hwi...@op...> wrote: > Hi Erich, > > When you say bulk update method I presume you mean with the Virtuoso RDF > Bulk Loader ie https://virtuoso.openlinksw.com/dataspace/doc/ > dav/wiki/Main/VirtBulkRDFLoader ? > > If so are you able to provide a sample NT file for upload to recreate > locally , as we have not had such complaints and Virtuoso is used for > hosting datasets in many languages with unicode chars which are uploaded > using the bulk loader ? > > Best Regards > Hugh Williams > Professional Services > OpenLink Software, Inc. // http://www.openlinksw.com/ > Weblog -- http://www.openlinksw.com/blogs/ > LinkedIn -- http://www.linkedin.com/company/openlink-software/ > Twitter -- http://twitter.com/OpenLink > Google+ -- http://plus.google.com/100570109519069333827/ > Facebook -- http://www.facebook.com/OpenLinkSoftware > Universal Data Access, Integration, and Management Technology Providers > > > > On 27 Mar 2018, at 21:41, Erich Bremer <er...@eb...> wrote: > > I've uploaded some Unicode characters via sparql-auth endpoint using: > > prefix : <http://www.stonybrook.edu/ns/> > insert data {graph <http://www.ebremer.com> {<urn:sbu:khan> :status > "卩尺乇ᐯ丨乇山 ㄒ乇乂ㄒ"}} > > which will display fine: > urn:sbu:test http://www.stonybrook.edu/ns/status > > "Yay!" > > urn:sbu:test http://www.stonybrook.edu/ns/status > > "Ω" > > urn:sbu:test http://www.stonybrook.edu/ns/status > > "卩尺乇ᐯ丨乇山 ㄒ乇乂ㄒ" > > > It seems the unicode data uploaded via the bulk update method ala isql is > not uploading the characters correctly. They get converted to question > marks. - E > > On Thu, Mar 22, 2018 at 8:07 PM, Erich Bremer <er...@eb...> wrote: > >> I've uploaded some N-Triples files using the bulk uploader method to my >> Virtuoso instance (07.20.3217). The NT files look fine, but when I query >> the data via the SPARQL endpoint, the Unicode characters are all converted >> to question marks. How do I properly upload Unicode data into Virtuoso? - >> Erich Bremer >> > > ------------------------------------------------------------ > ------------------ > Check out the vibrant tech community on one of the world's most > engaging tech sites, Slashdot.org! http://sdm.link/slashdot______ > _________________________________________ > Virtuoso-users mailing list > Vir...@li... > https://lists.sourceforge.net/lists/listinfo/virtuoso-users > > > |