Docutils: Documentation Utilities / Patches / #113 writers/odf_odt: Use only ASCII filenames in ODF packages

writers/odf_odt: Use only ASCII filenames in ODF packages

#113 writers/odf_odt: Use only ASCII filenames in ODF packages

Milestone: None

Status: closed-fixed

Owner: nobody

Labels: None

Priority: 5

Updated: 2023-04-18

Created: 2013-08-05

Creator:

Private: No

The odf_odt writer embeds images in its output files and uses the original filenames as part of the embedded filenames. Since the OpenDocument standard does not specify the filename charset, recode to ASCII (dropping non-representable characters) to be on the safe side.

The actual reason that brought about this patch is an invalid assumption about character sets in docutils.writers.odf_odt.Writer.store_embedded_files(). This has been reported as Debian bug http://bugs.debian.org/714317.

1 Attachments

odt-writer-ascii-filenames.diff

Discussion

engelbert gruber - 2015-02-11

the patch does two things. first

remove decode('latin-1').encode('utf-8')
the filename stored in zipfile.

seams good to me. as the filename refererenced should not be
changed and encoding/decoding should have happened in docutils.io anyway

APPLIED in revision 7786

If you would like to refer to this comment somewhere else in this project, copy and paste the following link:

engelbert gruber - 2015-02-11

second::

def visit_image(self, node):

@@ -2076,7 +2075,8 @@
else:
self.image_count += 1
filename = os.path.split(source)[1]
- destination = 'Pictures/1%08x%s' % (self.image_count, filename, )
+ destination = 'Pictures/1%08x_%s' % (self.image_count,
+ filename.encode("ascii", "ignore"))
if source.startswith('http:'):
try:

i do not see why the first part removes encode and the second adds ?

NOT APPLIED

If you would like to refer to this comment somewhere else in this project, copy and paste the following link:

Günter Milde - 2023-04-18

status: open --> closed-fixed
If you would like to refer to this comment somewhere else in this project, copy and paste the following link:

Günter Milde - 2023-04-18

The related Debian bug was closed in 2013. Thanks for the patch.

If you would like to refer to this comment somewhere else in this project, copy and paste the following link:

writers/odf_odt: Use only ASCII filenames in ODF packages

Group

Searches

Help

#113 writers/odf_odt: Use only ASCII filenames in ODF packages

Discussion