jPod intarsys PDF library / Patches / #3 Some useful functions for jPod: TIFF, JPEG, lossless, unicode TTF writing

I was able to add the cmap generation too.

Something like this:

        StringBuilder cmapContents = new StringBuilder();
        cmapContents.append("/CIDInit /ProcSet findresource begin\n");
        cmapContents.append("12 dict begin\n");
        cmapContents.append("begincmap\n");
        cmapContents.append("/CIDSystemInfo << /Registry (Adobe) /Ordering (UCS) /Supplement 0 >> def\n");
        cmapContents.append(String.format("/CMapName /%s def\n", cmapName));
        cmapContents.append("/CMapType 2 def\n");
        cmapContents.append("1 begincodespacerange\n");
        cmapContents.append("<0000> <ffff>\n");
        cmapContents.append("endcodespacerange\n");

        /* Reverse the map from unicode -> glyph to glyph -> unicode. */
        Map<Object, List<Map.Entry<?, ?>>> map = unicodeMap.entrySet().stream().collect(Collectors.groupingBy(x -> x.getValue()));
        cmapContents.append(map.size() + " beginbfchar\n");
        for (Map.Entry<Object, List<Map.Entry<?, ?>>> entry : map.entrySet()) {
            int glyphId = (Integer) entry.getKey();
            cmapContents.append(String.format("<%04x> <", glyphId));
            /* What to do about multiple mappings? We could crash or something... */
            int codePoint = (Integer) entry.getValue().get(0).getKey();
            byte[] value = new String(Character.toChars(codePoint)).getBytes(StandardCharsets.UTF_16BE);
            for (byte b : value) {
                cmapContents.append(String.format("%02x", b & 0xff));
            }
            cmapContents.append(">\n");
        }
        cmapContents.append("endbfchar\n");
        cmapContents.append("endcmap\n");
        cmapContents.append("CMapName currentdict /CMap defineresource pop\n");
        cmapContents.append("end\n");
        cmapContents.append("end\n");
        COSStream cmapObject = COSStream.create(null);
        cmapObject.setDecodedBytes(cmapContents.toString().getBytes(StandardCharsets.UTF_8));
        pdFont0.setToUnicode((CMap) InternalCMap.META.createFromCos(cmapObject));

mtraut - 2014-06-10

Thank you for your support.

I hope we can come back to your submission with our next release. To enable us to do so, please decorate your code/posting clearly with lesser BSD license agreement.

Currently we are very busy, so please forgive any delay.

Maven repository is not supported by us. We currently only provide code as a file download on this site. The maven submission is managed by a third party supporter.

Regards, Michael

If you would like to refer to this comment somewhere else in this project, copy and paste the following link:
- Antti S. Lankila - 2014-06-10
  
  mtraut mtraut@users.sf.net kirjoitti 10.6.2014 kello 10.24:
  
  Thank you for your support.
  
  I hope we can come back to your submission with our next release. To enable us to do so, please decorate your code/posting clearly with lesser BSD license agreement.
  
  OK, I can do this. I’ve made some tweaks and correctness fixes that need to be included, mostly two things: TIFF PDImage must have setBitsPerComponent(1) called, or image will be rejected by Acrobat Reader; and font must have a getFlags().setSymbolic(false) called because of the way these flags work.
  Maven repository is not supported by us. We currently only provide code as a file download on this site. The maven submission is managed by a third party supporter.
  
  Yeah, I guess I’ll try to have to figure out who this is and contact him.
  
  Antti
  
  If you would like to refer to this comment somewhere else in this project, copy and paste the following link:

Antti S. Lankila - 2014-07-11

OK, this is the new version. I added the 3-clause BSD license (which is what I assume you mean by "lesser") as a javadoc for the class.

PDFBoxImprovements.java

If you would like to refer to this comment somewhere else in this project, copy and paste the following link:

Antti S. Lankila - 2014-07-11

... and excuse the name. This was supposed to be called jpodimprovements but I was just studying recent changes in pdfbox when I prepared the file.

If you would like to refer to this comment somewhere else in this project, copy and paste the following link:

Some useful functions for jPod: TIFF, JPEG, lossless, unicode TTF writing

Group

Searches

Help

#3 Some useful functions for jPod: TIFF, JPEG, lossless, unicode TTF writing

Discussion