Add support for reading DPI information from JPEG2000 images #5568

rogermb · 2021-06-30T14:26:05Z

Changes proposed in this pull request:

Currently, Pillow does not read the DPI information for JPEG2000 images, unlike it does for other image formats.
This PR seeks to remedy this issue by also parsing the "resc" JPEG2000 header box if it is present.
I didn't want to copy & paste the same box header reading code a third time due to the hierarchical format of the JPEG2000 header, so I wrapped the common header reading code in a small helper class (BoxReader), see the first commit.
However, this only adds support for reading DPI information, as OpenJPEG currently lacks support for writing the required "res " and "resc" header boxes. There are already years-old open issues addressing this shortcoming: OpenJPEG/#378, OpenJPEG/#788.

This is my first contribution to Pillow, so please let me know if I messed something up :)

rogermb · 2021-06-30T15:17:03Z

I'm not sure what's up with the pypy-3.6 x86 build failing on Windows. It worked on my fork and I haven't even modified the file that seems to be causing problems, Tests/test_image_getextrema.py.

Perhaps it's just a random failure and running the Windows build again will fix it?

radarhere · 2021-06-30T23:03:58Z

I ran the job again and it is passing now

src/PIL/Jpeg2KImagePlugin.py

rogermb · 2021-08-01T23:08:38Z

src/PIL/Jpeg2KImagePlugin.py

@@ -154,9 +154,6 @@ def _parse_jp2_header(fp):
            if reader.read_fields(">4s")[0] == b"jpx ":
                mimetype = "image/jpx"

-    if header is None:
-        raise SyntaxError("Could not find JP2 header")


You're right, this is unreachable, reader.has_next_box() will never return False (because reader = BoxReader(fp) is initialized without a length), so the only possible outcome for a malformed image not containing a "jp2h" header is that reader.next_box_type() will eventually fail with some kind of exception.

While not exactly elegant, that's definitely better than having an if statement that misleads the reader. 👍

rogermb · 2021-08-01T23:09:22Z

src/PIL/Jpeg2KImagePlugin.py

@@ -33,12 +33,12 @@ def __init__(self, fp, length=-1):
        self.remaining_in_box = -1

    def _can_read(self, num_bytes):
+        if self.has_length and self.fp.tell() + num_bytes > self.length:
+            # Outside box: ensure we don't read past the known file length
+            return False
        if self.remaining_in_box >= 0:


This could technically also be an elif for symmetry and whatnot -- not that it matters, since we return from the previous if branch anyway. 👍 on spotting and fixing the bug where we could read past the parent box length!

rogermb · 2021-08-01T23:15:28Z

src/PIL/Jpeg2KImagePlugin.py

-        )
-    return (254 * num * (10 ** exp)) / (10000 * denom)
+    if denom != 0:
+        return num / denom * (10 ** exp) * 0.0254


I should've probably clarified this in a code comment: the reason for my roundabout way of calculating the resolution was to remain precise by working with ints for as long as possible, and to only have a single (division) operation which introduces floating point errors.

While your code is more elegant and more easily readable, there are now 3 floating point operations that can introduce slight numeric errors.

I'm not sure whether my approach is overkill, though. It's not like those last few mantissa bits are ever realistically going to matter.

Ah, ok. I was thinking of being consistent with how this value is used elsewhere in Pillow.

I'll switch back to your version.

Awesome, thanks! 😄

rogermb · 2021-08-01T23:23:57Z

Also, thank you for the additional test cases! 🚀

rogermb · 2021-08-02T12:25:45Z

LGTM! Thank you for all of the code improvements 😄

rogermb added 2 commits June 30, 2021 06:43

Create BoxReader helper class to parse JPEG2000 header

7f275c1

Attempt to read dpi information from JPEG2000's resc header box

5f4653d

radarhere changed the title ~~Add support for reading dpi information for JPEG2000 images~~ Add support for reading DPI information from JPEG2000 images Jun 30, 2021

radarhere added the JPEG label Jun 30, 2021

radarhere mentioned this pull request Jul 25, 2021

JPEG2000 Resolution rogermb/Pillow#1

Merged

radarhere added 4 commits August 1, 2021 18:38

If DPI is invalid, ignore it instead of raising an error

ae54838

Stop reading from "res " after all information is extracted

3ee5a9b

Prevent reading past end of file pointer even if box length allows it

0c600f1

Removed unreachable code

8828080

radarhere force-pushed the jpeg2000-resolution branch from 4cbae0d to c4567e8 Compare August 1, 2021 08:41

Added tests

8045ecc

radarhere force-pushed the jpeg2000-resolution branch from c4567e8 to 8045ecc Compare August 1, 2021 09:01

radarhere reviewed Aug 1, 2021

View reviewed changes

src/PIL/Jpeg2KImagePlugin.py Show resolved Hide resolved

rogermb commented Aug 1, 2021

View reviewed changes

Favour integer operations when calculating DPI

dab5721

radarhere merged commit 6406dab into python-pillow:master Aug 2, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add support for reading DPI information from JPEG2000 images #5568

Add support for reading DPI information from JPEG2000 images #5568

rogermb commented Jun 30, 2021

rogermb commented Jun 30, 2021

radarhere commented Jun 30, 2021

rogermb Aug 1, 2021 •

edited

rogermb Aug 1, 2021

rogermb Aug 1, 2021 •

edited

radarhere Aug 2, 2021

rogermb Aug 2, 2021

rogermb commented Aug 1, 2021 •

edited

rogermb commented Aug 2, 2021 •

edited

Add support for reading DPI information from JPEG2000 images #5568

Add support for reading DPI information from JPEG2000 images #5568

Conversation

rogermb commented Jun 30, 2021

rogermb commented Jun 30, 2021

radarhere commented Jun 30, 2021

rogermb Aug 1, 2021 • edited

Choose a reason for hiding this comment

rogermb Aug 1, 2021

Choose a reason for hiding this comment

rogermb Aug 1, 2021 • edited

Choose a reason for hiding this comment

radarhere Aug 2, 2021

Choose a reason for hiding this comment

rogermb Aug 2, 2021

Choose a reason for hiding this comment

rogermb commented Aug 1, 2021 • edited

rogermb commented Aug 2, 2021 • edited

rogermb Aug 1, 2021 •

edited

rogermb Aug 1, 2021 •

edited

rogermb commented Aug 1, 2021 •

edited

rogermb commented Aug 2, 2021 •

edited