Fix image post-processing for OWLv2 #30686

jla524 · 2024-05-07T06:38:05Z

What does this PR do?

Fixes #30131 (issue)

Who can review?

amyeroberts

Thanks for updating our example, @jla524!

This should really be handled within the model's image processor itself, specifically the rescaling of the box dimensions.

amyeroberts

Looks great - thanks for taking the time to update the example as well!

Technically this is a breaking change. However, as many users have highlighted this confusing behaviour and it should always have been done in the post-processing method, I think it's OK to add directly.

Final request is to add a test here so that we check the images and boxes are rescaled as expected in the post processing method. Once that's done I think we're good to merge!

HuggingFaceDocBuilderDev · 2024-05-08T10:10:52Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

jla524 · 2024-05-08T22:09:52Z

Added a test for the resize. I'm not sure if this is the right approach. Let me know what you think!

amyeroberts

Looks great - thanks for iterating on this!

For the test, have you visualized the bounding boxes to confirm they are sensible? It would be good to check this + bboxes on the examples to make sure everything is a-ok before merge. Otherwise looks good to go!

tests/models/owlv2/test_image_processor_owlv2.py

Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

jla524 · 2024-05-09T14:52:00Z

Yes, the boxes look reasonable. One thing to note is that the box on the right is slightly out of bounds: it has an x_max of 642.32, whereas the image itself is only 640 pixels wide.
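
To make the overshoot concrete, here is a minimal sketch that measures how far that box extends past the right edge and clamps it to the image. The box coordinates are the ones reported later in this thread; the 480-pixel height and the `clip_box` helper are assumptions made for illustration and are not part of the PR.

```python
def clip_box(box, width, height):
    """Clamp an [x_min, y_min, x_max, y_max] box to the image bounds."""
    x_min, y_min, x_max, y_max = box
    return [
        min(max(x_min, 0.0), width),
        min(max(y_min, 0.0), height),
        min(max(x_max, 0.0), width),
        min(max(y_max, 0.0), height),
    ]


# Assumed image size: the 640 px width is stated above, the 480 px height is an assumption.
image_width, image_height = 640, 480

# Right-hand box as reported in the doctest output further down this thread.
box = [341.67, 23.39, 642.32, 371.35]

overshoot = box[2] - image_width  # how far x_max extends past the right edge
clipped = clip_box(box, image_width, image_height)

print(f"x_max overshoot: {overshoot:.2f} px")
print("clipped box:", clipped)
```

Whether to clamp at all is a design choice: the PR leaves the predicted coordinates as they are, so any clipping like this would happen in user code.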

amyeroberts

Thanks for fixing, adding tests and verifying the outputs!

ydshieh · 2024-05-13T12:14:10Z

Hi @jla524, thank you for the fix.

Our doctesting for this model now has one failure. See below.

python3 -m pytest -v --make-reports doc_tests_gpu_docs_source_en_model_doc_owlv2.md --doctest-modules docs/source/en/model_doc/owlv2.md -sv --doctest-continue-on-failure --doctest-glob="*.md"

(on a T4 GPU machine)

I think this is expected? Would you like to take a look to confirm, and open a pull request to update docs/source/en/model_doc/owlv2.md if that is the way to go?

Expected:
    Detected a photo of a cat with confidence 0.614 at location [341.67, 17.54, 642.32, 278.51]
    Detected a photo of a cat with confidence 0.665 at location [6.75, 38.97, 326.62, 354.85]
Got:
    Detected a photo of a cat with confidence 0.614 at location [341.67, 23.39, 642.32, 371.35]
    Detected a photo of a cat with confidence 0.665 at location [6.75, 51.96, 326.62, 473.13]
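
The two outputs differ only in their y coordinates, and by a constant factor of 4/3. That is what one would expect if the normalized boxes used to be scaled by the image's own width and height and are now scaled by its longer side on both axes. The sketch below reproduces that arithmetic. It assumes a 640×480 image (only the 640-pixel width is stated in this thread) and that the image is padded to a square before the model sees it. `scale_boxes` is a hypothetical helper for illustration, not the library's implementation.

```python
def scale_boxes(normalized_boxes, scale_w, scale_h):
    """Scale [x_min, y_min, x_max, y_max] boxes from the 0-1 range to pixels."""
    return [
        [x0 * scale_w, y0 * scale_h, x1 * scale_w, y1 * scale_h]
        for x0, y0, x1, y1 in normalized_boxes
    ]


# Assumed original image size; the height is an assumption inferred from the 4/3 ratio.
width, height = 640, 480
side = max(width, height)  # side length of the padded square

# Boxes from the "Got" output above, converted back to the 0-1 range of the padded square.
got_boxes = [[341.67, 23.39, 642.32, 371.35], [6.75, 51.96, 326.62, 473.13]]
normalized = [[v / side for v in box] for box in got_boxes]

# Scaling by (width, height) squashes the y coordinates, giving the "Expected" values.
old_style = scale_boxes(normalized, width, height)

# Scaling both axes by the longer side gives the "Got" values.
new_style = scale_boxes(normalized, side, side)

for old, new in zip(old_style, new_style):
    print("old:", [round(v, 2) for v in old], "new:", [round(v, 2) for v in new])
```

Under these assumptions the "Got" values are the correct ones for the original image, which matches the suggestion that the documentation should be updated rather than the code.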

jla524 · 2024-05-14T01:47:46Z

@ydshieh Yes, I'll send in a PR to fix this.

feat: add note about owlv2

102b633

amyeroberts reviewed May 7, 2024

jla524 added 4 commits May 7, 2024 03:03

fix: post processing coordinates

55aa637

Merge branch 'fix_owlv2_example' of github.com:jla524/transformers in…

bbf3b45

…to fix_owlv2_example

remove: workaround document

cf52310

fix: extra quotes

5e58cbc

jla524 changed the title ~~Add a note about OWLv2 example~~ Fix post processing for the OWLv2 image processor May 7, 2024

update: owlv2 docstrings

0fb1083

jla524 changed the title ~~Fix post processing for the OWLv2 image processor~~ Fix image post-processing for OWLv2 May 7, 2024

fix: copies check

f8d1785

amyeroberts reviewed May 8, 2024

feat: add unit test for resize

49fc94c

amyeroberts reviewed May 9, 2024

Update tests/models/owlv2/test_image_processor_owlv2.py

517869c

Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

amyeroberts approved these changes May 9, 2024

amyeroberts merged commit 218f441 into huggingface:main May 9, 2024
18 checks passed

jla524 deleted the fix_owlv2_example branch May 14, 2024 01:45

jla524 mentioned this pull request May 14, 2024

Fix OWLv2 Doc #30794

Merged

qubvel mentioned this pull request May 28, 2024

Multi input of owlv2 cause RuntimeError: Boolean value of Tensor with more than one value is ambiguous #31077

Closed

4 tasks
