Do not remove half seq length in generation tests #30016

zucchini-nlp · 2024-04-03T10:42:19Z

What does this PR do?

Generation tests divided input ids into half, feeding only half-seq-length for generation. But this strategy may cause difficulties for multimodal models, which have dependency between images count and input ids. For ex, Llava models include special image tokens in input ids, and removing half-seq-length might result in failing test.

This is part of a work to add GenerationTesterMixin in multimodal models, which are not covered with tests now.

This PR:

removes dividing seq length into half
replaces "max length" with "max new tokens" for ease
gets rid of "min length" as we already have "config.eos_token=None" everywhere ensuring that exactly "max length" will be generated

Who can review?

Anyone in the community is free to review the PR once the tests have passed. Feel free to tag
members/contributors who may be interested in your PR.

@gante

HuggingFaceDocBuilderDev · 2024-04-03T11:04:55Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

gante

LGTM, thank you for working on this 🙏

gante · 2024-04-03T14:59:38Z

tests/models/longt5/test_modeling_longt5.py

@@ -752,7 +752,7 @@ def test_attention_outputs(self):

    def _check_encoder_attention_for_generate(self, attentions, batch_size, config, seq_length):
        block_len = getattr(self.model_tester, "block_len", None)
-        encoder_expected_shape = (batch_size, 1, config.num_attention_heads, block_len, 3 * block_len)
+        encoder_expected_shape = (batch_size, 2, config.num_attention_heads, block_len, 3 * block_len)


is this because of the input_ids shape change?

yes, I guess we can hardcode it this way since previously "1" was also hardcoded

no need to add imo

You mean bringing back "1"? That causes test failures...

tests/models/xlnet/test_modeling_xlnet.py

gante · 2024-04-03T15:02:15Z

@zucchini-nlp Can you trigger all tests, to double-check? (i.e. commit message = "[test_all] ...")

Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com>

zucchini-nlp · 2024-04-03T15:27:05Z

All tests are passing, except for the attached PR and another which seemingly failed to download files.

ArthurZucker

thanks for the cleanup overall 😄

tests/generation/test_utils.py

ArthurZucker · 2024-04-05T08:15:14Z

tests/generation/test_utils.py

@@ -691,40 +667,39 @@ def test_model_parallel_beam_search(self):
                new_model.generate(
                    input_ids,
                    attention_mask=attention_mask,
-                    max_length=max_length,
+                    max_new_tokens=self.max_new_tokens,


everywhere we call model.generate, if we pas this max_new_token, why not in the setup update the model.generation_config.max_new_tokens?

Hmm, you mean the "setup" in each model's test file, so that we already have the "max_new_tokens" linked to model? That means we need to initialize model from config when calling _get_input_ids_and_config and then set model.generation_config.max_new_tokens.

I can do it, seems like it will not cause any errors. But I am not sure if that's what you mean

Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

* remove seq length from generation tests * style and quality * [test_all] & PR suggestion Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com> * Update tests/generation/test_utils.py Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com> * [test all] remove unused variables --------- Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com> Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

zucchini-nlp added 2 commits April 3, 2024 12:33

remove seq length from generation tests

e325858

Merge remote-tracking branch 'upstream/main' into generation_test_v2

a03e76b

zucchini-nlp requested a review from gante April 3, 2024 10:42

style and quality

b87495e

zucchini-nlp mentioned this pull request Apr 3, 2024

Fix whisper kwargs and generation config #30018

Merged

gante approved these changes Apr 3, 2024

View reviewed changes

gante requested a review from ArthurZucker April 3, 2024 15:01

[test_all] & PR suggestion

Loading
Loading status checks…

760836b

Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com>

ArthurZucker approved these changes Apr 5, 2024

View reviewed changes

zucchini-nlp and others added 3 commits April 8, 2024 12:26

Update tests/generation/test_utils.py

Loading
Loading status checks…

b8045de

Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

[test all] remove unused variables

Loading
Loading status checks…

ae2108d

Merge remote-tracking branch 'upstream/main' into generation_test_v2

7c6f6ef

zucchini-nlp mentioned this pull request Apr 18, 2024

Add generation tests for multimodal generative models #29853

Closed

gante merged commit b1cd487 into huggingface:main Apr 19, 2024
18 checks passed

gante mentioned this pull request Apr 22, 2024

Jamba: fix left-padding test #30389

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Do not remove half seq length in generation tests #30016

Do not remove half seq length in generation tests #30016

zucchini-nlp commented Apr 3, 2024

HuggingFaceDocBuilderDev commented Apr 3, 2024

gante left a comment

gante Apr 3, 2024

zucchini-nlp Apr 3, 2024 •

edited

Loading

gante Apr 3, 2024

zucchini-nlp Apr 3, 2024

gante commented Apr 3, 2024 •

edited

Loading

zucchini-nlp commented Apr 3, 2024

ArthurZucker left a comment

ArthurZucker Apr 5, 2024

zucchini-nlp Apr 8, 2024

Do not remove half seq length in generation tests #30016

Do not remove half seq length in generation tests #30016

Conversation

zucchini-nlp commented Apr 3, 2024

What does this PR do?

Who can review?

HuggingFaceDocBuilderDev commented Apr 3, 2024

gante left a comment

Choose a reason for hiding this comment

gante Apr 3, 2024

Choose a reason for hiding this comment

zucchini-nlp Apr 3, 2024 • edited Loading

Choose a reason for hiding this comment

gante Apr 3, 2024

Choose a reason for hiding this comment

zucchini-nlp Apr 3, 2024

Choose a reason for hiding this comment

gante commented Apr 3, 2024 • edited Loading

zucchini-nlp commented Apr 3, 2024

ArthurZucker left a comment

Choose a reason for hiding this comment

ArthurZucker Apr 5, 2024

Choose a reason for hiding this comment

zucchini-nlp Apr 8, 2024

Choose a reason for hiding this comment

zucchini-nlp Apr 3, 2024 •

edited

Loading

gante commented Apr 3, 2024 •

edited

Loading