
expose max_decode_len and eos_token_id in decoding #328

Open
gyin94 wants to merge 1 commit into main

Conversation

gyin94 (Contributor) commented Feb 17, 2024

Expose

  • max_decode_len for beam_search to causal_lm module
  • eos_token_id for sample_decode to causal_lm module

Both are already covered by unit tests in decoding_test.py; this change exposes the parameters to the causal_lm and decoder modules.
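
As a rough, purely illustrative sketch of the override behavior (the helper name `resolve_decode_params` and the exact config field names are assumptions, not part of this PR's diff; only the per-request keys and the fallback to `cfg` mirror the docstring in the diff below):

```python
def resolve_decode_params(input_batch: dict, cfg) -> tuple:
    """Hypothetical helper: per-request overrides with config fallback.

    Only `eos_token_id`/`max_decode_len` and the "if not set, use cfg.*"
    behavior follow the diff; everything else here is illustrative.
    """
    # Values supplied in input_batch take precedence; otherwise fall back to cfg.
    eos_token_id = input_batch.get("eos_token_id", cfg.eos_token_id)
    max_decode_len = input_batch.get("max_decode_len", cfg.max_decode_len)
    return eos_token_id, max_decode_len
```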

@@ -157,6 +158,7 @@ def beam_search_decode(
input_batch: a dict with a minimum of the following entries:
prefix: Prompt IDs representing a Tensor of shape [batch, max_sequence_length].
num_decodes: the number of beams to decode.
eos_token_id: The end of sentence token id. If not set, will use cfg.eos_token_id.
A reviewer (Contributor) commented:
Any reason not to just set cfg.eos_token_id directly?

gyin94 (Contributor, author) replied:
Setting cfg.eos_token_id might not be suitable on the inference side, especially when different requests run against the same instance with different EOS token ids.
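
To make that concrete, reusing the hypothetical `resolve_decode_params` sketch from the description above (still not the actual causal_lm API): the same instance, with a fixed config, can serve requests that use different EOS ids.

```python
from types import SimpleNamespace

# Module config is fixed for the lifetime of the serving instance.
cfg = SimpleNamespace(eos_token_id=2, max_decode_len=128)

# One request relies on the config default; the other overrides EOS per request.
request_a = {"prefix": [[1, 5, 9]], "num_decodes": 4}
request_b = {"prefix": [[1, 7, 3]], "num_decodes": 4, "eos_token_id": 50256}

print(resolve_decode_params(request_a, cfg))  # -> (2, 128)
print(resolve_decode_params(request_b, cfg))  # -> (50256, 128)
```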

A reviewer (Contributor) commented:
Is the motivation to customize when to stop decoding? If so, have you considered adding stop_decoding_condition similar to sample_decode?
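
For context on the suggestion above: a stop condition can be expressed as a callable over the decoded sequences. The sketch below only illustrates that idea; the actual stop_decoding_condition interface in axlearn's decoding.py may differ.

```python
import jax.numpy as jnp

def stop_on_eos(eos_token_id: int):
    """Illustrative stop condition: flag sequences that already contain EOS."""

    def condition(sequences: jnp.ndarray) -> jnp.ndarray:
        # sequences: int token ids of shape [..., seq_len]; returns a boolean
        # mask over the leading dims marking sequences that emitted eos_token_id.
        return jnp.any(sequences == eos_token_id, axis=-1)

    return condition
```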

ruomingp (Contributor) left a comment:

Hi Guoli, what's the use case? Should we first discuss in an internal PR?

gyin94 (Contributor, author) commented Feb 26, 2024

> Hi Guoli, what's the use case? Should we first discuss in an internal PR?

Sounds good, let's discuss in an internal PR first.
