add more ctc-loss related summaries #457
Conversation
axlearn/audio/asr_decoder.py
Outdated
jnp.mean(source_lengths), batch_size
),
"input_stats/frame_packing_effiency": WeightedScalar(
    jnp.sum(source_lengths) / input_batch["paddings"].size, input_batch["paddings"].size
Guard against division by 0 here and below?
I thought about this before: the denominator input_batch['paddings'].size is directly pulled from the input. If it is zero, many other places will become problematic well before we hit here (e.g., various normalization layers). Nevertheless, I have added safeguards for them.
Thanks! I don't think we should make assumptions about the other components.
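The guard being requested could be sketched as follows; `safe_ratio` is a hypothetical helper (not a function in axlearn), and the toy shapes are made up for illustration:

```python
import jax.numpy as jnp

def safe_ratio(numerator, denominator):
    """Division that returns 0 instead of inf/nan when the denominator is 0.

    jnp.where evaluates both branches under jit, so the inner jnp.maximum
    keeps the untaken branch from producing inf/nan.
    """
    denominator = jnp.asarray(denominator, dtype=jnp.float32)
    return jnp.where(denominator > 0, numerator / jnp.maximum(denominator, 1e-8), 0.0)

# Toy inputs mirroring the summary above: paddings is [batch, time], 0 = valid.
paddings = jnp.zeros((2, 4))
source_lengths = jnp.array([3, 4])
efficiency = safe_ratio(jnp.sum(source_lengths), paddings.size)  # 7 / 8 = 0.875
```

This way the summary stays finite even for a degenerate empty input, without assuming other layers reject it first.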
),
}
# pytype: enable=attribute-error
return ret_dict
Is there a specific reason to prefer returning the summaries instead of just adding them here?
By returning a dictionary, we can reuse the summary values from the base class to build more specific summaries in the subclass. It also allows us to override some summaries in the subclass.
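The pattern being described might look like the following sketch; BaseDecoder, SubDecoder, and the summary keys are hypothetical names for illustration, not the actual axlearn classes:

```python
import jax.numpy as jnp

class BaseDecoder:
    def _input_stats_summaries(self, source_lengths):
        # Returning a dict (instead of adding summaries in place) lets
        # subclasses reuse, extend, or override individual entries.
        return {"input_stats/average_source_length": jnp.mean(source_lengths)}

class SubDecoder(BaseDecoder):
    def _input_stats_summaries(self, source_lengths):
        summaries = dict(super()._input_stats_summaries(source_lengths))
        # Reuse the base summaries, then add (or overwrite) entries.
        summaries["input_stats/max_source_length"] = jnp.max(source_lengths)
        return summaries

summaries = SubDecoder()._input_stats_summaries(jnp.array([3, 4, 5]))
```

With in-place `add_summary` calls, a subclass could only append; the dict gives it the last word on every key before anything is logged.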
axlearn/audio/asr_decoder.py
Outdated
def _input_stats_summary(
def _input_stats_summary(
def _input_stats_summaries(
or def _add_input_stats_summaries if we decide to inline the add, which may be more similar to other callsites in the repo.
Done.
Have the changes been pushed?
Sorry, I was confused and pushed to the other repo. Just pushed the change.
axlearn/audio/asr_decoder.py
Outdated
per_frame_loss = total_ctc_loss / num_valid_frames
per_label_loss = total_ctc_loss / num_valid_labels
batch_size = per_example_weight.shape[0]
Same here?
Done.
Thanks, a few nits.
per_frame_loss = total_ctc_loss / num_valid_frames
per_label_loss = total_ctc_loss / num_valid_labels
Here too?
Done.
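The guarded form of these two divisions might look like this sketch; `safe_div` is a hypothetical helper, and the values are made up. Note that both counts can legitimately be 0 when every frame or label in the batch is padded:

```python
import jax.numpy as jnp

def safe_div(numerator, denominator):
    # Returns 0 rather than nan/inf when the denominator is 0; the inner
    # jnp.maximum keeps the untaken jnp.where branch finite under jit.
    return jnp.where(denominator > 0, numerator / jnp.maximum(denominator, 1e-8), 0.0)

total_ctc_loss = jnp.asarray(12.0)
num_valid_frames = jnp.asarray(0.0)  # e.g. a batch where every frame is padded
num_valid_labels = jnp.asarray(6.0)

per_frame_loss = safe_div(total_ctc_loss, num_valid_frames)  # 0.0 instead of inf
per_label_loss = safe_div(total_ctc_loss, num_valid_labels)  # 2.0
```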
axlearn/audio/asr_decoder.py
Outdated
valid_label_mask = (1.0 - target_paddings) * per_example_weight[:, None]
num_valid_frames = jnp.sum(valid_frame_mask)
num_valid_labels = jnp.sum(valid_label_mask)
num_valid_examples = jnp.maximum(per_example_weight.sum(), 1.0)
A couple nits -- since we sum over weights, 1.0 may not always be appropriate. We might also consider renaming num_valid_examples to total_example_weight.
Thanks for the suggestion.
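The reviewer's point can be illustrated with fractional weights: clamping the weight sum to 1.0 silently rescales the weighted mean whenever the total weight falls in (0, 1). A small sketch with made-up values:

```python
import jax.numpy as jnp

per_example_weight = jnp.array([0.25, 0.25])  # fractional weights, sum = 0.5
per_example_loss = jnp.array([2.0, 4.0])

weighted_loss_sum = jnp.sum(per_example_loss * per_example_weight)  # 1.5
total_example_weight = per_example_weight.sum()

# Clamping with jnp.maximum(..., 1.0) distorts the mean when 0 < sum < 1:
clamped_mean = weighted_loss_sum / jnp.maximum(total_example_weight, 1.0)  # 1.5
# Guarding only against an exact-zero weight sum preserves the true mean:
guarded_mean = jnp.where(
    total_example_weight > 0,
    weighted_loss_sum / jnp.maximum(total_example_weight, 1e-8),
    0.0,
)  # 3.0
```

The rename to total_example_weight also makes it explicit that the quantity is a weight sum, not an integer count.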
Add more detailed loss summaries to ease debugging.