Implement Huber loss #1444

Merged: 3 commits into tracel-ai:main on Mar 13, 2024

Conversation

WorldSEnder
Contributor

@WorldSEnder WorldSEnder commented Mar 10, 2024

Checklist

  • Confirmed that the `run-checks all` script has been executed.
  • Made sure the book is up to date with changes in this PR.

Related Issues/PRs

Closes #1441; I believe the remaining feature request there, a sign function, is already tracked in #522.

Changes

Implements the Huber Loss function.

Instead of strictly following the definition, which uses a sign or abs function, the implementation uses clamping. This computes the same values outside the delta bounds, is better behaved on the autodiff backend, and needs no extra primitive ops. See also #1441 for my first attempt at implementing this.
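
For illustration, a minimal scalar sketch of the clamping identity (the PR itself operates on tensors, so this is not its actual code): with c = clamp(r, -delta, delta), the product c * (r - 0.5 * c) equals 0.5 * r^2 inside the bounds and delta * (|r| - 0.5 * delta) outside, which is exactly the Huber loss.

```rust
/// Scalar sketch of the Huber loss via clamping; hypothetical and for
/// illustration only, since the actual implementation works on tensors.
fn huber(residual: f64, delta: f64) -> f64 {
    let c = residual.clamp(-delta, delta);
    // |r| <= delta: c == r, so this is 0.5 * r^2 (quadratic branch).
    // |r| >  delta: c == ±delta, so this is delta * (|r| - 0.5 * delta)
    // (linear branch), with no sign or abs primitive needed.
    c * (residual - 0.5 * c)
}
```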

Testing

Test data covers all relevant branches of the operation, as well as the critical points on the autodiff backend, i.e. zero residuals and the point where the loss switches between branches. Test assertions were generated by executing the equivalent computation in SciPy.
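
For example, a hypothetical set of scalar test points covering the zero residual, the switch at |r| == delta, and both sides of the linear branch (using the huber sketch above; the expected values agree with scipy.special.huber(delta, r)):

```rust
#[test]
fn huber_covers_both_branches() {
    let delta = 0.5;
    // (residual, expected loss)
    let cases = [
        (0.0, 0.0),    // zero residual
        (0.5, 0.125),  // 0.5 * 0.5^2: end of the quadratic branch
        (-0.5, 0.125),
        (2.0, 0.875),  // 0.5 * (2.0 - 0.25): linear branch
        (-2.0, 0.875),
    ];
    for (r, expected) in cases {
        assert!((huber(r, delta) - expected).abs() < 1e-12);
    }
}
```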

Note: the test_downsample_interpolation test in nearest_interpolate.rs is failing locally for me. It is not caused by this patch; I ignored it when running run-checks.

Commit: "Instead of using a sign or abs function, uses clamping to compute it outside the bounds. This is better for the autodiff backend."
@antimora
Collaborator

Submitted a PR for sign tensor operator: #1446

@WorldSEnder
Contributor Author

Note: I think the clamping method actually works out better for Huber than using sign, since it avoids a mul_scalar(delta). I'm still interested in sign for alternative losses such as SmoothL1, though.
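
To make that concrete: differentiating c * (r - 0.5 * c) with c = clamp(r, -delta, delta) gives clamp(r, -delta, delta) itself as the gradient (quadratic branch: r; linear branch: ±delta), whereas a sign-based form would need sign(r) followed by a mul_scalar(delta). A scalar sketch, again hypothetical:

```rust
/// Derivative of the Huber loss w.r.t. the residual (scalar sketch).
/// A sign-based formulation would compute residual.signum() * delta on
/// the linear branch; the clamp form yields the same value in one op.
fn huber_grad(residual: f64, delta: f64) -> f64 {
    residual.clamp(-delta, delta)
}
```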

@antimora
Collaborator

CI failed due to:

```
test tests::jit_fusion::var::tests::test_var_mean_bias ... ok

failures:

---- tests::jit::kernel::normal::tests::subsequent_calls_give_different_tensors stdout ----
thread 'tests::jit::kernel::normal::tests::subsequent_calls_give_different_tensors' panicked at crates/burn-wgpu/src/lib.rs:73:5:
assertion failed: tensor_1.to_data().value[i] != tensor_2.to_data().value[i]

failures:
    tests::jit::kernel::normal::tests::subsequent_calls_give_different_tensors

test result: FAILED. 1341 passed; 1 failed; 0 ignored; 0 measured; 0 filtered out; finished in 55.86s
```

Rerunning to see if it's a fluke. Tagging @nathanielsimard and @louisfd since they're currently working in this area.

codecov bot commented Mar 10, 2024

Codecov Report

Attention: Patch coverage is 96.93878%, with 3 lines in your changes missing coverage. Please review.

Project coverage is 85.97%. Comparing base (4ed90a9) to head (7d7b181).
Report is 25 commits behind head on main.

| Files | Patch % | Lines |
| --- | --- | --- |
| crates/burn-core/src/nn/loss/huber.rs | 96.93% | 3 Missing ⚠️ |
Additional details and impacted files
```
@@            Coverage Diff             @@
##             main    #1444      +/-   ##
==========================================
+ Coverage   85.81%   85.97%   +0.15%     
==========================================
  Files         610      646      +36     
  Lines       70417    71847    +1430     
==========================================
+ Hits        60428    61769    +1341     
- Misses       9989    10078      +89     
```


@antimora
Collaborator

Sign tensor op PR is merged.

@antimora antimora requested a review from laggui March 11, 2024 15:41
Member

@nathanielsimard nathanielsimard left a comment

LGTM @louisfd for further review.

Member

@louisfd louisfd left a comment

LGTM, but can you clarify the math in the comments? I think it works out, but it's a bit confusing; in particular, I think r, err, and res are three names for the same thing?

@WorldSEnder
Contributor Author

> LGTM, but can you clarify the math in the comments? I think it works out, but it's a bit confusing; in particular, I think r, err, and res are three names for the same thing?

Ah yes, I initially wanted to use "error" for the difference between targets and predictions, then remembered the better term residuals and I guess didn't catch a few things when renaming. Will fix.

@louisfd
Member

louisfd commented Mar 12, 2024

Thanks, we can merge once CI passes.

@antimora antimora merged commit 53eb3ec into tracel-ai:main Mar 13, 2024
14 checks passed
@WorldSEnder WorldSEnder deleted the huber-loss branch March 13, 2024 18:01
Linked issue closed by this pull request: Problematic derivative of Tensor::abs and Huber loss (#1441)