Add f16 support in the wgpu backend #1582

p1-0tr · 2024-04-07T12:09:19Z

Pull Request Template

Checklist

Confirmed that the run-checks all script has been executed.
Made sure the book is up to date with changes in this PR.

Related Issues/PRs

Split from: #1475
Dawn support: #1583

Changes

The burn-wgpu backend currently does not support computations on 16-bit floats. This, for example, limits the ability to run LLMs on top of Burn on widely available hardware. So, add 16-bit float support in burn-wgpu.

Testing

I applied this change on top of my changes that add the ability to run with Dawn instead of wgpu (#1583), and used the result to run llama2-burn with f16.

The burn-wgpu backend currently does not support computations on 16-bit floats. This, for example, limits the ability to run LLMs on top of Burn on widely available hardware. So, add 16-bit float support in burn-wgpu.

Signed-off-by: Piotr Stankiewicz <piotr.stankiewicz@docker.com>

codecov · 2024-04-08T08:17:05Z

Codecov Report

Attention: Patch coverage is 9.30233%, with 39 lines in your changes missing coverage. Please review.

Project coverage is 86.35%. Comparing base (f3e0aa6) to head (ee7ec2c).
Report is 91 commits behind head on main.

Files	Patch %	Lines
crates/burn-jit/src/element.rs	0.00%	18 Missing ⚠️
crates/burn-wgpu/src/compiler/wgsl/shader.rs	14.28%	6 Missing ⚠️
crates/burn-jit/src/fusion/tracing/builder.rs	0.00%	5 Missing ⚠️
crates/burn-wgpu/src/compiler/wgsl/base.rs	0.00%	3 Missing ⚠️
crates/burn-wgpu/src/element.rs	0.00%	3 Missing ⚠️
crates/burn-wgpu/src/compiler/wgsl/compiler.rs	60.00%	2 Missing ⚠️
crates/burn-jit/src/codegen/dialect/gpu/shader.rs	0.00%	1 Missing ⚠️
crates/burn-jit/src/codegen/kernel.rs	0.00%	1 Missing ⚠️

Additional details and impacted files

@@            Coverage Diff             @@
##             main    #1582      +/-   ##
==========================================
- Coverage   86.39%   86.35%   -0.05%     
==========================================
  Files         688      688              
  Lines       78676    78718      +42     
==========================================
+ Hits        67974    67977       +3     
- Misses      10702    10741      +39

nathanielsimard · 2024-04-11T13:13:25Z

crates/burn-jit/src/codegen/dialect/gpu/shader.rs

@@ -21,6 +21,7 @@ pub enum Visibility {
 #[allow(missing_docs)]
 pub enum Elem {
    Float,
+    Half,


There is no need to add Half here, Float should cover all float types of all precisions in this context.

nathanielsimard · 2024-04-11T13:14:37Z

crates/burn-jit/src/element.rs

+    fn gpu_elem() -> gpu::Elem {
+        gpu::Elem::Half
+    }


The gpu element would be Float here.

nathanielsimard · 2024-04-11T13:16:04Z

crates/burn-wgpu/src/compiler/wgsl/compiler.rs

+        let features = match F::gpu_elem() {
+            gpu::Elem::Half => vec![wgsl::Feature::ShaderF16],
+            _ => vec![],
+        };


I would check using F::wgpu_elem() == Elem::F16 instead.
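The suggestion above can be sketched as follows. This is a minimal, self-contained illustration rather than the actual burn-wgpu code: the `Elem` and `Feature` enums, the `FloatElement` trait, the `Half` and `Single` marker types, and the `required_features` helper are all hypothetical stand-ins, named only to mirror the identifiers quoted in the review.

```rust
// Hypothetical stand-in for the WGSL-level element type (assumption, not the real enum).
#[derive(Debug, Clone, Copy, PartialEq, Eq)]
pub enum Elem {
    F16,
    F32,
    I32,
    U32,
    Bool,
}

// Hypothetical stand-in for the shader features a kernel may request.
#[derive(Debug, Clone, Copy, PartialEq, Eq)]
pub enum Feature {
    ShaderF16,
}

// Hypothetical trait: each float element type reports its WGSL element.
pub trait FloatElement {
    fn wgpu_elem() -> Elem;
}

// Marker types standing in for a 16-bit and a 32-bit float element.
pub struct Half;
pub struct Single;

impl FloatElement for Half {
    fn wgpu_elem() -> Elem {
        Elem::F16
    }
}

impl FloatElement for Single {
    fn wgpu_elem() -> Elem {
        Elem::F32
    }
}

// Decide which features to enable by comparing the WGSL-level element,
// so the backend-agnostic element enum does not need a `Half` variant.
pub fn required_features<F: FloatElement>() -> Vec<Feature> {
    if F::wgpu_elem() == Elem::F16 {
        vec![Feature::ShaderF16]
    } else {
        Vec::new()
    }
}
```

With these stand-ins, `required_features::<Half>()` yields a vector containing `Feature::ShaderF16`, while `required_features::<Single>()` yields an empty vector.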

nathanielsimard · 2024-04-11T13:17:06Z

crates/burn-wgpu/src/compiler/wgsl/compiler.rs

            gpu::Elem::Float => F::wgpu_elem(),
+            gpu::Elem::Half => F::wgpu_elem(),


This line pretty much explains why we don't need Half in the gpu::Elem enum :)
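To illustrate the point made in these review comments, here is a rough sketch of a two-level element mapping. Everything in it is an assumption for illustration: the `gpu` and `wgsl` modules, their `Elem` enums, the `FloatElement` trait, and `compile_elem` are simplified stand-ins, and the integer and boolean arms are hard-coded here only to keep the example short.

```rust
// Backend-agnostic element kinds (hypothetical stand-in): one `Float` for all precisions.
pub mod gpu {
    #[derive(Debug, Clone, Copy, PartialEq, Eq)]
    pub enum Elem {
        Float,
        Int,
        UInt,
        Bool,
    }
}

// WGSL-level element types (hypothetical stand-in): precision is explicit here.
pub mod wgsl {
    #[derive(Debug, Clone, Copy, PartialEq, Eq)]
    pub enum Elem {
        F16,
        F32,
        I32,
        U32,
        Bool,
    }
}

// Hypothetical trait: every float type is `gpu::Elem::Float`,
// and only `wgpu_elem` distinguishes f16 from f32.
pub trait FloatElement {
    fn gpu_elem() -> gpu::Elem {
        gpu::Elem::Float
    }
    fn wgpu_elem() -> wgsl::Elem;
}

pub struct Half;
pub struct Single;

impl FloatElement for Half {
    fn wgpu_elem() -> wgsl::Elem {
        wgsl::Elem::F16
    }
}

impl FloatElement for Single {
    fn wgpu_elem() -> wgsl::Elem {
        wgsl::Elem::F32
    }
}

// Lower a backend-agnostic element to a WGSL element. The float precision
// comes from the type parameter `F`, so a separate `Half` arm would only
// repeat the `Float` arm.
pub fn compile_elem<F: FloatElement>(elem: gpu::Elem) -> wgsl::Elem {
    match elem {
        gpu::Elem::Float => F::wgpu_elem(),
        gpu::Elem::Int => wgsl::Elem::I32,
        gpu::Elem::UInt => wgsl::Elem::U32,
        gpu::Elem::Bool => wgsl::Elem::Bool,
    }
}
```

Under these assumptions, `compile_elem::<Half>(gpu::Elem::Float)` resolves to `wgsl::Elem::F16` and `compile_elem::<Single>(gpu::Elem::Float)` resolves to `wgsl::Elem::F32`, without any precision-specific variant in `gpu::Elem`.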

github-actions · 2024-05-12T12:07:27Z

This PR has been marked as stale because it has not been updated for over a month.

p1-0tr force-pushed the ps-wgpu-f16 branch from 761a22a to ee7ec2c Compare April 7, 2024 12:50

nathanielsimard requested changes Apr 11, 2024

github-actions bot added the stale The issue or pr has been open for too long label May 12, 2024
