
f16 support #172

Open
SludgePhD opened this issue Jun 4, 2023 · 2 comments

Comments

@SludgePhD (Contributor)

Network weights can be stored as f16 floats, halving the size of the network, which is often very desirable.

It would be nice if wonnx could support loading networks that do that. WebGPU has native support for f16 "half-precision" floats, so all GPU buffers could store them natively. Rust does not, however, so all network inputs and outputs would have to be converted.
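Since Rust has no built-in f16 type, callers would need a host-side conversion before uploading inputs to an f16 GPU buffer. A rough sketch of that direction, using only bit manipulation (it rounds toward zero and ignores subnormal results; production code would more likely use the `half` crate):

```rust
/// Convert an f32 to an IEEE 754 half-precision (f16) bit pattern.
/// Sketch only: truncates the mantissa (round toward zero) and flushes
/// values too small for a normal f16 to signed zero.
fn f32_to_f16_bits(value: f32) -> u16 {
    let bits = value.to_bits();
    let sign = ((bits >> 16) & 0x8000) as u16;
    let exp = ((bits >> 23) & 0xff) as i32;
    let frac = bits & 0x7f_ffff;

    if exp == 0xff {
        // Infinity or NaN: preserve the NaN-ness via the f16 quiet bit.
        let f = if frac != 0 { 0x200 } else { 0 };
        return sign | 0x7c00 | f;
    }
    let e = exp - 127 + 15; // rebias exponent from f32 (127) to f16 (15)
    if e >= 0x1f {
        return sign | 0x7c00; // magnitude too large: overflow to infinity
    }
    if e <= 0 {
        return sign; // too small for a normal f16: flush to signed zero
    }
    sign | ((e as u16) << 10) | ((frac >> 13) as u16)
}

fn main() {
    assert_eq!(f32_to_f16_bits(1.0), 0x3c00);
    assert_eq!(f32_to_f16_bits(-2.0), 0xc000);
    assert_eq!(f32_to_f16_bits(65536.0), 0x7c00); // out of f16 range -> inf
    println!("ok");
}
```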

@FL33TW00D

> Network weights can be stored as f16 floats, halving the size of the network, which is often very desirable.
>
> It would be nice if wonnx could support loading networks that do that. WebGPU has native support for f16 "half-precision" floats, so all GPU buffers could store them natively. Rust does not, however, so all network inputs and outputs would have to be converted.

F16 is not supported in Naga yet: gfx-rs/wgpu#4384
It's also not shipped in Chrome yet: https://bugs.chromium.org/p/dawn/issues/detail?id=1775&q=f16&can=2

@SludgePhD (Contributor, Author)

Ah, that's unfortunate. In that case, wonnx could still upconvert the f16 weights to f32 when loading such models, so they would at least work.
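The load-time upconversion suggested here can be done without any external dependency by widening each half-precision bit pattern to f32 bit by bit. A minimal sketch (in practice the `half` crate's `f16::to_f32` would likely be used instead):

```rust
/// Convert an IEEE 754 half-precision (f16) bit pattern to f32.
/// The conversion is exact: every f16 value is representable in f32.
fn f16_bits_to_f32(bits: u16) -> f32 {
    let sign = (bits >> 15) as u32;
    let exp = ((bits >> 10) & 0x1f) as u32;
    let frac = (bits & 0x3ff) as u32;

    let f32_bits = match (exp, frac) {
        (0, 0) => sign << 31, // signed zero
        (0, _) => {
            // Subnormal f16: renormalize into f32's (much wider) range.
            let mut e: u32 = 127 - 15 + 1;
            let mut f = frac;
            while f & 0x400 == 0 {
                f <<= 1;
                e -= 1;
            }
            (sign << 31) | (e << 23) | ((f & 0x3ff) << 13)
        }
        (0x1f, 0) => (sign << 31) | (0xff << 23), // infinity
        (0x1f, _) => (sign << 31) | (0xff << 23) | (frac << 13), // NaN
        // Normal number: rebias exponent from f16 (15) to f32 (127)
        // and left-align the 10-bit mantissa in the 23-bit field.
        _ => (sign << 31) | ((exp + 127 - 15) << 23) | (frac << 13),
    };
    f32::from_bits(f32_bits)
}

fn main() {
    assert_eq!(f16_bits_to_f32(0x3c00), 1.0);  // 1.0 in f16
    assert_eq!(f16_bits_to_f32(0xc000), -2.0); // -2.0 in f16
    assert_eq!(f16_bits_to_f32(0x0001), 2.0f32.powi(-24)); // smallest subnormal
    println!("ok");
}
```

Applied across a weight tensor at model load, this would let wonnx accept f16 ONNX files today while still running f32 shaders, at the cost of the doubled in-memory size.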
