[TorchToLinalg] add support for quantized group conv #3341

zjgarvey · 2024-05-14T15:54:40Z

This addresses 7 of the model failures I'm seeing in the test suite. See Shark-Turbine issue #566.

Need the op linalg.conv_2d_ngchw_gfchw_q to be added upstream before merging this. See llvm-project PR #92136 .

A small additional expansion to operand quantization is included in this patch to address a model failure that occurs when unblocking the quantized group convolutions in one of these onnx models.

zjgarvey · 2024-05-17T01:33:10Z

When the upstream patch lands, I'll rebase, and add some tests.

vivekkhandelwal1

LGTM

add quantized group conv

b105b2f

zjgarvey mentioned this pull request May 16, 2024

[ONNX][TorchToLinalg] Add support for dynamic dims in Interpolate lowering #3351

Merged

zjgarvey added 3 commits May 31, 2024 16:56

Merge remote-tracking branch 'upstream/main' into grouped_qconv

65e1beb

add e2e test for quantized group conv

d44b30a

Merge remote-tracking branch 'upstream/main' into grouped_qconv

f695679

zjgarvey requested review from rsuderman, renxida, vivekkhandelwal1 and AmosLewis May 31, 2024 19:11

vivekkhandelwal1 approved these changes Jun 3, 2024

View reviewed changes

vivekkhandelwal1 merged commit 8995c90 into llvm:main Jun 3, 2024
3 checks passed

zjgarvey mentioned this pull request Jun 3, 2024

Quantized Grouped Convolution Support nod-ai/SHARK-Turbine#678

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[TorchToLinalg] add support for quantized group conv #3341

[TorchToLinalg] add support for quantized group conv #3341

zjgarvey commented May 14, 2024 •

edited

zjgarvey commented May 17, 2024

vivekkhandelwal1 left a comment

[TorchToLinalg] add support for quantized group conv #3341

[TorchToLinalg] add support for quantized group conv #3341

Conversation

zjgarvey commented May 14, 2024 • edited

zjgarvey commented May 17, 2024

vivekkhandelwal1 left a comment

Choose a reason for hiding this comment

zjgarvey commented May 14, 2024 •

edited