Add inline tags to UninitSlice methods #443

danburkert · 2020-11-09T08:31:48Z

This appears to be the primary cause of significant performance
regressions in the prost test suite in the 0.5 to 0.6 transition. See
danburkert/prost#381.

This appears to be the primary cause of significant performance regressions in the `prost` test suite in the 0.5 to 0.6 transition. See danburkert/prost#381.

Darksonn

Thanks!

src/buf/uninit_slice.rs

Co-authored-by: Alice Ryhl <alice@ryhl.io>

seanmonstar · 2020-11-09T17:23:53Z

src/buf/uninit_slice.rs

    pub fn write_byte(&mut self, index: usize, byte: u8) {
        assert!(index < self.len());

-        unsafe { self[index..].as_mut_ptr().write(byte) }
+        unsafe { self.as_mut_ptr().add(index).write(byte) }


I'd expect the bounds check to have been elided, since it's asserted just before... It's not?

I didn't check the assembly, but empirically I found in #442 that a similar change had a measurable impact on microbenchmarks in the prost repo. It's very possible that the issue there was that the UninitSlice index call was not inlined, and so the similar double check wasn't eligible to be optimized away. Realistically I'm not going to have time to read the ASM here, but I'm happy to back out this part of the change if you'd prefer that.

Using s[i] vs s.as_ptr().add(i) gets me the same thing with the assert in the function:

example::index: push rax cmp rsi, rdx jbe .LBB7_1 mov al, byte ptr [rdi + rdx] pop rcx ret .LBB7_1: lea rdi, [rip + .L__unnamed_4] call std::panicking::begin_panic ud2 example::ptr: push rax cmp rsi, rdx jbe .LBB8_2 mov al, byte ptr [rdi + rdx] pop rcx ret .LBB8_2: lea rdi, [rip + .L__unnamed_5] call std::panicking::begin_panic ud2

OK, would you like me to switch it back?

It seems reasonable to me to keep to safe things if they make no performance difference.

sounds good, updated.

taiki-e · 2021-04-10T18:20:28Z

Thanks!

This appears to be the primary cause of significant performance regressions in the `prost` test suite in the 0.5 to 0.6 transition. See danburkert/prost#381.

Add inline tags to UninitSlice methods

645bd69

This appears to be the primary cause of significant performance regressions in the `prost` test suite in the 0.5 to 0.6 transition. See danburkert/prost#381.

Darksonn reviewed Nov 9, 2020

View reviewed changes

src/buf/uninit_slice.rs Outdated Show resolved Hide resolved

Update src/buf/uninit_slice.rs

70c5525

Co-authored-by: Alice Ryhl <alice@ryhl.io>

danburkert mentioned this pull request Nov 9, 2020

Update bytes to 0.6 tokio-rs/prost#387

Merged

seanmonstar reviewed Nov 9, 2020

View reviewed changes

ptr::add -> index

7c8af6d

taiki-e approved these changes Apr 10, 2021

View reviewed changes

taiki-e merged commit 3d5624a into tokio-rs:master Apr 10, 2021

Darksonn mentioned this pull request Aug 25, 2021

Prepare bytes v1.1.0 #509

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add inline tags to UninitSlice methods #443

Add inline tags to UninitSlice methods #443

danburkert commented Nov 9, 2020

Darksonn left a comment

seanmonstar Nov 9, 2020

danburkert Nov 9, 2020

seanmonstar Nov 9, 2020

danburkert Nov 9, 2020

seanmonstar Nov 9, 2020

danburkert Nov 9, 2020

taiki-e commented Apr 10, 2021

Add inline tags to UninitSlice methods #443

Add inline tags to UninitSlice methods #443

Conversation

danburkert commented Nov 9, 2020

Darksonn left a comment

Choose a reason for hiding this comment

seanmonstar Nov 9, 2020

Choose a reason for hiding this comment

danburkert Nov 9, 2020

Choose a reason for hiding this comment

seanmonstar Nov 9, 2020

Choose a reason for hiding this comment

danburkert Nov 9, 2020

Choose a reason for hiding this comment

seanmonstar Nov 9, 2020

Choose a reason for hiding this comment

danburkert Nov 9, 2020

Choose a reason for hiding this comment

taiki-e commented Apr 10, 2021