Questions about non-atomic instructions like `i32.store` #197

yamt · 2023-03-22T06:43:03Z

when i32.store writes data to a shared memory, it's performed with a wr action. (https://webassembly.github.io/threads/core/exec/instructions.html#t-mathsf-xref-syntax-instructions-syntax-instr-memory-mathsf-store-n-xref-syntax-instructions-syntax-memarg-mathit-memarg)

those events are atomically performed according to https://webassembly.github.io/threads/core/exec/runtime.html#events.

thus, if a runtime implements atomic instructions like i32.atomic.rmw.cmpxchg via a lock, non-atomic instructions like i32.store should take the lock too, at least when operating on a shared memory.
is it the correct reading of the spec?

background:
some applications (eg. musl) implement a mutex with atomic cmpxchg for lock and ordinary store + barrier for unlock. as far as i know, it's fine for eg. x86. however, a naive port to wasm might or might not cause problems.
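
For concreteness, the pattern described in the background can be sketched in C roughly as follows. This is an illustrative reconstruction, not musl's actual code: `naive_mutex`, `naive_trylock`, and `naive_unlock` are hypothetical names, and the GCC/Clang `__sync_*` builtins are assumed as stand-ins for whatever primitives a real implementation uses.

```c
/* Sketch of the lock pattern in question; all names are hypothetical. */
typedef struct {
    volatile int state; /* 0 = unlocked, 1 = locked */
} naive_mutex;

/* Lock side: atomic compare-and-swap (GCC/Clang builtin).
 * Returns 1 if the lock was acquired, 0 otherwise. */
static int naive_trylock(naive_mutex *m) {
    return __sync_val_compare_and_swap(&m->state, 0, 1) == 0;
}

/* Unlock side: ordinary store followed by a full barrier. */
static void naive_unlock(naive_mutex *m) {
    m->state = 0;          /* plain, non-atomic store */
    __sync_synchronize();  /* full memory barrier */
}
```

The unlock path is the part whose behaviour on a Wasm shared memory is being asked about.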

conrad-watt · 2023-03-22T09:37:38Z

Sorry, the use of the word "atomic" in that link is probably misleading and I'll make a note to change it.

The intention is that, if the atomic operations (in the sense of X.atomic.XXX) are implemented using locks, non-atomic operations can still be implemented without using locks, so long as aligned accesses don't tear (which is true on most "regular" architectures, but may not be true in some special environments).
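
A minimal sketch of the implementation strategy described here, under several assumptions: a hypothetical interpreter (the `interp_*` functions and the single global `mem_lock` are invented for illustration and not taken from any real runtime), a little-endian host, and a host compiler that turns an aligned 32-bit `memcpy` into a single non-tearing store.

```c
#include <pthread.h>
#include <stdint.h>
#include <string.h>

/* One global lock guarding every atomic operation (hypothetical design). */
static pthread_mutex_t mem_lock = PTHREAD_MUTEX_INITIALIZER;

/* Lock-based compare-and-exchange: returns the value found at addr and
 * writes `replacement` only if that value equals `expected`. */
static uint32_t interp_i32_atomic_rmw_cmpxchg(uint8_t *mem, uint32_t addr,
                                              uint32_t expected,
                                              uint32_t replacement) {
    uint32_t old;
    pthread_mutex_lock(&mem_lock);
    memcpy(&old, mem + addr, sizeof old);
    if (old == expected) {
        memcpy(mem + addr, &replacement, sizeof replacement);
    }
    pthread_mutex_unlock(&mem_lock);
    return old;
}

/* Non-atomic store: deliberately does NOT take mem_lock. */
static void interp_i32_store(uint8_t *mem, uint32_t addr, uint32_t value) {
    memcpy(mem + addr, &value, sizeof value);
}
```

Because `interp_i32_store` never takes `mem_lock`, a racing plain store is not serialised against the lock-based atomic operations.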

yamt · 2023-03-22T09:51:05Z

> Sorry, the use of the word "atomic" in that link is probably misleading and I'll make a note to change it.
>
> The intention is that, if the atomic operations (in the sense of X.atomic.XXX) are implemented using locks, non-atomic operations can still be implemented without using locks, so long as aligned accesses don't tear (which is true on most "regular" architectures, but may not be true in some special environments).

ok.
then, in wasm, the way to implement a mutex mentioned in "background" in the description is not expected to work?

conrad-watt · 2023-03-22T09:59:24Z

Are you referring to this one here? At a rough glance it seems to me that it's correct. It would be incorrect if any of the accesses involved in its implementation were non-atomic.

Note that, if atomics are implemented using locks, any wait and notify instructions would also need to acquire those locks.

yamt · 2023-03-22T10:14:00Z

> Are you referring to this one here? At a rough glance it seems to me that it's correct. It would be incorrect if any of the accesses involved in its implementation were non-atomic.

no. i meant the one in the description of this issue:

> background:
> some applications (eg. musl) implement a mutex with atomic cmpxchg for lock and ordinary store + barrier for unlock. as far as i know, it's fine for eg. x86. however, a naive port to wasm might or might not cause problems.

> Note that, if atomics are implemented using locks, any wait and notify instructions would also need to acquire those locks.

sure.

conrad-watt · 2023-03-22T10:22:08Z

> some applications (eg. musl) implement a mutex with atomic cmpxchg for lock and ordinary store + barrier for unlock. as far as i know, it's fine for eg. x86. however, a naive port to wasm might or might not cause problems.

Ah, I'm sorry for looking in the wrong place. Yes, this wouldn't work in Wasm. Wasm non-atomics and fences should be thought of as more C/C++-style than x86-style, and it wouldn't be safe in C/C++ to implement a mutex unlock with non-atomic store + fence.

Just an additional quick note, since I've been reading the conversations on the other thread (WebAssembly/wasi-libc#403)

> but i noticed that it's actually more about the cmpxchg implementation than i32.store.
> when cmpxchg is implemented with a lock as it is in wamr interpreter, i32.store can be executed in the middle of cmpxchg and effectively break the lock as you say.

It is our intention to allow this behaviour in Wasm. If a non-atomic store races with an atomic rmw/cas, it's not guaranteed that the rmw/cas will observably "act" like an atomic swap (C/C++ analogy - this is a data race with undefined behaviour). The correct thing to do is perform an atomic store instead of a non-atomic store (as I think you already observed)
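
As a rough illustration of that fix in C11 terms, the unlock can use an atomic release store instead of a plain store plus fence. The names `demo_mutex`, `demo_trylock`, `demo_lock`, and `demo_unlock` are hypothetical, and this is a bare spinlock sketch rather than a full mutex (no wait/notify). A toolchain targeting Wasm threads would be expected to lower the atomic store to an atomic instruction such as `i32.atomic.store`, though that depends on the compiler and flags.

```c
#include <stdatomic.h>

/* Hypothetical spinlock: 0 = unlocked, 1 = locked. */
typedef struct {
    atomic_int state;
} demo_mutex;

/* Try to take the lock with an atomic compare-and-exchange.
 * Returns nonzero on success. */
static int demo_trylock(demo_mutex *m) {
    int expected = 0;
    return atomic_compare_exchange_strong_explicit(
        &m->state, &expected, 1,
        memory_order_acquire, memory_order_relaxed);
}

/* Spin until the lock is acquired. */
static void demo_lock(demo_mutex *m) {
    while (!demo_trylock(m)) {
        /* busy-wait; a real mutex would block instead */
    }
}

/* Release with an ATOMIC store, so it cannot race with the
 * compare-and-exchange in the data-race sense. */
static void demo_unlock(demo_mutex *m) {
    atomic_store_explicit(&m->state, 0, memory_order_release);
}
```

Every access to `state` is atomic here, so the compare-and-exchange never races with a non-atomic write.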

yamt · 2023-03-22T10:36:23Z

> > some applications (eg. musl) implement a mutex with atomic cmpxchg for lock and ordinary store + barrier for unlock. as far as i know, it's fine for eg. x86. however, a naive port to wasm might or might not cause problems.
>
> Ah, I'm sorry for looking in the wrong place. Yes, this wouldn't work in Wasm. Wasm non-atomics and fences should be thought of as more C/C++-style than x86-style, and it wouldn't be safe in C/C++ to implement a mutex unlock with non-atomic store + fence.

ok.

(a bit off-topic: while i'm not familiar with C/C++ style atomics, i suspect it compiles to x86 atomics on x86 and thus has basically compatible semantics, doesn't it?)

> Just an additional quick note, since I've been reading the conversations on the other thread (WebAssembly/wasi-libc#403)
>
> > but i noticed that it's actually more about the cmpxchg implementation than i32.store.
> > when cmpxchg is implemented with a lock as it is in wamr interpreter, i32.store can be executed in the middle of cmpxchg and effectively break the lock as you say.
>
> It is our intention to allow this behaviour in Wasm. If a non-atomic store races with an atomic rmw/cas, it's not guaranteed that the rmw/cas will observably "act" like an atomic swap (C/C++ analogy - this is a data race with undefined behaviour). The correct thing to do is perform an atomic store instead of a non-atomic store (as I think you already observed)

ok. i got the intention.
the current wording in the spec is very misleading as it's somehow clearly stating that the only differences between atomic and non-atomic ops are memory ordering and alignment check:
https://webassembly.github.io/threads/core/exec/instructions.html#exec-atomic-store

conrad-watt · 2023-03-22T10:49:43Z

> (a bit off-topic: while i'm not familiar with C/C++ style atomics, i suspect it compiles to x86 atomics on x86 and thus has basically compatible semantics, doesn't it?)

C/C++ atomics (and Wasm atomics!) need to abstract over a bunch of different possible implementations, including ones using locks, hence the "weaker" guarantees at the spec level. If one knows that one's C/C++ is definitely compiling to x86, one can try to make additional assumptions based on this, but this is hazardous because of possible compiler optimisations etc.
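
One place where this abstraction is visible in C11 itself is the set of `ATOMIC_*_LOCK_FREE` macros in `<stdatomic.h>`, which let an implementation report that a given atomic type is never, sometimes, or always lock-free. The small helper below is only a sketch for inspecting them; `lock_free_description` and `print_lock_free_report` are hypothetical names.

```c
#include <stdatomic.h>
#include <stdio.h>

/* Map a C11 ATOMIC_*_LOCK_FREE macro value to a description.
 * The standard defines 0, 1, and 2. */
static const char *lock_free_description(int macro_value) {
    switch (macro_value) {
    case 0:  return "never lock-free";
    case 1:  return "sometimes lock-free";
    case 2:  return "always lock-free";
    default: return "unknown";
    }
}

/* Print what this implementation reports for two common types. */
static void print_lock_free_report(void) {
    printf("atomic_int:   %s\n",
           lock_free_description(ATOMIC_INT_LOCK_FREE));
    printf("atomic_llong: %s\n",
           lock_free_description(ATOMIC_LLONG_LOCK_FREE));
}
```

Portable code cannot assume the "always lock-free" answer, which is one reason the language-level guarantees are weaker than what a specific CPU provides.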

> the current wording in the spec is very misleading as it's somehow clearly stating that the only differences between atomic and non-atomic ops are memory ordering and alignment check:

Iterating on the spec is definitely on my immediate radar (and will be a requirement for us to get this proposal over the line into the W3C standard). Note though that the differences we've been discussing above are captured purely by the difference in memory ordering, so this part of the spec wouldn't change too much - the implications of different memory orders should be explained in more detail in this section, once it's finished.

Just to try and point out other resources, the ECMAScript memory model is essentially identical for the language fragment we've been talking about. I don't know how helpful it is to read though.

yamt · 2023-03-22T11:00:24Z

> > (a bit off-topic: while i'm not familiar with C/C++ style atomics, i suspect it compiles to x86 atomics on x86 and thus has basically compatible semantics, doesn't it?)
>
> C/C++ atomics (and Wasm atomics!) need to abstract over a bunch of different possible implementations, including ones using locks, hence the "weaker" guarantees at the spec level. If one knows that one's C/C++ is definitely compiling to x86, one can try to make additional assumptions based on this, but this is hazardous because of possible compiler optimisations etc.

ok.

> > the current wording in the spec is very misleading as it's somehow clearly stating that the only differences between atomic and non-atomic ops are memory ordering and alignment check:
>
> Iterating on the spec is definitely on my immediate radar (and will be a requirement for us to get this proposal over the line into the W3C standard). Note though that the differences we've been discussing above are captured purely by the difference in memory ordering, so this part of the spec wouldn't change too much - the implications of different memory orders should be explained in more detail in this section, once it's finished.

i usually consider that the interlock behavior is a separate topic from memory ordering. but ok.

> Just to try and point out other resources, the ECMAScript memory model is essentially identical for the language fragment we've been talking about. I don't know how helpful it is to read though.

thank you for the link.

yamt mentioned this issue Mar 22, 2023

Fix a_store operation in atomic.h WebAssembly/wasi-libc#403

Merged
