io: add `Ready::ERROR` and report error readiness #5781

folkertdev · 2023-06-08T21:13:14Z

Motivation

The motivation is to await messages arriving in a UdpSocket's error queue. The error queue is a mechanism to asynchronously process errors on a socket. By separating them from the normal queue, error information does not interfere with standard IO, but at the same time better error messages can be given than just a single integer as the result of send/receive.

This potentially has many use cases. My particular one is awaiting when a send timestamp is available.

Solution

The solution is to provide Ready::ERROR. This is already supported in mio, but so far was not exposed in tokio. In combination with AsyncFd::ready with some arbitrary interest, this can be used to await error readiness.

One potentially controversial change is that error readiness is reported regardless of interest. That is in practice how error readiness works on operating systems: errors are always reported, no matter what the configured interest is.

            // error readiness is reported regardless of interest
            ready |= Ready::ERROR;

Testing

Send timestamping, my primary use case, only works on linux. I've provided a test for it, but it is only compiled/executed on linux.

I've also included a test that sends a UDP packet to a local socket where nobody is listening. Normally this send operation would just succeed, but by setting the IP_RECVERR socket option, the sending socket is notified via the error queue that there is nobody listening on the other side. Turns out this test also only works on linux.

I'm not sure what to do about other operating systems. because we're just re-exposing mio functionality here maybe it's fine to just have the linux tests on CI?

folkertdev · 2023-07-03T17:03:09Z

Is this blocked on anything? anything I can do to help it move along?

Darksonn · 2023-07-03T17:28:31Z

Over the past month, I've been busy with other things, but I should be able to look at it soon now that we have entered July.

Darksonn · 2023-07-05T14:13:20Z

tokio/src/io/ready.rs

+        // error readiness is reported regardless of interest
+        ready |= Ready::ERROR;
+


Hmm. Does this mean that the fn ready methods on socket types always return Error unconditionally even if there are no Ready events?

We should probably have a test that Ready is not set if there have been no error events.

the phrasing is misleading I think, I'll improve it. What is happening here is that with this change, an event with the error bit set (i.e. with the error readiness) matches any interest.

so no matter what interest a user configures, an event with error readiness is not discarded. That means the corresponding waker is woken up, and then can deal with the event as they see fit.

We already have some tests that perform an equality check on Ready values, e.g.

#[tokio::test] async fn clear_ready_matching_clears_ready() { use tokio::io::{Interest, Ready}; let (a, mut b) = socketpair(); let afd_a = AsyncFd::new(a).unwrap(); b.write_all(b"0").unwrap(); let mut guard = afd_a .ready(Interest::READABLE | Interest::WRITABLE) .await .unwrap(); // the readiness is just readable and writable, not error assert_eq!(guard.ready(), Ready::READABLE | Ready::WRITABLE); guard.clear_ready_matching(Ready::READABLE); assert_eq!(guard.ready(), Ready::WRITABLE); guard.clear_ready_matching(Ready::WRITABLE); assert_eq!(guard.ready(), Ready::EMPTY); }

This test passes, so the error readiness is not set in this case.

Darksonn · 2023-07-08T15:45:41Z

The implementation looks fine, but my main concern is how the error readiness is always exposed. Are we sure we want to do that? What about portability? Linux with epoll is one thing, but what about kqueue? What about Windows?

carllerche · 2023-07-08T16:08:03Z

I am out of town and reviewing on a mobile device so I may be reading it wrong.

I think the biggest question is whether we should always unconditionally report error readiness without the user asking for it on all platforms. I know that this is what epoll does (error interest is always implicit) but I don’t know if we can maintain that behavior across all platforms (windows, future wasi based platforms, etc…)

The easiest option is to conditionally define Interest::Error on platforms where we can support it and require the user to specify it when awaiting for readiness.

folkertdev · 2023-07-08T16:43:56Z

This is a fair concern. I don't think always reporting the error readiness has real downsides, but may be missing something.

I like the idea of Interest::ERROR as a tokio user, I'd really prefer it, but ran into issues with the implementation. The core issue is that mio does not have a Interest::ERROR because it does not make sense there (this comment explains why tokio-rs/mio#1672 (comment)).

Tokio wraps the mio::Interest, but must be able to convert from the tokio to the mio version to tell mio what to do. If a user would specify just a tokio Interest::ERROR, what interest is mio supposed to register with the OS?

I've written more about it in the related issue #5716 (comment)

Darksonn · 2023-07-08T16:47:48Z

Tokio already supports cases where you are waiting for a different set than registered with mio. For example, if two threads are waiting for read and write readiness respectively, then we register with both on mio, but the read waiter is only notified if mio gives us a read readiness.

Tokio wraps the mio::Interest, but must be able to convert from the tokio to the mio version to tell mio what to do. If a user would specify just a tokio Interest::ERROR, what interest is mio supposed to register with the OS?

If you get the error readiness no matter what, then we can just remove it from the readiness set given to mio.

folkertdev · 2023-07-08T16:53:22Z

ah, interesting. So if I use Interest::ERROR in my user code, behind the scenes this could turn into an arbitrary interest for mio (say Interest::READABLE) but then tokio will filter what it reports (so clearing the readable flag from the event) so that the user-visible event only shows the error readiness.

am I understanding the idea correctly?

Darksonn · 2023-07-08T17:57:02Z

You bring up a good point. How does clearing the error readiness work? How does epoll handle it?

folkertdev · 2023-07-10T09:40:51Z

I've added Interest::ERROR. I've picked this representation

pub struct Interest {
    mio: Option<mio::Interest>,
    has_error_interest: bool,
}

In practice this will take 2 bytes in memory, up from 1 before. This representation makes some existing functions a bit longer, especially because some are const fn and not all helper functions on Option are const stable (or were not const stable in tokio's msrv).

Then, mio has this note on the is_error function on event:

The table below shows what flags are checked on what OS.

OS selector Flag(s) checked

epoll EPOLLERR

kqueue EV_ERROR and EV_EOF with fflags set to 0.

Based on https://docs.rs/mio/latest/mio/struct.Poll.html#implementation-notes only windows would never use the error field (but the function is still available on that platform).

Always providing the error interest and related functions can be convenient to cut down on conditional compilation. On the other hand it can be confusing. I went with just doing what mio does for now, but don't have a strong opinion here.

I've verified that this code works as I'd expect with the current code of this PR

let mut guard1 = async_fd_socket.ready(Interest::ERROR).await.unwrap();
guard1.clear_ready_matching(Ready::ERROR);

// blocks!
let mut guard2 = async_fd_socket.ready(Interest::ERROR).await.unwrap();

but this is all tokio of course. So I ran another test where I send some bytes to the async_fd_socket

let mut guard1 = async_fd_socket.ready(Interest::ERROR).await.unwrap();
guard1.clear_ready_matching(Ready::ERROR);

// .. send bytes to `async_fd_socket`

// blocks!
let mut guard2 = async_fd_socket.ready(Interest::ERROR).await.unwrap();

and that still blocks. So the error readiness does not persist between different returns of epoll_wait.

Is that what you meant with your comment about clearing error readiness @Darksonn?

Darksonn · 2023-07-10T10:04:08Z

That answers how error readiness is cleared for AsyncFd. How about TcpStream? It doesn't have the same clear_ready_matching api.

folkertdev · 2023-07-10T10:24:06Z

At the moment TcpStream is not a problem, because it does not have a public api for registering with a custom interest. #5796 wants to add custom interest functionality, and would run into the same issue for Interest::PRIORITY. So I don't think it's a problem for this PR specifically, but in general it will need a solution.

That solution could be to not allow a custom interest, or some equivalent of clear_ready_matching

Darksonn · 2023-07-10T10:39:40Z

Does that mean that even if an error event occurs, the current implementation will not surface it via the TcpStream api?

folkertdev · 2023-07-10T10:53:42Z

As far as I can see, yes. Because it is not possible to register a tokio TcpStream with Interest::ERROR, the Ready::ERROR readiness would never be visible to the user. it is filtered out by

        if interest.is_error() {
            ready |= Ready::ERROR;
        }

in Ready::from_interest

carllerche

Thanks, it looks mostly good to me. I left a comment on the Interest struct.

I think, for me, the primary question is, from a portability POV, what are the guarantees Tokio provides when one sets Error interest? The fact that error is overloaded makes me uncomfortable with providing this as a portable flag.

e.g. it seems to me that error interest should mean that if one receives error readiness, one can just drop the socket w/o performing further operations on it because the socket is in an error state. This doesn't hold for the case you want to use it for.

One option is, we include it for all platforms and document it as "this passes error interest to the underlying OS selector. Behavior is platform specific, read your platform's documentation"

carllerche · 2023-07-10T18:42:24Z

tokio/src/io/interest.rs

@@ -11,43 +11,59 @@ use std::ops;
 /// I/O resource readiness states.
 #[cfg_attr(docsrs, doc(cfg(feature = "net")))]
 #[derive(Clone, Copy, Eq, PartialEq)]
-pub struct Interest(mio::Interest);
+pub struct Interest {


Could you change this to a usize and define the bits yourself, similar to how Ready does it: https://github.com/tokio-rs/tokio/blob/master/tokio/src/io/ready.rs.

I would prefer not growing the struct size and avoid branches in the is_* checks.

is there a particular reason to choose for usize? Maybe because the interest value doesn't live long and is likely in a register, usize is no worse than u8 (which would currently fit, with 2 bits to spare)?

usize doesn't matter really, it is just what mio::Interest uses.

mio uses a NonZeroU8 https://github.com/tokio-rs/mio/blob/master/src/interest.rs#L17

#[derive(Copy, PartialEq, Eq, Clone, PartialOrd, Ord)] pub struct Interest(NonZeroU8);

folkertdev · 2023-07-10T22:52:02Z

allright, I made the changes and CI is happy again.

For AsyncFd a note about behavior being platform-specific (which I've added) seems fine to me. You're likely looking at those docs because you are doing some lowlevel platform-specific stuff.

If eventually the tokio UdpSocket and TcpStream also get custom interest registeration support, then the situation might get a bit more complicated.

folkertdev · 2023-07-14T08:13:15Z

e.g. it seems to me that error interest should mean that if one receives error readiness, one can just drop the socket w/o performing further operations on it because the socket is in an error state. This doesn't hold for the case you want to use it for.

That's right.

On linux/epoll the error mechanism has been hijacked to provide a sidechannel. The socket can continue to function while error information can be received and processed concurrently. Then the sidechannel is also used for non-error information like the timestamps.
I have very little experience with kqueue, but it looks like EV_ERROR does mean a fatal error. (no memory, invalid file descriptor, etc) https://man.freebsd.org/cgi/man.cgi?query=kqueue&sektion=2#end
On windows, this interest has no effect whatsoever.

Given that, I'd be totally ok with a #cfg(any(target_os = "linux", target_os = "android")] on the error interest in tokio. Linux/epoll is the only tested platform right now, and it is where awaiting error readiness makes most sense. It also means that the documentation can be more targeted.

Lifting the conditional compilation restrictions can always happen at a later time, e.g. if a real usecase for error readiness with kqueue comes up.

does that sound like a good way forward @carllerche ?

folkertdev · 2023-07-24T20:45:29Z

so, how do we make progress here?

tokio/src/io/interest.rs

Darksonn · 2023-08-03T07:33:02Z

tokio/tests/io_async_fd.rs

+        // Set the destination address. This address is invalid in this context. the OS will notice
+        // that nobody is listening on port 1234. Normally this is ignored (UDP is "fire and forget"),
+        // but because IP_RECVERR is enabled, the error will actually be reported to the sending socket
+        let mut dest_addr =
+            unsafe { std::mem::MaybeUninit::<libc::sockaddr_in>::zeroed().assume_init() };
+        dest_addr.sin_family = libc::AF_INET as _;
+        dest_addr.sin_port = 1234u16.to_be(); // Destination port


So this test will fail if some other test decides to use port 1234 for something?

Perhaps we should use a port number less than 1024 to avoid the case where tests using port 0 to pick a port will pick the port used by this test?

yes, testing this is kind of tricky. I've changed the port to 512 and added an explanation.

Darksonn · 2023-08-03T20:42:29Z

tokio/tests/io_async_fd.rs

+    let fd = AsyncFd::new(socket).unwrap();
+
+    let buf = b"hello there";
+    fd.get_ref().send(buf).unwrap();
+
+    // the send timestamp should now be in the error queue
+    let guard = fd.ready(Interest::ERROR).await.unwrap();
+    assert_eq!(guard.ready(), Ready::ERROR);
+}


Could we assert that the error interest is not set before we send the buffer? Currently all of our tests would pass if ERROR is always set.

yes, I added one of those select! with a timer and panic branch that is used in a bunch of the other tests.

carllerche

Looks good to me. Thanks for sticking w/ it!

folkertdev force-pushed the ready-error branch 2 times, most recently from 67f9465 to 5cd4706 Compare June 9, 2023 08:32

Darksonn added A-tokio Area: The main tokio crate M-io Module: tokio/io labels Jun 9, 2023

folkertdev mentioned this pull request Jun 10, 2023

io: make EPOLLERR awake AsyncFd::readable #4444

Closed

folkertdev changed the title ~~Report error readiness~~ io: add Ready::ERROR and report error readiness Jun 22, 2023

folkertdev force-pushed the ready-error branch 2 times, most recently from bcbb35e to 2f6b71f Compare July 3, 2023 16:35

Darksonn reviewed Jul 5, 2023

View reviewed changes

folkertdev requested a review from Darksonn July 8, 2023 15:31

folkertdev force-pushed the ready-error branch from eaefb1a to 952f386 Compare July 10, 2023 09:39

carllerche reviewed Jul 10, 2023

View reviewed changes

folkertdev force-pushed the ready-error branch 2 times, most recently from b1f4652 to e4e4a03 Compare July 10, 2023 21:16

folkertdev requested a review from carllerche July 18, 2023 08:31

Darksonn reviewed Jul 25, 2023

View reviewed changes

tokio/src/io/interest.rs Outdated Show resolved Hide resolved

tokio/src/io/interest.rs Outdated Show resolved Hide resolved

folkertdev force-pushed the ready-error branch from e4e4a03 to fec2bce Compare July 25, 2023 09:22

Darksonn mentioned this pull request Aug 3, 2023

io: add Interest::remove method #5906

Merged

Darksonn reviewed Aug 3, 2023

View reviewed changes

folkertdev added 8 commits August 3, 2023 18:49

add Ready::ERROR

91730b7

add tests for error readiness

10a25f4

clarify comment

ac71aac

add Interest::ERROR

4ff6595

Interest as usize

846a6e2

use option to accumulate mio Interest

5f3b7c2

use unwrap_or

a43c05a

ensure test port cannot be selected as ephemeral port

7d63674

folkertdev force-pushed the ready-error branch from 0a4cfc4 to 7d63674 Compare August 3, 2023 17:02

Darksonn reviewed Aug 3, 2023

View reviewed changes

assert that error readiness is not set before send

eed8c95

carllerche approved these changes Aug 16, 2023

View reviewed changes

carllerche merged commit 10e141d into tokio-rs:master Aug 16, 2023
72 checks passed

D3PSI mentioned this pull request Aug 17, 2023

chore(deps): bump tokio from 1.29.1 to 1.32.0 in /src-tauri woollygoods/huehuehue#52

Merged

Darksonn mentioned this pull request Nov 25, 2023

EPOLLERR does not wake up AsyncFd::readable() #4349

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

io: add `Ready::ERROR` and report error readiness #5781

io: add `Ready::ERROR` and report error readiness #5781

folkertdev commented Jun 8, 2023 •

edited

folkertdev commented Jul 3, 2023

Darksonn commented Jul 3, 2023

Darksonn Jul 5, 2023

folkertdev Jul 5, 2023 •

edited

Darksonn commented Jul 8, 2023

carllerche commented Jul 8, 2023

folkertdev commented Jul 8, 2023

Darksonn commented Jul 8, 2023

folkertdev commented Jul 8, 2023

Darksonn commented Jul 8, 2023

folkertdev commented Jul 10, 2023

Darksonn commented Jul 10, 2023

folkertdev commented Jul 10, 2023

Darksonn commented Jul 10, 2023

folkertdev commented Jul 10, 2023

carllerche left a comment

carllerche Jul 10, 2023

folkertdev Jul 10, 2023

carllerche Jul 12, 2023

folkertdev Jul 12, 2023

folkertdev commented Jul 10, 2023

folkertdev commented Jul 14, 2023

folkertdev commented Jul 24, 2023

Darksonn Aug 3, 2023

folkertdev Aug 3, 2023

Darksonn Aug 3, 2023

Darksonn Aug 3, 2023

folkertdev Aug 3, 2023

carllerche left a comment

		// error readiness is reported regardless of interest
		ready \|= Ready::ERROR;

io: add Ready::ERROR and report error readiness #5781

io: add Ready::ERROR and report error readiness #5781

Conversation

folkertdev commented Jun 8, 2023 • edited

Motivation

Solution

Testing

folkertdev commented Jul 3, 2023

Darksonn commented Jul 3, 2023

Choose a reason for hiding this comment

folkertdev Jul 5, 2023 • edited

Choose a reason for hiding this comment

Darksonn commented Jul 8, 2023

carllerche commented Jul 8, 2023

folkertdev commented Jul 8, 2023

Darksonn commented Jul 8, 2023

folkertdev commented Jul 8, 2023

Darksonn commented Jul 8, 2023

folkertdev commented Jul 10, 2023

Darksonn commented Jul 10, 2023

folkertdev commented Jul 10, 2023

Darksonn commented Jul 10, 2023

folkertdev commented Jul 10, 2023

carllerche left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

folkertdev commented Jul 10, 2023

folkertdev commented Jul 14, 2023

folkertdev commented Jul 24, 2023

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

carllerche left a comment

Choose a reason for hiding this comment

io: add `Ready::ERROR` and report error readiness #5781

io: add `Ready::ERROR` and report error readiness #5781

folkertdev commented Jun 8, 2023 •

edited

folkertdev Jul 5, 2023 •

edited