Introduce Lingering Timeout #10569

rtribotte · 2024-04-04T07:41:25Z

What does this PR do?

This PR introduces the respondingTimeouts.lingeringTimeout option for entry points, with a default value of 2s.

The lingering timeout defines the maximum duration between each TCP read operation.
As a layer 4 timeout, it applies during HTTP handling but respects the respondingTimeouts.readTimeout option configuration.

The default value is purposely narrowed and can close the connection too early.
This could be breaking for "server-first" protocols.
We suggest to adapt this value accordingly to your situation.

This PR also deprecates the respondingTimeouts.<timeout> options:

<entryPoint>.transport.respondingTimeouts.readTimeout
<entryPoint>.transport.respondingTimeouts.writeTimeout
<entryPoint>.transport.respondingTimeouts.idleTimeout

They have been replaced by:

<entryPoint>.transport.respondingTimeouts.http.readTimeout
<entryPoint>.transport.respondingTimeouts.http.writeTimeout
<entryPoint>.transport.respondingTimeouts.http.idleTimeout

Motivation

This change avoids Traefik instances with the default configuration hanging while waiting for bytes to be read on the connection.
This has been identified to be an issue with:

HTTP/1.1 GET request specifying a Content-Length header with value >0.
Any silent TCP client connection (notably "server-first" protocols" and proxy protocol enabled on the entry point (some TCP services with proxyProtocol.trustedIPs broken in 3.0.0-rc1 #10448).
Any silent TCP client connection and no catch-all router to bypass the client-hello first bytes read.

Fixes #10448.
Superseeds #10531

More

Added/updated tests
Added/updated documentation

Additional Notes

Co-authored-by: Baptiste Mayelle baptiste.mayelle@traefik.io
Co-authored-by: Kevin Pollet pollet.kevin@gmail.com

mmatur

LGTM

lbenguigui

LGTM

juliens

LGTM

ngbrown · 2024-04-11T05:45:29Z

Can someone expand more on what is meant by "between each TCP read operation"? Is Traefik monitoring the TCP packet acks from the service? Or is it the delay between received packets from the client on the ingress port? Can more information be provided on how this is technically measured?

This change also seems breaks the AMQP TCP protocol, and AMQP over WebSockets. Setting the value to 0 cures the problems, but if I knew what the above phrase meant, I could try other values.

Edit: The documentation verbiage seems to be related to the go net package documentation for SetReadDeadline(). It's really not descriptive enough as to what is going on for a non-go programmer.

A better description would describe what is happening at a more physical level. Like: "this timeout is the maximum delay between received packets". Is this an accurate description? Does it affect both directions?

see traefik/traefik#10569

yashgorana · 2024-04-11T10:57:52Z

Can someone please help me understand why this new timeout is (1) introduced as a patch, (2) has such a small value of 2s knowing that it will break systems and more importantly (3) why it isn't an opt-in feature? This PR triggered a bunch of issues reported in #10596, #10595, #10589

rtribotte added status/2-needs-review kind/bug/fix a bug fix area/server area/tcp breaking labels Apr 4, 2024

rtribotte added this to the 2.11 milestone Apr 4, 2024

traefiker added the size/M label Apr 4, 2024

rtribotte mentioned this pull request Apr 4, 2024

Change default readTimeout #10531

Closed

2 tasks

rtribotte force-pushed the fix-lingering-timeout branch from 38ee87e to 64cb06e Compare April 4, 2024 07:42

mmatur approved these changes Apr 5, 2024

View reviewed changes

lbenguigui approved these changes Apr 5, 2024

View reviewed changes

fix: introduce lingering timeout

ff01a98

rtribotte force-pushed the fix-lingering-timeout branch from 64cb06e to 30206ae Compare April 8, 2024 09:28

review: split TCP and HTTP timeout options.

6e829da

rtribotte force-pushed the fix-lingering-timeout branch from 30206ae to 6e829da Compare April 8, 2024 09:34

juliens approved these changes Apr 8, 2024

View reviewed changes

juliens added status/3-needs-merge and removed status/2-needs-review labels Apr 8, 2024

traefiker merged commit cef8422 into traefik:v2.11 Apr 8, 2024
22 checks passed

traefiker removed the status/3-needs-merge label Apr 8, 2024

ldez mentioned this pull request Apr 10, 2024

v2.11.1 breaks websockets for asp.net core #10589

Closed

2 tasks

wollomatic mentioned this pull request Apr 11, 2024

v2.11.1 breaks file upload in some cases - related to PR #10569 (with solution described) #10595

Closed

2 tasks

wollomatic added a commit to wollomatic/traefik-hardened that referenced this pull request Apr 11, 2024

mitigate breaking change since traefik v2.11.1

e7b7e6c

see traefik/traefik#10569

wollomatic added a commit to wollomatic/simple-traefik that referenced this pull request Apr 11, 2024

mitigate breaking change since Traefik v2.11.1

3d13835

see traefik/traefik#10569

agilezebra mentioned this pull request Apr 11, 2024

v2.11.1 lingeringTimeout breaks normal behaviour #10596

Closed

2 tasks

emilevauge mentioned this pull request Apr 11, 2024

v2.11.1 lingeringTimeout can break some connections #10598

Closed

2 tasks

sholl mentioned this pull request May 3, 2024

Regular warnings in log since using matrix integration home-assistant/core#111921

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Introduce Lingering Timeout #10569

Introduce Lingering Timeout #10569

rtribotte commented Apr 4, 2024 •

edited

mmatur left a comment

lbenguigui left a comment

juliens left a comment

ngbrown commented Apr 11, 2024 •

edited

yashgorana commented Apr 11, 2024

Introduce Lingering Timeout #10569

Introduce Lingering Timeout #10569

Conversation

rtribotte commented Apr 4, 2024 • edited

What does this PR do?

Motivation

More

Additional Notes

mmatur left a comment

Choose a reason for hiding this comment

lbenguigui left a comment

Choose a reason for hiding this comment

juliens left a comment

Choose a reason for hiding this comment

ngbrown commented Apr 11, 2024 • edited

yashgorana commented Apr 11, 2024

rtribotte commented Apr 4, 2024 •

edited

ngbrown commented Apr 11, 2024 •

edited