
Limit the number of Scheduler#disposeGracefully threads #3259

Merged
merged 10 commits into 3.4.x from 3258-DisposeAwaiterLimitedThreads on Nov 4, 2022

Conversation

@simonbasle (Member) commented Oct 28, 2022

This change introduces a DisposeAwaiterRunnable with a small pool of
threads dedicated to polling the termination status after a graceful
Scheduler shutdown.

Previously, one Thread would be created for each Scheduler that is
disposed gracefully. While we don't expect this to be an issue in most
production applications, this can lead to hitting native thread limits
faster. Notably, stress tests around graceful disposal create a lot of
schedulers for that purpose.

This change also ensures that the evictor executorServices of both the
BoundedElasticScheduler and ElasticScheduler are limited to at most 1
thread.
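
For the evictor part, the idea is simply that the periodic eviction task never needs more than one thread, along these lines (illustrative only, not the exact reactor-core code; names and periods are placeholders):

```java
import java.util.concurrent.Executors;
import java.util.concurrent.ScheduledExecutorService;
import java.util.concurrent.TimeUnit;

// Illustrative sketch: the evictor runs a single periodic cleanup task,
// so a scheduled pool of exactly one thread is enough.
class EvictorExample {
    static ScheduledExecutorService newEvictor() {
        ScheduledExecutorService evictor =
                Executors.newScheduledThreadPool(1, r -> new Thread(r, "evictor"));
        // Periodically evict idle cached workers; the real task, thread name and
        // period differ in BoundedElasticScheduler/ElasticScheduler.
        evictor.scheduleAtFixedRate(() -> { /* evict idle workers */ },
                60, 60, TimeUnit.SECONDS);
        return evictor;
    }
}
```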

Finally, it attempts to improve the SchedulersStressTest to avoid the
OOMs as much as possible: block on disposeGracefully() calls, increase
the heap of forked JVMs for jcstress, and ultimately stop covering the
BoundedElasticScheduler in the stress test.
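
The "block on disposeGracefully()" adjustment amounts to putting a bounded wait on the returned Mono before the next iteration creates more schedulers, roughly like this (standalone sketch, not the jcstress harness itself):

```java
import java.time.Duration;

import reactor.core.scheduler.Scheduler;
import reactor.core.scheduler.Schedulers;

// Sketch of the stress-test change: wait (with a bound) for graceful disposal
// to complete so each iteration does not leak still-terminating threads.
class GracefulDisposeExample {
    public static void main(String[] args) {
        Scheduler scheduler = Schedulers.newParallel("example", 4);
        scheduler.disposeGracefully()
                 .block(Duration.ofSeconds(5)); // bounded wait for termination
    }
}
```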

Fixes #3258.

This change modifies the `DisposeAwaiter` interface so that the graceful
shutdown termination status can be polled for, rather than awaited/blocked
on.

It introduces a DisposeAwaiterRunnable with a small pool of threads
dedicated to polling the termination status after a graceful Scheduler
shutdown.

Previously, one Thread would be created for each Scheduler that is
disposed gracefully. While we don't expect this to be an issue in most
production applications, this can lead to hitting native thread limits
faster. Notably, stress tests around graceful disposal create a lot of
schedulers for that purpose.
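
The contract change described above could look roughly like the following (hypothetical names; the actual DisposeAwaiter signatures in reactor-core may differ):

```java
import java.util.concurrent.TimeUnit;

// Hypothetical sketch of the change: a non-blocking status check replaces
// (or complements) the blocking await, so a shared poller can drive it.
interface DisposeAwaiterSketch<T> {

    // Before: block the calling thread until the resource terminates or times out.
    boolean await(T resource, long timeout, TimeUnit timeUnit) throws InterruptedException;

    // After: report the current termination status without blocking.
    boolean isTerminated(T resource);
}
```
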
@simonbasle (Member, Author) commented:
currently seeing another type of OOM issue:

  Messages:
    java.lang.OutOfMemoryError: GC overhead limit exceeded
        at java.util.concurrent.ThreadPoolExecutor.<init>(ThreadPoolExecutor.java:463)
        at java.util.concurrent.ThreadPoolExecutor.<init>(ThreadPoolExecutor.java:1237)
        at java.util.concurrent.ScheduledThreadPoolExecutor.<init>(ScheduledThreadPoolExecutor.java:447)
        at reactor.core.scheduler.ParallelScheduler.get(ParallelScheduler.java:80)
        at reactor.core.scheduler.ParallelScheduler.init(ParallelScheduler.java:109)
        at reactor.core.scheduler.SchedulersStressTest$ParallelSchedulerDisposeGracefullyAndDisposeStressTest.<init>(SchedulersStressTest.java:318)
        at reactor.core.scheduler.SchedulersStressTest_ParallelSchedulerDisposeGracefullyAndDisposeStressTest_jcstress.internalRun(SchedulersStressTest_ParallelSchedulerDisposeGracefullyAndDisposeStressTest_jcstress.java:118)
        at org.openjdk.jcstress.infra.runners.Runner.run(Runner.java:72)

@simonbasle (Member, Author) commented:
We couldn't get to the bottom of the OOMs on M1 machines, but this change still improves the situation significantly, most notably on CI.

The last commit removes the stress tests for BoundedElasticScheduler, which by nature puts more pressure on native thread creation. These can be reintroduced in a separate branch to experiment with improving the stress tests on a macOS M1 machine.

@simonbasle simonbasle added this to the 3.4.25 milestone Nov 4, 2022
@simonbasle simonbasle added the type/enhancement A general enhancement label Nov 4, 2022
@simonbasle simonbasle self-assigned this Nov 4, 2022
@simonbasle simonbasle marked this pull request as ready for review November 4, 2022 14:09
@simonbasle simonbasle requested a review from a team as a code owner November 4, 2022 14:09
@@ -174,6 +144,11 @@ public void arbiter(IIZ_Result r) {
// by r.r1 and r.r2, which should be equal.
boolean consistentState = r.r1 == r.r2;
r.r3 = consistentState && scheduler.isDisposed();
if (consistentState) {
Member commented on this hunk:
👍

@simonbasle simonbasle merged commit 9fe3241 into 3.4.x Nov 4, 2022
@simonbasle simonbasle deleted the 3258-DisposeAwaiterLimitedThreads branch November 4, 2022 16:50
@reactorbot commented:

@simonbasle this PR seems to have been merged on a maintenance branch, please ensure the change is merge-forwarded to intermediate maintenance branches and up to main 🙇

@simonbasle (Member, Author) commented:
Merged into 3.5.0 (with the wrong commit message) via commit 2a83001.

chemicL pushed a commit that referenced this pull request Mar 7, 2023
Labels: type/enhancement (A general enhancement)
Linked issue: Scheduler graceful dispose should share awaitTermination monitoring threads
3 participants