SharedResourceHolder should roughly handle exceptions during close #6002

ejona86 · 2019-07-23T23:54:53Z

@xCASx reported a bug where gRPC would get "hung" after a point, which included the following stack trace:

Jul 22, 2019 7:37:47 PM io.grpc.internal.LogExceptionRunnable run
SEVERE: Exception while executing runnable io.grpc.internal.SharedResourceHolder$2@73cfe40e
io.grpc.netty.shaded.io.netty.channel.ChannelException: eventfd_write() failed: Bad file descriptor
at io.grpc.netty.shaded.io.netty.channel.epoll.Native.eventFdWrite(Native Method)
at io.grpc.netty.shaded.io.netty.channel.epoll.EpollEventLoop.wakeup(EpollEventLoop.java:167)
at io.grpc.netty.shaded.io.netty.util.concurrent.SingleThreadEventExecutor.shutdownGracefully(SingleThreadEventExecutor.java:603)
at io.grpc.netty.shaded.io.netty.util.concurrent.MultithreadEventExecutorGroup.shutdownGracefully(MultithreadEventExecutorGroup.java:163)
at io.grpc.netty.shaded.io.grpc.netty.Utils$DefaultEventLoopGroupResource.close(Utils.java:346)
at io.grpc.netty.shaded.io.grpc.netty.Utils$DefaultEventLoopGroupResource.close(Utils.java:318)
at io.grpc.internal.SharedResourceHolder$2.run(SharedResourceHolder.java:145)
at io.grpc.internal.LogExceptionRunnable.run(LogExceptionRunnable.java:43)
at java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:511)
at java.util.concurrent.FutureTask.run(FutureTask.java:266)
at java.util.concurrent.ScheduledThreadPoolExecutor$ScheduledFutureTask.access$201(ScheduledThreadPoolExecutor.java:180)
at java.util.concurrent.ScheduledThreadPoolExecutor$ScheduledFutureTask.run(ScheduledThreadPoolExecutor.java:293)
at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149)
at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624)
at java.lang.Thread.run(Thread.java:748)

If we look in SharedResourceHolder, if resource.close(instance) fails throws instances.remove(resource) is not run. While we shouldn't encourage exceptions to be thrown during close, we would like it to be able to recover eventually. In this case, any future resource fetches will get the partially-closed resource, which will immediately fail.

grpc-java/core/src/main/java/io/grpc/internal/SharedResourceHolder.java

Lines 145 to 146 in d7b9438

    
           resource.close(instance); 
        
           instances.remove(resource);

@xCASx, a workaround would be to keep the client objects alive for as long as possible. We generally encourage that for performance, but as long as one client object is alive we won't attempt to shut down this executor.

The text was updated successfully, but these errors were encountered:

arand-mms · 2019-08-08T07:19:37Z

We run into the same problem. A fast solution would be fine!

…Fixes grpc#6002.

Fixes grpc#6002.

Fixes #6002. (#6044)

Fixes grpc#6002. (grpc#6044)

Fixes #6002. (#6044) (#6046)

Fixes #6002. (#6044) (#6047)

Fixes #6002. (#6044) (#6048)

ejona86 added the bug label Jul 23, 2019

ejona86 added this to the 1.23 milestone Jul 23, 2019

voidzcy added a commit to voidzcy/grpc-java that referenced this issue Aug 8, 2019

core: handle removing patially-closed resources for throwing on close. …

88360fd

…Fixes grpc#6002.

voidzcy added a commit to voidzcy/grpc-java that referenced this issue Aug 8, 2019

core: handle removing partially-closed resources for throwing on close.

91eb998

Fixes grpc#6002.

voidzcy mentioned this issue Aug 8, 2019

core: handle removing partially-closed resources for throwing on close. Fixes #6002 #6044

Merged

voidzcy closed this as completed in #6044 Aug 8, 2019

voidzcy added a commit that referenced this issue Aug 8, 2019

core: handle removing partially-closed resources for throwing on close.

539f040

Fixes #6002. (#6044)

xCASx mentioned this issue Aug 8, 2019

Dataproc: RejectedExecutionException: event executor terminated googleapis/google-cloud-java#5810

Closed

voidzcy added a commit to voidzcy/grpc-java that referenced this issue Aug 8, 2019

core: handle removing partially-closed resources for throwing on close.

660f635

Fixes grpc#6002. (grpc#6044)

voidzcy mentioned this issue Aug 8, 2019

core: handle removing partially-closed resources for throwing on close. Fixes #6002 (1.23.x backport) #6046

Merged

voidzcy added a commit to voidzcy/grpc-java that referenced this issue Aug 8, 2019

core: handle removing partially-closed resources for throwing on close.

80fca2e

Fixes grpc#6002. (grpc#6044)

voidzcy mentioned this issue Aug 8, 2019

core: handle removing partially-closed resources for throwing on close. Fixes #6002 (1.22.x backport) #6047

Merged

voidzcy added a commit to voidzcy/grpc-java that referenced this issue Aug 8, 2019

core: handle removing partially-closed resources for throwing on close.

c2e601a

Fixes grpc#6002. (grpc#6044)

voidzcy mentioned this issue Aug 8, 2019

core: handle removing partially-closed resources for throwing on close. Fixes #6002 (1.21.x backport) #6048

Merged

voidzcy added a commit that referenced this issue Aug 8, 2019

core: handle removing partially-closed resources for throwing on close.

8779c6a

Fixes #6002. (#6044) (#6046)

voidzcy added a commit that referenced this issue Aug 8, 2019

core: handle removing partially-closed resources for throwing on close.

54084b9

Fixes #6002. (#6044) (#6047)

voidzcy added a commit that referenced this issue Aug 8, 2019

core: handle removing partially-closed resources for throwing on close.

0736411

Fixes #6002. (#6044) (#6048)

lock bot locked as resolved and limited conversation to collaborators Nov 6, 2019

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

SharedResourceHolder should roughly handle exceptions during close #6002

SharedResourceHolder should roughly handle exceptions during close #6002

ejona86 commented Jul 23, 2019

arand-mms commented Aug 8, 2019

SharedResourceHolder should roughly handle exceptions during close #6002

SharedResourceHolder should roughly handle exceptions during close #6002

Comments

ejona86 commented Jul 23, 2019

arand-mms commented Aug 8, 2019