Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Failing Test]: beam_PreCommit_Java_PVR_Flink_Batch stuck #26301

Closed
2 of 15 tasks
Abacn opened this issue Apr 17, 2023 · 1 comment
Closed
2 of 15 tasks

[Failing Test]: beam_PreCommit_Java_PVR_Flink_Batch stuck #26301

Abacn opened this issue Apr 17, 2023 · 1 comment

Comments

@Abacn
Copy link
Contributor

Abacn commented Apr 17, 2023

What happened?

There is possibility that this precommit stuck and timing out after 4 hour. It is not obvious which test is timing out. Having downloaded the raw log in the jenkins node, it is found that the stucking test is CombineTest.basicTests#testCombinePerKeyLambda at least for https://ci-beam.apache.org/job/beam_PreCommit_Java_PVR_Flink_Batch_Cron/9111/

Issue Failure

Failure: Test is flaky

Issue Priority

Priority: 2 (backlog / disabled test but we think the product is healthy)

Issue Components

  • Component: Python SDK
  • Component: Java SDK
  • Component: Go SDK
  • Component: Typescript SDK
  • Component: IO connector
  • Component: Beam examples
  • Component: Beam playground
  • Component: Beam katas
  • Component: Website
  • Component: Spark Runner
  • Component: Flink Runner
  • Component: Samza Runner
  • Component: Twister2 Runner
  • Component: Hazelcast Jet Runner
  • Component: Google Cloud Dataflow Runner
@Abacn
Copy link
Contributor Author

Abacn commented Apr 17, 2023

The sampler continuously emitting warnings indicating job stuck:

[pool-258-thread-2] WARN org.apache.beam.fn.harness.control.ExecutionStateSampler - Operation ongoing in bundle 12 for at least 01h00m51s without outputting or completing:
at sun.misc.Unsafe.park(Native Method)
at java.util.concurrent.locks.LockSupport.park(LockSupport.java:175)
at java.util.concurrent.locks.AbstractQueuedSynchronizer$ConditionObject.await(AbstractQueuedSynchronizer.java:2039)
at org.apache.beam.sdk.fn.CancellableQueue.take(CancellableQueue.java:94)
at org.apache.beam.sdk.fn.data.BeamFnDataInboundObserver.awaitCompletion(BeamFnDataInboundObserver.java:122)
at org.apache.beam.fn.harness.control.ProcessBundleHandler.processBundle(ProcessBundleHandler.java:550)
at org.apache.beam.fn.harness.FnHarness$$Lambda$1264/2051412990.apply(Unknown Source)
at org.apache.beam.fn.harness.control.BeamFnControlClient.delegateOnInstructionRequestType(BeamFnControlClient.java:151)
at org.apache.beam.fn.harness.control.BeamFnControlClient$InboundObserver.lambda$onNext$0(BeamFnControlClient.java:116)
at org.apache.beam.fn.harness.control.BeamFnControlClient$InboundObserver$$Lambda$1391/138607428.run(Unknown Source)
at java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:511)
at java.util.concurrent.FutureTask.run(FutureTask.java:266)
at org.apache.beam.sdk.util.UnboundedScheduledExecutorService$ScheduledFutureTask.run(UnboundedScheduledExecutorService.java:163)
at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149)
at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624)
at java.lang.Thread.run(Thread.java:750)

However this code path is generic and not help much for root cause

@Abacn Abacn closed this as not planned Won't fix, can't repro, duplicate, stale May 14, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

1 participant