New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
CI: K8sDatapathConfig Encapsulation Check iptables masquerading with random-fully #17353
Comments
From a quick initial glance this looks similar to #13773 |
I'm not convinced it's #13773 given we had fixed that flake and it didn't appear for a long time. However, if you want to double-check, the full analysis of #13773's root cause is at #13773 (comment). Also, according to our CI dashboard, this is the only occurrence of this flake in the past two weeks. IMO, based on that, most likely cause would be: 1) it's a random, one-off google.com failure or 2) the PR where it happened introduced it. |
PR #17368 hit this flake with 89.02% similarity: Click to show.Test Name
Failure Output
StacktraceClick to show.
Standard OutputClick to show.
Standard ErrorClick to show.
ZIP Links: Click to show.https://jenkins.cilium.io/job/Cilium-PR-K8s-1.20-kernel-4.19//1430/artifact/1726431d_K8sDatapathConfig_Encapsulation_Check_iptables_masquerading_with_random-fully.zip Jenkins URL: https://jenkins.cilium.io/job/Cilium-PR-K8s-1.20-kernel-4.19/1430/ If this is a duplicate of an existing flake, comment 'Duplicate of #<issue-number>' and close this issue. |
PR #17596 hit this flake with 90.54% similarity: Click to show.Test Name
Failure Output
StacktraceClick to show.
Standard OutputClick to show.
Standard ErrorClick to show.
ZIP Links: Click to show.https://jenkins.cilium.io/job/Cilium-PR-K8s-1.19-kernel-5.4//1246/artifact/c58b47d9_K8sDatapathConfig_Encapsulation_Check_iptables_masquerading_with_random-fully.zip Jenkins URL: https://jenkins.cilium.io/job/Cilium-PR-K8s-1.19-kernel-5.4/1246/ If this is a duplicate of an existing flake, comment 'Duplicate of #<issue-number>' and close this issue. |
PR #17704 hit this flake with 85.95% similarity: Click to show.Test Name
Failure Output
StacktraceClick to show.
Standard OutputClick to show.
Standard ErrorClick to show.
ZIP Links: Click to show.https://jenkins.cilium.io/job/Cilium-PR-K8s-GKE//6778/artifact/src/github.com/cilium/cilium/33bbbc48_K8sDatapathConfig_Encapsulation_Check_iptables_masquerading_with_random-fully.zip Jenkins URL: https://jenkins.cilium.io/job/Cilium-PR-K8s-GKE/6778/ If this is a duplicate of an existing flake, comment 'Duplicate of #<issue-number>' and close this issue. |
No MLH, that last one is not the same flake. It's a panic and it seems to match #17661 (comment). |
PR #17780 hit this flake with 88.79% similarity: Click to show.Test Name
Failure Output
StacktraceClick to show.
Standard OutputClick to show.
Standard ErrorClick to show.
ZIP Links: Click to show.https://jenkins.cilium.io/job/Cilium-PR-K8s-GKE//6933/artifact/src/github.com/cilium/cilium/cf272726_K8sDatapathConfig_Encapsulation_Check_iptables_masquerading_with_random-fully.zip Jenkins URL: https://jenkins.cilium.io/job/Cilium-PR-K8s-GKE/6933/ If this is a duplicate of an existing flake, comment 'Duplicate of #<issue-number>' and close this issue. |
PR #17929 hit this flake with 85.45% similarity: Click to show.Test Name
Failure Output
StacktraceClick to show.
Standard OutputClick to show.
Standard ErrorClick to show.
ZIP Links: Click to show.https://jenkins.cilium.io/job/Cilium-PR-K8s-1.21-kernel-4.19//239/artifact/6609f4a6_K8sDatapathConfig_Encapsulation_Check_iptables_masquerading_with_random-fully.zip Jenkins URL: https://jenkins.cilium.io/job/Cilium-PR-K8s-1.21-kernel-4.19/239/ If this is a duplicate of an existing flake, comment 'Duplicate of #<issue-number>' and close this issue. |
PR #18041 hit this flake with 89.67% similarity: Click to show.Test Name
Failure Output
StacktraceClick to show.
Standard OutputClick to show.
Standard ErrorClick to show.
ZIP Links: Click to show.https://jenkins.cilium.io/job/Cilium-PR-K8s-1.21-kernel-5.4//25/artifact/d2429973_K8sDatapathConfig_Encapsulation_Check_iptables_masquerading_with_random-fully.zip Jenkins URL: https://jenkins.cilium.io/job/Cilium-PR-K8s-1.21-kernel-5.4/25/ If this is a duplicate of an existing flake, comment 'Duplicate of #<issue-number>' and close this issue. |
PR #17762 hit this flake with 88.79% similarity: Click to show.Test Name
Failure Output
StacktraceClick to show.
Standard OutputClick to show.
Standard ErrorClick to show.
ZIP Links: Click to show.https://jenkins.cilium.io/job/Cilium-PR-K8s-GKE//7061/artifact/src/github.com/cilium/cilium/ac9ad29e_K8sDatapathConfig_Encapsulation_Check_iptables_masquerading_with_random-fully.zip Jenkins URL: https://jenkins.cilium.io/job/Cilium-PR-K8s-GKE/7061/ If this is a duplicate of an existing flake, comment 'Duplicate of #<issue-number>' and close this issue. |
PR #19062 hit this flake with 91.08% similarity: Click to show.Test Name
Failure Output
StacktraceClick to show.
Standard OutputClick to show.
Standard ErrorClick to show.
ZIP Links: Click to show.https://jenkins.cilium.io/job/Cilium-PR-K8s-1.19-kernel-5.4//1427/artifact/fbf63a34_K8sDatapathConfig_Encapsulation_Check_iptables_masquerading_with_random-fully.zip Jenkins URL: https://jenkins.cilium.io/job/Cilium-PR-K8s-1.19-kernel-5.4/1427/ If this is a duplicate of an existing flake, comment 'Duplicate of #<issue-number>' and close this issue. |
PR #20937 hit this flake with 87.91% similarity: Click to show.Test Name
Failure Output
StacktraceClick to show.
Standard OutputClick to show.
Standard ErrorClick to show.
ZIP Links: Click to show.https://jenkins.cilium.io/job/Cilium-PR-K8s-1.16-net-next//2464/artifact/d772b98c_K8sDatapathConfig_Encapsulation_Check_iptables_masquerading_with_random-fully.zip Jenkins URL: https://jenkins.cilium.io/job/Cilium-PR-K8s-1.16-net-next/2464/ If this is a duplicate of an existing flake, comment 'Duplicate of #<issue-number>' and close this issue. |
Taking run https://jenkins.cilium.io/job/cilium-master-k8s-1.24-kernel-5.4-quarantine/410/testReport/junit/Suite-k8s-1/24/K8sDatapathConfig_Encapsulation_Check_iptables_masquerading_with_random_fully/ to debug this. Tracing the PacketWe can see the IPv6 packet failing to be routed outside the node:
Routing FailureRunning the test locally, we can see that the packet should have been routed as follows:
from the default route:
This route is installed at Lines 240 to 243 in f59df85
In the Jenkins output, we can see that that step of the provisioning failed for k8s1:
For some reason, the IPv6 NAT interface (enp0s16) is missing its NAT IP address (something like
|
This commit dumps the natnetwork configuration on the host as part of the VirtualBox provisioning, to help debug a flake with that device. Related: cilium#17353 Signed-off-by: Paul Chaignon <paul@cilium.io>
This commit dumps the natnetwork configuration on the host as part of the VirtualBox provisioning, to help debug a flake with that device. Related: #17353 Signed-off-by: Paul Chaignon <paul@cilium.io>
#22675 dumped the VirtualBox natnetworks on the host during the VM provisioning process. In case the tests passed, we can see the following natnetworks:
When the test fails, we have the following:
|
Next step is to see what effect #22704 had on the quarantined tests. I expect it to mitigate the flake but not completely remove it. We can track this with this DataStudio dashboard. In case of any failure, we should check the Jenkins log to see:
|
Test Name
Failure Output
Stacktrace
Click to show.
Standard Output
Click to show.
Standard Error
Click to show.
ZIP Links:
Click to show.
https://jenkins.cilium.io/job/Cilium-PR-K8s-1.19-kernel-5.4//960/artifact/b2fccc59_K8sDatapathConfig_Encapsulation_Check_iptables_masquerading_with_random-fully.zip
https://jenkins.cilium.io/job/Cilium-PR-K8s-1.19-kernel-5.4//960/artifact/test_results_Cilium-PR-K8s-1.19-kernel-5.4_960_BDD-Test-PR.zip
Jenkins URL: https://jenkins.cilium.io/job/Cilium-PR-K8s-1.19-kernel-5.4/960/
If this is a duplicate of an existing flake, comment 'Duplicate of #<issue-number>' and close this issue.
The text was updated successfully, but these errors were encountered: