[Bug] When restart oap got interruptedException #12182
-
I noticed that 9.4 uses io.kubernetes.client as its Kubernetes client. I had been checking the wrong source code, which was 8.9.
-
About the timeout: io.kubernetes.client uses OkHttpClient, whose timeout is set to 10s. If the Kubernetes cluster is large enough, an API call can take longer than 10s to return and fail with a timeout error.
-
Simply increasing the timeout may not help. What about retrieving the results in chunks, as described at https://kubernetes.io/docs/reference/using-api/api-concepts/#retrieving-large-results-sets-in-chunks
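To make the chunking idea concrete, here is a minimal sketch of the loop it implies: ask for at most `limit` items, then keep passing back the `continue` token until the server stops returning one. `PageFetcher`, `Page`, and `fakeServer` are hypothetical stand-ins written for this example, not the io.kubernetes.client API, and the fake server simply uses an offset as its token.

```java
import java.util.ArrayList;
import java.util.List;

class ChunkedListSketch {
    /** One chunk of results plus the token for the next chunk (null when done). */
    static final class Page {
        final List<String> items;
        final String continueToken;

        Page(List<String> items, String continueToken) {
            this.items = items;
            this.continueToken = continueToken;
        }
    }

    /** Hypothetical stand-in for a list call that accepts limit and continue. */
    interface PageFetcher {
        Page fetch(int limit, String continueToken);
    }

    /** Collects every item by requesting chunks of at most `limit` items. */
    static List<String> listAll(PageFetcher fetcher, int limit) {
        List<String> all = new ArrayList<>();
        String token = null;
        do {
            Page page = fetcher.fetch(limit, token);
            all.addAll(page.items);
            token = page.continueToken;
        } while (token != null && !token.isEmpty());
        return all;
    }

    /** In-memory fake server; the token is just the next offset (an assumption). */
    static PageFetcher fakeServer(List<String> pods) {
        return (limit, token) -> {
            int start = (token == null) ? 0 : Integer.parseInt(token);
            int end = Math.min(start + limit, pods.size());
            String next = (end < pods.size()) ? String.valueOf(end) : null;
            return new Page(new ArrayList<>(pods.subList(start, end)), next);
        };
    }

    public static void main(String[] args) {
        List<String> pods = List.of("pod-a", "pod-b", "pod-c", "pod-d", "pod-e");
        // Five pods fetched two at a time take three requests.
        System.out.println(listAll(fakeServer(pods), 2));
    }
}
```

Each request then returns a bounded amount of data, so a single call is less likely to exceed a fixed per-request timeout, at the cost of more round trips.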
-
KubernetesPods.java
The limit is not configured; it is set to null.
-
Search before asking
Apache SkyWalking Component
OAP server (apache/skywalking) 9.4
What happened
Restarting the OAP service failed.
What you expected to happen
When we reboot the OAP service, it sometimes starts successfully, but 90% of the time it fails with an InterruptedException.
According to the info log, listing pods through the Kubernetes API seems to time out.
So I checked the SkyWalking source code and noticed that it connects to Kubernetes using io.fabric8.kubernetes.client,
and found that the timeout can be changed:
public static final String KUBERNETES_CONNECTION_TIMEOUT_SYSTEM_PROPERTY = "kubernetes.connection.timeout";
public static final String KUBERNETES_UPLOAD_REQUEST_TIMEOUT_SYSTEM_PROPERTY = "kubernetes.upload.request.timeout";
public static final String KUBERNETES_REQUEST_TIMEOUT_SYSTEM_PROPERTY = "kubernetes.request.timeout";
kubernetes.connection.timeout
kubernetes.request.timeout
The default value is 10000 ms. I tried setting it to 15000 ms, but that did not work for me.
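As a debugging aid, a small sketch like the one below can confirm whether a `-Dkubernetes.request.timeout=15000` flag actually reached the JVM. `resolveTimeoutMillis` is a hypothetical helper written for this example; it only mirrors the usual "system property with a default" pattern and is not the fabric8 client's own code. The property name and the 10000 ms default are the ones quoted above.

```java
class TimeoutPropertySketch {
    static final String REQUEST_TIMEOUT_PROPERTY = "kubernetes.request.timeout";
    static final int DEFAULT_TIMEOUT_MS = 10_000;

    /** Returns the property as milliseconds, or the default if it is unset or not a number. */
    static int resolveTimeoutMillis(String propertyName, int defaultMillis) {
        String raw = System.getProperty(propertyName);
        if (raw == null || raw.trim().isEmpty()) {
            return defaultMillis;
        }
        try {
            return Integer.parseInt(raw.trim());
        } catch (NumberFormatException e) {
            return defaultMillis;
        }
    }

    public static void main(String[] args) {
        int timeout = resolveTimeoutMillis(REQUEST_TIMEOUT_PROPERTY, DEFAULT_TIMEOUT_MS);
        System.out.println(REQUEST_TIMEOUT_PROPERTY + " = " + timeout + " ms");
    }
}
```

If the printed value is the expected one and the timeout still does not change, the property is most likely being read by a library that the running version does not use.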
Here is my question:
If SkyWalking uses io.fabric8.kubernetes.client, why does the error come from io.kubernetes.client?
How to reproduce
How to solve the issue.
Anything else
No response
Are you willing to submit a pull request to fix on your own?
Code of Conduct