You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Opensearch data node get's constantly excluded from shard allocations
How can one reproduce the bug?
Deploy the opensearch operator and cluster with the configuration shown below.
What is the expected behavior?
Working opensearch cluster without yellow health state / no excluded data nodes.
What is your host/environment?
Debian 12 x64 / k3s v1.28.8+k3s1
Do you have any screenshots?
Outputs listed below.
Do you have any additional context?
I have more than enough disk space available and writing only 4-6 GiB per Day into opensearch.
Hello! I'm using the Helm OpenSearch-Operator and Helm OpenSearch-Cluster.
My cluster is showing a yellow status due to unassigned replica shards.
I ran GET _cluster/settings and received the following output:
I attempted to remove data-nodes-0 from the exclusion list using a PUT request, but it automatically gets added back to the list after a few seconds.
Here are some details about my setup:
helm list
NAME NAMESPACE REVISION UPDATED STATUS CHART APP VERSION
opensearch-cluster opensearch 5 2024-04-22 13:53:25.886739305 +0200 CEST deployed opensearch-cluster-2.5.1 2.5.1
opensearch-operator opensearch 4 2024-04-22 14:13:59.756698617 +0200 CEST deployed opensearch-operator-2.5.1 2.5.1
kubectl get OpenSearchCluster
NAME HEALTH NODES VERSION PHASE AGE
opensearch-cluster yellow 6 2.8.0 RUNNING 66d
Hi @hollowdew. Not sure why this is happening. The only operator components that set the exclusion are the restarter and the upgrader. And accoding to your status none of them are doing anything.
Could you please set drainDataNodes to false (the drain should only be needed for emptyDir and since you have no extra persistence config it uses PVCs) and see if that stops the node being added to the exclusion list?
I greatly appreciate your suggestion. I have updated my cluster and manually removed data-node0 from the exclusion list once more. I will monitor to see if it gets excluded again and will provide an update tomorrow.
What is the bug?
Opensearch data node get's constantly excluded from shard allocations
How can one reproduce the bug?
Deploy the opensearch operator and cluster with the configuration shown below.
What is the expected behavior?
Working opensearch cluster without yellow health state / no excluded data nodes.
What is your host/environment?
Debian 12 x64 / k3s v1.28.8+k3s1
Do you have any screenshots?
Outputs listed below.
Do you have any additional context?
I have more than enough disk space available and writing only 4-6 GiB per Day into opensearch.
Hello! I'm using the Helm OpenSearch-Operator and Helm OpenSearch-Cluster.
My cluster is showing a yellow status due to unassigned replica shards.
I ran GET _cluster/settings and received the following output:
And the health of my cluster:
GET _cluster/health?pretty
I attempted to remove data-nodes-0 from the exclusion list using a PUT request, but it automatically gets added back to the list after a few seconds.
Here are some details about my setup:
Here are my Helm values for the OpenSearch cluster:
And here are the Helm values for the operator:
Here are some log entries from my operator that repeat every few seconds:
I haven't found any errors in the logs for the manager-node or data-node.
After I remove data-node-0 from the exclusion list
The unassigned shards slowly being processed.
However, data-node-0 is then re-added to the exclusion list after some seconds.
The text was updated successfully, but these errors were encountered: