Replies: 5 comments 13 replies
-
Can you please provide more detail -
To start, provide the complete RabbitMQ logs from all nodes. Save them in an archive, and attach the archive to a comment. Thanks. |
Beta Was this translation helpful? Give feedback.
-
We are also facing this issue where we get timeout on 2 of the nodes. We have to restart the timeout nodes manually and then it resolved. But this is reoccurring for us and block the consumers for that time. Let me know if any way to debug and resolve it. |
Beta Was this translation helpful? Give feedback.
-
Related discussion: #11181 |
Beta Was this translation helpful? Give feedback.
-
We are going in circles on this. We cannot suggest anything without logs from all nodes in this state. Per RabbitMQ Community support policy, unless you are a paying user or a regular contributor, we require a specific set of steps to reproduce, or, at the very least, every bit of information we ask for:
In fact, |
Beta Was this translation helpful? Give feedback.
-
There were some specific findings in suggestions in #11181 (comment) based on provided log snippets, which I suspect was filed by a different person from the same team. There quorum queues are used for request-reply (RPC) responses. That is the opposite of what quorum queues were designed for, as their doc guide explicitly states. Short lived queues should be classic non-mirrored ones. They are very cheap to set up while quorum queues are very expensive to declare and are meant to be long lived. |
Beta Was this translation helpful? Give feedback.
-
I am not sure how to best provide logs for the issue I am having - I do have the same symptoms as #10936 only difference is that our cluster is already running on 3.13 and on a kubernetes cluster based on RKE2 (v1.26.11 +rke2r1).
with the following cluster status:
We had to shutdown our complete cluster, and once it started this problem started to happen. This also crashes when we try to create new queues.
Is there any idea what this could be related to or what kind of logs I can provide to get to the root of this?
Edit: I just read the comment in #10934 - checking if this might be related as it states that there might be something wrong with the node identities
Beta Was this translation helpful? Give feedback.
All reactions