Inline templates still do not create the pod name correctly #12937
Labels
area/controller
Controller issues, panics
P1
High priority. All bugs with >=5 thumbs up that aren’t P0, plus: Any other bugs deemed high priority
solution/suggested
A solution to the bug has been suggested. Someone needs to implement it.
solution/workaround
There's a workaround, might not be great, but exists
type/bug
Pre-requisites
:latest
What happened/what did you expect to happen?
It still looks like the bug from #10912 only that now with version v3.5.5 argo is not even realizing anymore that the pod is already completed.
The pod argo wants to refer to is named
fantastic-python-2118328171
as can be seen in the UI. (for the workflow, see the example workflow further down)It's stuck in pending mode as the pod is not found on the Kubernetes cluster, because when I look at the cluster, I see the pod called
fantastic-python--2118328171
which is either running or already done. Since argo apparently uses the wrong name to fetch the pod and its status, the workflow does not proceed. Even if it would continue, the logs would not be visible in the argo UI, see the referenced bug report for it.You can even see in the logs of the controller that it created the pod using the double
-
and then it tries to pull the status of the pod using only one-
. (see logs further, the first log is being returned when I search for the pod name with double-
, the other logs, when I search for the pod name with only one-
)If I got your fix right for #10912 I think you tried to fix a problem with the naming of the task, but I think the issue goes deeper:
As far as I understood it, the pod name should always be something like
{workflow-name}-{workflow-id}-{step-name}-{step-id}
but when I use theinline
option rather than thetemplate
option to reference another template, it simply does not add thestep-name
to the pod name.Please have a look at it, since I need to use the
inline
option as I'm creating the workflows automatically, and using thetemplate
option would make everything much much harder.Version
v3.5.5
Paste a small workflow that reproduces the issue. We must be able to run the workflow; don't enter a workflows that uses private images.
Logs from the workflow controller
Logs from in your workflow's wait container
The text was updated successfully, but these errors were encountered: