Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Deploy fails immediately on ImagePullBackoff #54

Open
smlx opened this issue May 25, 2021 · 2 comments
Open

Deploy fails immediately on ImagePullBackoff #54

smlx opened this issue May 25, 2021 · 2 comments
Labels
enhancement New feature or request

Comments

@smlx
Copy link
Member

smlx commented May 25, 2021

I had a deploy fail almost immediately when the kubectl-build-deploy-dind image went into ImagePullBackoff. Here's what happened:

  1. Started several deploys, got success back from Lagoon API.
  2. Lagoon build pods appeared.
  3. One build pod went into ImagePullBackoff (the others started running).
  4. The ImagePullBackoff build pod disappeared.
  5. Deploy shown as failed in Lagoon dashboard:

Screenshot from 2021-05-25 16-25-09

I would have expected the pod to eventually start running instead of failing the deploy. The other builds started at the same time ran fine, so the image pull error may have just been a transient network issue?

I ran deploy on this environment a second time and it ran through fine.

@smlx smlx added the bug Something isn't working label May 25, 2021
@shreddedbacon
Copy link
Member

Controller fails a build if it can't start for whatever reason, imagepullbackoff is one of them.

If the image is a genuine fail to pull, then we would need some sort of timeout on it to prevent it from blocking future builds.

I'm ok with doing this if its something we want to support?

@smlx
Copy link
Member Author

smlx commented May 26, 2021

I think it would be nice to have some kind of timeout to allow kubernetes time to recover. If only to avoid Lagoon users being confused about why their deploys sporadically fail. It doesn't have to be long - just enough for kubernetes to retry the image pull.

I've only seen this once, so it isn't an urgent problem.

@smlx smlx added enhancement New feature or request and removed bug Something isn't working labels May 26, 2021
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
None yet
Development

No branches or pull requests

2 participants