Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

The subnet is already taken by /tmp/pytest-of-root/pytest-21/env0/lago/default/uuid #614

Closed
nvgoldin opened this issue Jun 28, 2017 · 6 comments · Fixed by #637
Closed
Labels
Milestone

Comments

@nvgoldin
Copy link
Contributor

@gbenhaim
http://jenkins.ovirt.org/job/lago_master_github_check-patch-fc25-x86_64/450/testReport/junit/tests.functional-sdk/test_sdk_sanity/test_custom_log/

I am actually seeing this locally quite often since 0c8de17 was merged. Perhaps now the code does what it was supposed to do?

I think what happened per the job that failed - init failed in a previous job, but the /tmp/.... lago environment was not destroyed and the subnet lease remained in the locks.

@nvgoldin nvgoldin added the bug label Jun 28, 2017
@nvgoldin
Copy link
Contributor Author

Looks like we're partly affected by pytest-dev/pytest#1120
Regardless, need to think of a proper way to clean it up.

@nvgoldin
Copy link
Contributor Author

@gbenhaim
Copy link
Member

In another build, I saw that the networks weren't destroyed.
Taken from the slave:

virsh # net-list
 Name                 State      Autostart     Persistent
----------------------------------------------------------
 default              active     yes           yes
 dfe5-net-01          active     no            no
 dfe5-net-02          active     no            no

@nvgoldin
Copy link
Contributor Author

which build?

@gbenhaim
Copy link
Member

@nvgoldin
Copy link
Contributor Author

Looking at the slave's build history, these might be leftovers from:
http://jenkins.ovirt.org/job/lago_master_github_check-patch-fc25-x86_64/449/console

Where the job simply timed out:

17:16:40 test_sdk_sanity.py::test_extract_paths_ignore_nopath[vm-el7-3-base-/nothing/here-normal] PASSED
22:52:26 test_sdk_sanity.py::test_extract_paths_ignore_nopath[vm-el7-3-base-/nothing/here-dead] Build timed out (after 360 minutes). Marking the build as failed.
22:52:26 $ ssh-agent -k
22:52:26 unset SSH_AUTH_SOCK;

So this is a little different - the networks were still up, not only the leases. This issue is probably more rare and fixing it would mean updating the job's cleanup script.

Either way I think this (good) behaviour is new, I guess before it just re-wrote the lease or something.
Implementing #599 would probably solve most issues.

@gbenhaim gbenhaim added this to the 0.41 milestone Jul 5, 2017
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

Successfully merging a pull request may close this issue.

2 participants