Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Handle errors from NVL more gracefully. #6

Open
bzamecnik opened this issue Dec 5, 2018 · 0 comments
Open

Handle errors from NVL more gracefully. #6

bzamecnik opened this issue Dec 5, 2018 · 0 comments

Comments

@bzamecnik
Copy link
Collaborator

Logs:

  File "/usr/local/lib/python2.7/dist-packages/nvgpu/list_gpus.py", line 60, in device_statuses
    return [device_status(device_index) for device_index in range(device_count)]
  File "/usr/local/lib/python2.7/dist-packages/nvgpu/list_gpus.py", line 14, in device_status
    handle = nv.nvmlDeviceGetHandleByIndex(device_index)
  File "/usr/local/lib/python2.7/dist-packages/pynvml.py", line 946, in nvmlDeviceGetHandleByIndex
    _nvmlCheckReturn(ret)
  File "/usr/local/lib/python2.7/dist-packages/pynvml.py", line 405, in _nvmlCheckReturn
    raise NVMLError(ret)
NVMLError_GpuIsLost: GPU is lost

It fails with 500, but should better report the error NVMLError_GpuIsLost: GPU is lost.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant