Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

-plugin-timeout does not wait long enoufg #3583

Open
hchybz opened this issue Mar 26, 2024 · 4 comments
Open

-plugin-timeout does not wait long enoufg #3583

hchybz opened this issue Mar 26, 2024 · 4 comments

Comments

@hchybz
Copy link

hchybz commented Mar 26, 2024

Hi

need to run sos on a 8-node-hardware-assembled-256core BULLION server.

lshw takes approx 15 minutes to parse every part..

sos report is unable to wait so long although --plugin-timeout=-1 is set

root@my377[v240325]:/$ sos report -o hardware  --plugin-timeout=-1 -vv --batch --debug

...
[plugin:hardware] unpacked command: binary=False, changes=False, chroot=True, cmd='lshw', env=None, foreground=False, root_symlink=None, runat=None, sizelimit=25, stderr=True, subdir=None, suggest_filename=None, tags=[], timeout=None
[plugin:hardware] collecting output of 'lshw'

 Plugin hardware timed out


[archive:TarFileArchive] added open file to FileCacheArchive '/tmp/sos.6b49az9p/sosreport-my377-2024-03-26-gfzrskc'
[archive:TarFileArchive] added open file to FileCacheArchive '/tmp/sos.6b49az9p/sosreport-my377-2024-03-26-gfzrskc'
[archive:TarFileArchive] added open file to FileCacheArchive '/tmp/sos.6b49az9p/sosreport-my377-2024-03-26-gfzrskc'
[archive:TarFileArchive] added string at 'version.txt' to FileCacheArchive '/tmp/sos.6b49az9p/sosreport-my377-2024-03-26-gfzrskc'
[archive:TarFileArchive] added open file to FileCacheArchive '/tmp/sos.6b49az9p/sosreport-my377-2024-03-26-gfzrskc'
[archive:TarFileArchive] added open file to FileCacheArchive '/tmp/sos.6b49az9p/sosreport-my377-2024-03-26-gfzrskc'
[archive:TarFileArchive] added string at 'sos_reports/manifest.json' to FileCacheArchive '/tmp/sos.6b49az9p/sosreport-my377-2024-03-26-gfzrskc'
Creating compressed archive...
[archive:TarFileArchive] finalizing archive '/tmp/sos.6b49az9p/sosreport-my377-2024-03-26-gfzrskc' using method 'auto'
[archive:TarFileArchive] built archive at '/tmp/sos.6b49az9p/sosreport-my377-2024-03-26-gfzrskc.tar' (size=6502400)

Your sosreport has been generated and saved in:
        /tmp/sosreport-my377-2024-03-26-gfzrskc.tar.xz

 Size   108.66KiB
 Owner  root
 md5    87350fcd4706cfc26a4d78f42e8770f1

Please send this file to your support representative.
@jcastill
Copy link
Member

Hi @hchybz
Unless I'm mistaken, -1 sets a timeout of 300 seconds, or 5 minutes, so lshw will still timeout. Can you try with:

--plugin-timeout=0

And see if that works? That should disable the timeout completely.

@hchybz
Copy link
Author

hchybz commented Mar 26, 2024

ok, will retry (seems i did not read carefully the help) `

but the output in my command shows timeout=None

[plugin:hardware] unpacked command: binary=False, changes=False, chroot=True, cmd='lshw', env=None, foreground=False, root_symlink=None, runat=None, sizelimit=25, stderr=True, subdir=None, suggest_filename=None, tags=[], timeout=None

root@my377[v240325]:/$ time lshw
real    6m23.410s
user    6m22.668s
sys     0m0.424s

@hchybz
Copy link
Author

hchybz commented Mar 26, 2024

Seems to work with --plugin-timeout=0

root@my377[v240325]:/$ sos report -o hardware  --plugin-timeout=0 -vv --batch --debug
...
[plugin:hardware] collected plugin 'hardware' in 389.36702013015747

  Finished running plugins

[archive:TarFileArchive] added open file to FileCacheArchive '/tmp/sos.eu8ey6sj/sosreport-my377-2024-03-26-llsddwa'
[archive:TarFileArchive] added open file to FileCacheArchive '/tmp/sos.eu8ey6sj/sosreport-my377-2024-03-26-llsddwa'
[archive:TarFileArchive] added open file to FileCacheArchive '/tmp/sos.eu8ey6sj/sosreport-my377-2024-03-26-llsddwa'
[archive:TarFileArchive] added string at 'version.txt' to FileCacheArchive '/tmp/sos.eu8ey6sj/sosreport-my377-2024-03-26-llsddwa'
[archive:TarFileArchive] added open file to FileCacheArchive '/tmp/sos.eu8ey6sj/sosreport-my377-2024-03-26-llsddwa'
[archive:TarFileArchive] added open file to FileCacheArchive '/tmp/sos.eu8ey6sj/sosreport-my377-2024-03-26-llsddwa'
[archive:TarFileArchive] added string at 'sos_reports/manifest.json' to FileCacheArchive '/tmp/sos.eu8ey6sj/sosreport-my377-2024-03-26-llsddwa'
Creating compressed archive...
[archive:TarFileArchive] finalizing archive '/tmp/sos.eu8ey6sj/sosreport-my377-2024-03-26-llsddwa' using method 'auto'
[archive:TarFileArchive] built archive at '/tmp/sos.eu8ey6sj/sosreport-my377-2024-03-26-llsddwa.tar' (size=7372800)

Your sosreport has been generated and saved in:
        /tmp/sosreport-my377-2024-03-26-llsddwa.tar.xz

 Size   127.12KiB
 Owner  root
 md5    804f2bed48ceabbd62e86c0a425c7572

Please send this file to your support representative.

question remains : 0 or -1 produces on my server the same debug output with timeout=None

[plugin:hardware] unpacked command: binary=False, changes=False,
chroot=True, cmd='lshw', env=None, foreground=False, root_symlink=None, 
runat=None, sizelimit=25, stderr=True, subdir=None, suggest_filename=None, tags=[], timeout=None

@TurboTurtle
Copy link
Member

-1 is the actual default we set on the plugins, so that it accepts the global timeout value (be it a user-supplied value or the default of 300 for that option).

The none timeout you're seeing in the logs is not the plugin timeout being set, but the timeout for the individual lshw command, which is not controlled by the plugin-level timeout option.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants