TNetXNGFile::Open error when running xAH with AnalysisBase 25.2.9 on lxplus EL9 node #1688
Comments
Hi @mamerl, Do you know if hist-data23_13p6TeV.00456225.physics_Main.deriv.DAOD_JETM1.f1369_m2185_p5994.root is an input file you are running on, or one of the output files? Or potentially something else? Best,
Hi @mdhank, From what I understand the
Thanks,
Hi Max, I believe TNetXNGFile is for xrootd, but I'm not sure why it would need xrootd to open the output files. Could you send your full command and config file? Best,
Hi Michael, Thanks for clarifying that. The config file we use is here: https://gitlab.cern.ch/tla-atlas-run3/tla-steering-run-3/-/blob/TLA-25.2.9-mamerl-dev/configs/onlineOverOffline/base_config_run3_withargs.py?ref_type=heads (I have added you to our TLA steering framework as a reporter so you can view the code). The command we run is:
Thanks,
Hi @mamerl, I ran some tests and was able to reproduce the error. It seems to occur whenever submitDir is on EOS (/eos/user/m/ vs. /eos/home-m/ makes no difference). If I use an AFS directory, there is no error. It's unclear whether the error actually causes any problems, but I would recommend changing the output location to be on the safe side. Best,
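For anyone who wants to guard against this locally, here is a minimal sketch of the workaround described above: check whether the requested submitDir resolves to somewhere under /eos and, if so, fall back to a non-EOS directory. The helper names `is_on_eos` and `choose_submit_dir` are hypothetical and not part of xAH, and the only assumption about EOS is that it is mounted under /eos, as in the paths quoted above.

```python
import os

# Assumption: EOS areas are mounted under /eos, as in the paths quoted in this thread.
EOS_PREFIX = "/eos/"


def is_on_eos(path):
    """Return True if `path` resolves to a location under /eos."""
    # realpath also follows symlinks, so a link that points into EOS is caught too.
    resolved = os.path.realpath(os.path.abspath(os.path.expanduser(path)))
    return resolved.startswith(EOS_PREFIX)


def choose_submit_dir(requested, fallback):
    """Hypothetical helper: return `fallback` when `requested` lives on EOS."""
    if is_on_eos(requested):
        print(f"warning: {requested} is on EOS; using {fallback} instead")
        return fallback
    return requested
```

The returned path could then be passed as the submit directory when launching the job. This only avoids the condition under which the error was reproduced; it does not explain why the error appears.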
Hi @mdhank, Thanks for clarifying that. At the moment, we only run locally from EOS for testing, so we won't be relying on any results from the local runs, and our large-scale jobs would be run on the grid. Would this error also affect submitting jobs to the grid with the Thanks,
Hi @mamerl, I don't think that would be a problem: it only gives the error when the output is on EOS, not the input. I'll also note that even having the output on EOS worked fine on a different file I tested, though I'm not sure why that makes a difference. Best,
Hi @mdhank, Thanks for pointing that out. In that case, we'll keep our workflow as it is and I'll test a few different jobs to see if I can replicate the same behaviour where we don't get the error when running on other files, etc. Thanks for your help! Cheers,
Hi, Just a follow up for documentation purposes. When I run jobs on the Grid with this setup (checking one job) I don't see the same error:
So this seems to suggest that the issue is just linked to accessing files on EOS. There was a similar error message for ROOT that was recently resolved here as well, but I'm not sure whether that can explain the error we see here since the file paths don't contain Cheers,
Hi,
When running xAH_run.py on an lxplus EL9 node I encountered the following error messages at the end of the job:
I'm not sure whether this is an error in our analysis framework or something configuration related in terms of the location we are running from, etc.
Do you have any suggestions regarding where this may be coming from and how we can solve it in case it poses an issue @mdhank, @tofitsch?
From what I can see in the test job I ran, we get the usual output files containing the histograms we book with our custom algorithms and these do get filled.
Thanks,
Max