"PCIe error as occurred" at boot with a Dell R230 server #27

Open
ralawa opened this issue Sep 13, 2022 · 10 comments

Comments

@ralawa

ralawa commented Sep 13, 2022

Hi,

I have just installed the PCIe adapter with the Coral dual TPU board in a Dell R230 server. The server does not boot and complains about a PCIe error. Do you think there is anything I can do to make it work?

The adapter works without any issue in a Dell desktop and the 2 TPUs are detected.

Thank you.

@magic-blue-smoke
Owner

Hi @ralawa
This would be the first known incompatible configuration if it can't be fixed.
My guess is that the server is trying some "smart" things that the adapter does not support, or that it can't fall back to Gen2 x1 mode.
Looking at the BIOS settings, I see an option called "Slot Disablement". From the description it's not quite clear to me whether it disables the slot completely or only ignores the Option ROM and UEFI drivers for it. Could you try this?

@ralawa
Author

ralawa commented Sep 14, 2022

Hi,

Thank you for your response.

When set to "Disabled", the server boots but the adapter is not detected by the OS, which is what the BIOS help says to expect.
When set to "Boot Driver Disabled", the server boots but the Linux kernel crashes during boot.

Regards.

@magic-blue-smoke
Owner

@ralawa thanks for trying these options.
When Linux crashes, are there any informative logs/messages?
Also, is it possible to try another PCIe slot?

@ralawa
Author

ralawa commented Sep 16, 2022

Hi,

The kernel panic logs are attached.
I can only use one slot; the other PCIe slot is used by the PERC RAID controller.
I also upgraded the BIOS, but the issue is the same.

I believe that there is nothing more to do.

Regards.

IMG_20220916_111753

@magic-blue-smoke
Owner

@ralawa I see. Please contact me using the form at the bottom of the page.

@reaperharvest

I have the same issue on a Dell R710.

@magic-blue-smoke
Owner

@reaperharvest unfortunately I don't have access to Dell servers to reproduce and diagnose the issue.
Please contact me using the form at the bottom of the page for a refund and/or alternative options.

@magic-blue-smoke
Owner

I see something similar on the Dell Support Forums. It seems to me that the PCIe root complex fails to fall back from x4 to x1, a fallback that the PCIe specification requires.
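
For anyone who wants to compare link negotiation on a machine where the adapter is detected (a desktop, an R720, or an R430, going by the reports in this thread), here is a minimal sketch that reads the negotiated and maximum link width and speed from Linux sysfs. It cannot run on a server that fails to boot, so it is only useful for comparison. The vendor ID `0x1ac1` is an assumption about what the Coral Edge TPU reports; please confirm it with `lspci -nn` first. The function names are made up for this example.

```python
from pathlib import Path

# Assumption: 0x1ac1 is the PCI vendor ID reported by the Coral Edge TPU.
# Confirm with `lspci -nn` on a machine where the adapter is detected.
CORAL_VENDOR_ID = "0x1ac1"

# Standard Linux sysfs attributes describing the PCIe link of a device.
LINK_ATTRIBUTES = (
    "current_link_speed",
    "current_link_width",
    "max_link_speed",
    "max_link_width",
)


def read_attribute(device_dir, name):
    """Return the stripped content of a sysfs attribute, or None if unreadable."""
    try:
        return (Path(device_dir) / name).read_text().strip()
    except OSError:
        return None


def link_status(vendor_id=CORAL_VENDOR_ID, sysfs_root="/sys/bus/pci/devices"):
    """Map PCI address -> link attributes for every device of the given vendor."""
    root = Path(sysfs_root)
    if not root.is_dir():
        return {}
    result = {}
    for device_dir in sorted(root.iterdir()):
        if read_attribute(device_dir, "vendor") != vendor_id:
            continue
        result[device_dir.name] = {
            name: read_attribute(device_dir, name) for name in LINK_ATTRIBUTES
        }
    return result


if __name__ == "__main__":
    status = link_status()
    if not status:
        print("No matching PCI device found")
    for address, attributes in status.items():
        print(address, attributes)
```

If the adapter negotiated correctly, I would expect `current_link_width` to read `1` for each TPU even when the slot itself is wider. `lspci -vv` shows the same information in its `LnkCap` and `LnkSta` lines.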

@chino-lu

Facing the same issue on an R330; an R720 is working fine.

@cbrherms

cbrherms commented Feb 8, 2024

Ran into the same issue on my R330, but the adapter appears to be detected fine on my R430. I guess I'll change my deployment plans and just use that hardware instead.
