Please find inline reply for your queries-
1) Who is the link partner in the customer's setup?
Link partner is End point device from microsemi.
2) Please provide the PCIe dump before the crash.
We can't take pcie dump before crash as we donot know when the dump comes.If possible we can take pcie dump just after booting the processor.
3) As per the customer's earlier response, they observed the machine check. However, we do not see the machine check in the log. Please share the complete log.
no machine check was observed in this case, only pcie dump was there and after some time processor goes into hung state.
4) What tests were running on the setup? At which point did the error occur?
We were configuring our end point device, the error comes randomly while accessing the device.
5) To take the PCIe register dump after the crash, try disabling the PCI Express CA completion in the PCI Express error disable register.
This settings needs to be disabled in uboot, or dynamically we have to modify kernel to disable CA completion error when PCIE register dump is detected.