^8f3ce5b39 (kx 2023-10-28 12:00:06 +0300 1) # SPDX-License-Identifier: GPL-2.0
^8f3ce5b39 (kx 2023-10-28 12:00:06 +0300 2) config RAS_CEC
^8f3ce5b39 (kx 2023-10-28 12:00:06 +0300 3) bool "Correctable Errors Collector"
^8f3ce5b39 (kx 2023-10-28 12:00:06 +0300 4) depends on X86_MCE && MEMORY_FAILURE && DEBUG_FS
^8f3ce5b39 (kx 2023-10-28 12:00:06 +0300 5) help
^8f3ce5b39 (kx 2023-10-28 12:00:06 +0300 6) This is a small cache which collects correctable memory errors per 4K
^8f3ce5b39 (kx 2023-10-28 12:00:06 +0300 7) page PFN and counts their repeated occurrence. Once the counter for a
^8f3ce5b39 (kx 2023-10-28 12:00:06 +0300 8) PFN overflows, we try to soft-offline that page as we take it to mean
^8f3ce5b39 (kx 2023-10-28 12:00:06 +0300 9) that it has reached a relatively high error count and would probably
^8f3ce5b39 (kx 2023-10-28 12:00:06 +0300 10) be best if we don't use it anymore.
^8f3ce5b39 (kx 2023-10-28 12:00:06 +0300 11)
^8f3ce5b39 (kx 2023-10-28 12:00:06 +0300 12) Bear in mind that this is absolutely useless if your platform doesn't
^8f3ce5b39 (kx 2023-10-28 12:00:06 +0300 13) have ECC DIMMs and doesn't have DRAM ECC checking enabled in the BIOS.
^8f3ce5b39 (kx 2023-10-28 12:00:06 +0300 14)
^8f3ce5b39 (kx 2023-10-28 12:00:06 +0300 15) config RAS_CEC_DEBUG
^8f3ce5b39 (kx 2023-10-28 12:00:06 +0300 16) bool "CEC debugging machinery"
^8f3ce5b39 (kx 2023-10-28 12:00:06 +0300 17) default n
^8f3ce5b39 (kx 2023-10-28 12:00:06 +0300 18) depends on RAS_CEC
^8f3ce5b39 (kx 2023-10-28 12:00:06 +0300 19) help
^8f3ce5b39 (kx 2023-10-28 12:00:06 +0300 20) Add extra files to (debugfs)/ras/cec to test the correctable error
^8f3ce5b39 (kx 2023-10-28 12:00:06 +0300 21) collector feature. "pfn" is a writable file that allows user to
^8f3ce5b39 (kx 2023-10-28 12:00:06 +0300 22) simulate an error in a particular page frame. "array" is a read-only
^8f3ce5b39 (kx 2023-10-28 12:00:06 +0300 23) file that dumps out the current state of all pages logged so far.