path: root/drivers/gpu/drm/amd/amdgpu/amdgpu_ras.c
| Age | Commit message | Author | Files | Lines |
|---|---|---|---|---|
| 2024-04-27 | drm/amdgpu: Fix ras mode2 reset failure in ras aca mode | YiPeng Chai | 1 | -0/+4 |
| 2024-04-27 | drm/amdgpu: Use new interface to reserve bad page | YiPeng Chai | 1 | -3/+1 |
| 2024-04-27 | drm/amdgpu: Fix address translation defect | YiPeng Chai | 1 | -1/+1 |
| 2024-04-27 | drm/amdgpu: add poison consumption handler | YiPeng Chai | 1 | -4/+39 |
| 2024-04-27 | drm/amdgpu: Add delay work to retire bad pages | YiPeng Chai | 1 | -1/+35 |
| 2024-04-27 | drm/amdgpu: add interface to update umc v12_0 ecc status | YiPeng Chai | 1 | -0/+2 |
| 2024-04-27 | drm/amdgpu: add poison creation handler | YiPeng Chai | 1 | -7/+69 |
| 2024-04-27 | drm/amdgpu: prepare for logging ecc errors | YiPeng Chai | 1 | -0/+32 |
| 2024-04-27 | drm/amdgpu: add message fifo to handle RAS poison events | YiPeng Chai | 1 | -0/+35 |
| 2024-04-27 | drm/amdgpu: Add interface to reserve bad page | YiPeng Chai | 1 | -0/+19 |
| 2024-04-10 | drm/amdgpu: Set fatal errror detected flag earlier | Lijo Lazar | 1 | -13/+28 |
| 2024-03-22 | drm/amdgpu: add ras event id support for ACA | Yang Wang | 1 | -5/+6 |
| 2024-03-20 | drm/amdgpu: add aca deferred error type support | Yang Wang | 1 | -2/+6 |
| 2024-03-20 | drm/amdgpu: make reset method configurable for RAS poison | Tao Zhou | 1 | -2/+2 |
| 2024-03-20 | drm/amdgpu: add ras event id support | Yang Wang | 1 | -71/+136 |
| 2024-02-26 | drm/amdgpu: Fix ineffective ras_mask settings | Stanley.Yang | 1 | -0/+1 |
| 2024-02-26 | drm/amdgpu: Add fatal error detected flag | Lijo Lazar | 1 | -0/+32 |
| 2024-01-31 | drm/amdgpu: disable RAS feature when fini | Tao Zhou | 1 | -1/+1 |
| 2024-01-31 | drm/amdgpu: Update boot time errors polling sequence | Hawking Zhang | 1 | -1/+13 |
| 2024-01-25 | drm/amdgpu: Support passing poison consumption ras block to SRIOV | YiPeng Chai | 1 | -1/+1 |
| 2024-01-25 | drm/amdgpu: adjust aca init/fini sequence to match gpu reset | Yang Wang | 1 | -2/+13 |
| 2024-01-25 | drm/amdgpu: Fix module unload hang with RAS enabled | Mukul Joshi | 1 | -0/+4 |
| 2024-01-23 | drm/amdgpu: skip call ras_late_init if ras block is not supported | Yang Wang | 1 | -2/+5 |
| 2024-01-23 | drm/amdgpu:Support retiring multiple MCA error address pages | YiPeng Chai | 1 | -8/+35 |
| 2024-01-23 | drm/amdgpu: Use asynchronous polling to handle umc_v12_0 poisoning | YiPeng Chai | 1 | -0/+5 |
| 2024-01-23 | drm/amdgpu: Fix ras features value calltrace | Stanley.Yang | 1 | -5/+6 |
| 2024-01-23 | drm/amdgpu: Prepare for asynchronous processing of umc page retirement | YiPeng Chai | 1 | -0/+34 |
| 2024-01-18 | drm/amdgpu: Show deferred error count for UMC | Stanley.Yang | 1 | -2/+6 |
| 2024-01-18 | drm/amdgpu: fix UBSAN array-index-out-of-bounds for ras_block_string[] | Yang Wang | 1 | -1/+4 |
| 2024-01-16 | drm/amdgpu: Centralize ras cap query to amdgpu_ras_check_supported | Hawking Zhang | 1 | -77/+93 |
| 2024-01-16 | drm/amdgpu: Log deferred error separately | Candice Li | 1 | -20/+96 |
| 2024-01-16 | drm/amdgpu: add aca sysfs support | Yang Wang | 1 | -0/+15 |
| 2024-01-16 | drm/amdgpu: add amdgpu ras aca query interface | Yang Wang | 1 | -15/+90 |
| 2024-01-16 | drm/amdgpu: add ACA bank dump debugfs support | Yang Wang | 1 | -0/+14 |
| 2024-01-16 | drm/amdgpu: Add ras helper to query boot errors v2 | Hawking Zhang | 1 | -0/+95 |
| 2024-01-09 | drm/amdgpu: Packed socket_id to ras feature mask | Hawking Zhang | 1 | -0/+5 |
| 2024-01-09 | drm/amdgpu: Support poison error injection via ras_ctrl debugfs | Candice Li | 1 | -2/+5 |
| 2024-01-09 | drm/amdgpu: Drop unnecessary sentences about CE and deferred error. | Candice Li | 1 | -9/+5 |
| 2024-01-06 | Revert "drm/amdgpu: enable mca debug mode on APU by default" | Hawking Zhang | 1 | -2/+1 |
| 2024-01-03 | drm/amdgpu: Fix possible NULL dereference in amdgpu_ras_query_error_status_he... | Srinivasan Shanmugam | 1 | -0/+3 |
| 2024-01-03 | drm/amdgpu: Use kzalloc instead of kmalloc+__GFP_ZERO in amdgpu_ras.c | Srinivasan Shanmugam | 1 | -3/+3 |
| 2023-12-19 | drm/amdgpu: MCA supports recording umc address information | YiPeng Chai | 1 | -7/+15 |
| 2023-12-12 | Backmerge tag 'v6.7-rc5' into drm-next | Dave Airlie | 1 | -0/+17 |
| 2023-12-07 | drm/amdgpu: optimize the printing order of error data | Yang Wang | 1 | -0/+17 |
| 2023-12-01 | drm/amdgpu: enable mca debug mode on APU by default | Yang Wang | 1 | -1/+2 |
| 2023-11-30 | drm/amdgpu: Move mca debug mode decision to ras | Lijo Lazar | 1 | -3/+11 |
| 2023-11-17 | drm/amdgpu: fix ras err_data null pointer issue in amdgpu_ras.c | Yang Wang | 1 | -1/+1 |
| 2023-11-10 | drm/amdgpu: fix software pci_unplug on some chips | Vitaly Prosyak | 1 | -3/+6 |
| 2023-11-10 | drm/amdgpu: Support multiple error query modes | Hawking Zhang | 1 | -23/+70 |
| 2023-11-03 | drm/amdgpu: check recovery status of xgmi hive in ras_reset_error_count | Tao Zhou | 1 | -1/+10 |