Package: src:linuxdefrag_ipv6 nf_defrag_ipv4 nft_compat nf_tables nfnetlink overlay cpufreq_powersave cpufreq_ondemand cpufreq_conservative cpufreq_userspace sunrpc quota_v2 quota_tree binfmt_misc intel_rapl_msr nls_ascii nls_cp437 intel_rapl_common vfat snd_hda_codec_
Version: 6.1.135-1
Severity: important
X-Debbugs-Cc: mg-public-addr@protonmail.com
Dear Maintainer,
The system will log an error, followed by 100% cpu usage on one core, I believe by the kernel, which results in the message
```
[drm:dc_add_plane_to_context [amdgpu]] *ERROR* Head pipe not found for stream_state 00000000b7629c18 !
```
logged endlessly and as fast as the CPU can process to the kernel log.
The trigger for this issue is unclear to me, as it will not happen on every boot of the system, and can take hours, days or
weeks to appear after a reboot.
Other system functions appear to work as normal, or not degreded in a way I have noticed, as long as the logs are rotated.
Rebooting the system is my current approach when this error happens, and buys time until it occurs again. This system has run
without issue for >60 days on an affected kernel version, so I suspect there is no guarantee this bug will always appear.
The system is run as a mostly headless server, does not hibernate, sleep or suspend. It *is* connected to a TV via a HDMI cable,
that turns on and off throughout the day, and is one of the few inputs relevant to the amd gpu driver that I suspect could be a
trigger. This is connected to the motherboard HDMI connection, using the iGPU of a Ryzen 2200G
This system is running openmediavault (intalled from that install media), but I am logging here as I suspect it does not make
modifications to the kernel and core debian system.
The last time this was known to be stable for me was on the 5.10 kernels under bullseye, and on both bullseye and bookworm under
the 6.x kernel this issue has appeared.
I have captured two instances of this from separate dates included below. The last line is the one to repeat infinitely from
this point onwards. It is difficult to capture as the logs will quickly either fill up the hard drive, or get log rotated
out, which means that it has been hard to observe anything other than the final message in the logs! It has been recurring
around 6-10 times total in a 12 month period.
Please note that the last kernel log included by `reportbug` is on a fresh reboot of the system where this issue has not
occured yet and may be of no use - else it would only capture the spammed message log!
Logs from first time I caught the issue:
2024-07-19T20:17:35.445701+01:00 rhino kernel: [72604.746570] ------------[ cut here ]------------
2024-07-19T20:17:35.445717+01:00 rhino kernel: [72604.746574] WARNING: CPU: 1 PID: 56 at drivers/gpu/drm/amd/amdgpu/../display/dc/core/dc.c:3074 dc_update_planes_and_stream+0x342/0x870 [amdgpu]
2024-07-19T20:17:35.445720+01:00 rhino kernel: [72604.747029] Modules linked in: xt_nat xt_tcpudp veth xt_conntrack nf_conntrack_netlink xfrm_user xfrm_algo xt_addrtype br_netfilter bridge stp llc nft_chain_nat xt_MASQUERADE nf_nat nf_conntrack nf_
2024-07-19T20:17:35.445722+01:00 rhino kernel: [72604.747114] drm_kms_helper ecdh_generic dvb_core ecc mc rapl cfg80211 ccp wmi_bmof pcspkr sp5100_tco k10temp sg rfkill evdev acpi_cpufreq button wireguard libchacha20poly1305 chacha_x86_64 poly1305_x86_64 curve25519_x86_64 libcurve25519_generic libchacha ip6_udp_tunnel udp_tunnel softdog watchdog nct6775 drm nct6775_core hwmon_vid dm_mod fuse loop efi_pstore configfs ip_tables x_tables autofs4 ext4 crc16 mbcache jbd2 btrfs blake2b_generic zstd_compress
2024-07-19T20:17:35.445724+01:00 rhino kernel: [72604.747206] CPU: 1 PID: 56 Comm: kworker/1:1H Not tainted 6.1.0-23-amd64 #1 Debian 6.1.99-1ff ff 48
2024-07-19T20:17:35.445725+01:00 rhino kernel: [72604.747212] Hardware name: To Be Filled By O.E.M. To Be Filled By O.E.M./B450 Gaming-ITX/ac, BIOS P3.40 07/17/2019
2024-07-19T20:17:35.445737+01:00 rhino kernel: [72604.747215] Workqueue: events_highpri dm_irq_work_func [amdgpu]
2024-07-19T20:17:35.445738+01:00 rhino kernel: [72604.747659] RIP: 0010:dc_update_planes_and_stream+0x342/0x870 [amdgpu]
2024-07-19T20:17:35.445740+01:00 rhino kernel: [72604.748095] Code: 48 2b 14 25 28 00 00 00 0f 85 38 05 00 00 48 83 c4 50 5b 5d 41 5c 41 5d 41 5e 41 5f e9 57 4e 9a de 45 85 ed 0f 84 51 fe ff ff <0f> 0b 31 c0 eb ca 8b 93 50 06 00 00 83 fa 01 0f 84 68 fe
2024-07-19T20:17:35.445741+01:00 rhino kernel: [72604.748099] RSP: 0018:ffffbeb1c040f870 EFLAGS: 00010202
2024-07-19T20:17:35.445742+01:00 rhino kernel: [72604.748103] RAX: 0000000000000000 RBX: ffff962e90544000 RCX: 0000000000000000
I assume you cannot more specifically say when you saw the problem
first appearing in the 6.1.y series?
Would you be able to test newer stable series as well (ideally 6.12.y
as will be shipped in trixie or mainline kernel?)
I have searched the upstream issues in https://gitlab.freedesktop.org/drm/amd/-/issues and did not found
something directly matching your report. Could you please report it
upstream and report back the upstream issue back here so we can link
those?
Sice you are owning the hardware that would help speed up the
debugging by having you directly interacting with upstream on the
matter. Can you do that?
Control: tags -1 + moreinfodefrag_ipv6 nf_defrag_ipv4 nft_compat nf_tables nfnetlink overlay cpufreq_powersave cpufreq_ondemand cpufreq_conservative cpufreq_userspace sunrpc quota_v2 quota_tree binfmt_misc intel_rapl_msr nls_ascii nls_cp437 intel_rapl_common vfat snd_hda_codec_
Hi
thanks for your report.
On Mon, May 05, 2025 at 12:35:36PM +0100, mg wrote:
Package: src:linux
Version: 6.1.135-1
Severity: important
X-Debbugs-Cc: mg-public-addr@protonmail.com
Dear Maintainer,
The system will log an error, followed by 100% cpu usage on one core, I believe by the kernel, which results in the message
`[drm:dc_add_plane_to_context [amdgpu]] *ERROR* Head pipe not found for stream_state 00000000b7629c18 !`
logged endlessly and as fast as the CPU can process to the kernel log.
The trigger for this issue is unclear to me, as it will not happen on every boot of the system, and can take hours, days or
weeks to appear after a reboot.
Other system functions appear to work as normal, or not degreded in a way I have noticed, as long as the logs are rotated.
Rebooting the system is my current approach when this error happens, and buys time until it occurs again. This system has run
without issue for >60 days on an affected kernel version, so I suspect there is no guarantee this bug will always appear.
The system is run as a mostly headless server, does not hibernate, sleep or suspend. It is connected to a TV via a HDMI cable,
that turns on and off throughout the day, and is one of the few inputs relevant to the amd gpu driver that I suspect could be a
trigger. This is connected to the motherboard HDMI connection, using the iGPU of a Ryzen 2200G
This system is running openmediavault (intalled from that install media), but I am logging here as I suspect it does not make
modifications to the kernel and core debian system.
The last time this was known to be stable for me was on the 5.10 kernels under bullseye, and on both bullseye and bookworm under
the 6.x kernel this issue has appeared.
I have captured two instances of this from separate dates included below. The last line is the one to repeat infinitely from
this point onwards. It is difficult to capture as the logs will quickly either fill up the hard drive, or get log rotated
out, which means that it has been hard to observe anything other than the final message in the logs! It has been recurring
around 6-10 times total in a 12 month period.
Please note that the last kernel log included by `reportbug` is on a fresh reboot of the system where this issue has not
occured yet and may be of no use - else it would only capture the spammed message log!
Logs from first time I caught the issue:
2024-07-19T20:17:35.445701+01:00 rhino kernel: [72604.746570] ------------[ cut here ]------------
2024-07-19T20:17:35.445717+01:00 rhino kernel: [72604.746574] WARNING: CPU: 1 PID: 56 at drivers/gpu/drm/amd/amdgpu/../display/dc/core/dc.c:3074 dc_update_planes_and_stream+0x342/0x870 [amdgpu]
2024-07-19T20:17:35.445720+01:00 rhino kernel: [72604.747029] Modules linked in: xt_nat xt_tcpudp veth xt_conntrack nf_conntrack_netlink xfrm_user xfrm_algo xt_addrtype br_netfilter bridge stp llc nft_chain_nat xt_MASQUERADE nf_nat nf_conntrack nf_
Sysop: | Keyop |
---|---|
Location: | Huddersfield, West Yorkshire, UK |
Users: | 546 |
Nodes: | 16 (2 / 14) |
Uptime: | 151:41:02 |
Calls: | 10,383 |
Files: | 14,054 |
Messages: | 6,417,807 |