In days of yore (Mon, 15 Apr 2024), Jamie thus quoth:
So there is a very nasty bug in the e1000e network card
driver.
I am running Debian 12 Bookworm. [snip]
You will get the message "Detected Hardware Unit Hang" and then
the network card just stops working.
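To put a number on how often the hang occurs, the kernel log can be searched for that message. This is only a sketch: `count_unit_hangs` is a hypothetical helper name, and it assumes the kernel log is readable through `journalctl -k`, which may need root.

```shell
# Count "Detected Hardware Unit Hang" lines in kernel log text read from stdin.
count_unit_hangs() {
    # grep -c prints 0 but exits non-zero when nothing matches, hence "|| true".
    grep -c 'Detected Hardware Unit Hang' || true
}

# Kernel messages from the current boot (assumes systemd's journal is in use).
{ journalctl -k -b --no-pager 2>/dev/null || true; } | count_unit_hangs
```

A count that keeps rising between checks while the link is in use would confirm that the hangs are still occurring.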
This is a gigabit network card. As I said, it is a built-in NIC; I believe it
is an Intel NIC.
This seems to happen when I am actually pushing a bit of traffic,
though not a lot, just a little bit. It isn't network overload
or anything; I am barely doing anything, really, but it will still do this.
I have already tried the following:
ethtool -K eth1 tx off rx off
ethtool -K eth1 tso off gso off
ethtool -K eth1 gso off gro off tso off tx off rx off rxvlan off txvlan off sg off
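Whether those settings actually took effect can be checked with the lowercase `ethtool -k`, which prints the current state of each feature. The following is a rough sketch; `offloads_still_on` is a hypothetical helper name, and `eth1` is simply the interface name used in the commands above.

```shell
# Print the features that `ethtool -k` output (read from stdin) still reports as on.
offloads_still_on() {
    # Matches "feature: on" and "feature: on [fixed]"; "|| true" covers the no-match case.
    grep -E ': on( \[fixed\])?$' || true
}

# Only query the card if ethtool is installed; errors for a missing eth1 are ignored.
if command -v ethtool >/dev/null 2>&1; then
    { ethtool -k eth1 2>/dev/null || true; } | offloads_still_on
fi
```

Features marked `[fixed]` cannot be changed by the driver, so they will keep showing up regardless of what was passed to `ethtool -K`.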
I have disabled all power management in the BIOS as well, including the setting for ASPM.
I added the following to the GRUB kernel command line:
pcie_aspm=off e1000e.SmartPowerDownEnable=0
This is in /etc/default/grub:
GRUB_CMDLINE_LINUX_DEFAULT="quiet pcie_aspm=off e1000e.SmartPowerDownEnable=0"
Then I ran update-grub as well.
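After a reboot, it is worth confirming that the running kernel really received those parameters, since an edit to /etc/default/grub only takes effect once update-grub has run and the machine has been restarted. A minimal sketch, assuming `/proc/cmdline` is readable; `has_param` is a hypothetical helper name.

```shell
# has_param CMDLINE PARAM: succeed if PARAM appears as a whole word in CMDLINE.
has_param() {
    case " $1 " in
        *" $2 "*) return 0 ;;
        *) return 1 ;;
    esac
}

# Command line of the kernel that is running right now (empty if unreadable).
cmdline=$(cat /proc/cmdline 2>/dev/null || true)

for p in pcie_aspm=off e1000e.SmartPowerDownEnable=0; do
    if has_param "$cmdline" "$p"; then
        echo "$p: present on the running kernel command line"
    else
        echo "$p: NOT present on the running kernel command line"
    fi
done
```

If either parameter is reported as not present, the hang has not yet been tested with that setting in place.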
None of this has worked in fixing this problem. I am still getting the
same issue.
Can you please fix this issue? This is a really nasty problem with Debian
12 (Bookworm).
I see this being reported back in kernel 5.3.x, but I am not seeing any reports about this issue for 6.1.x.
Debian Bug report logs - #945912
Kernel 5.3 e100e Detected Hardware Unit Hang https://bugs.debian.org/cgi-bin/bugreport.cgi?bug=945912
Please reply to confirm that you got this email and that you are looking into this problem.
It has been known to happen that drivers implement workarounds for issues
in the hardware itself, so that hardware bugs do not get tripped (or are tripped less often).
🙂
You make it sound like it's a rare occurrence, but it's actually
quite common. Most of it is discreet, so you'll rarely be exposed to it,
but `grep bugs /proc/cpuinfo` is one of the places where you can see it
being somewhat documented.
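For a more readable view of that `grep`, the flags can be split one per line. This is just a sketch: `list_cpu_bugs` is a hypothetical helper name, and it assumes an architecture (such as x86) whose /proc/cpuinfo carries a `bugs` line.

```shell
# Split the first "bugs" line of cpuinfo-style text (stdin) into one flag per line.
list_cpu_bugs() {
    awk -F': *' '/^bugs/ { n = split($2, f, " "); for (i = 1; i <= n; i++) print f[i]; exit }'
}

# Every core repeats the same line, so only the first one is used.
if [ -r /proc/cpuinfo ]; then
    list_cpu_bugs < /proc/cpuinfo
fi
```

Each flag names a hardware erratum that the kernel knows about on that CPU.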
Look, this is a kernel bug and Debian needs to
fix this! Don't give me any of this crap about upstream;
this is a bug with the Debian kernel!
This needs to be fixed!
I have already tried disabling the offloads and it does
not work.
It isn't the cable either; I have tried different cables and it
still happens! This is an issue with the kernel module for
the e1000e NIC.
This is a bug with the kernel that needs to be fixed in Debian!
I have already replaced it, but this bug needs to be fixed
by the Debian kernel team!