• Bug#1098698: linux: Segfault and system hang on larger network file tra

    From Salvatore Bonaccorso@21:1/5 to Paul DeKraker on Thu Feb 27 17:50:01 2025
    XPost: linux.debian.bugs.dist

    Hi Paul,

    On Mon, Feb 24, 2025 at 09:12:43PM -0500, Paul DeKraker wrote:
    I just tried 6.12.16 and as with 6.12.15 the system locks up / becomes
    completely unresponsive and needs to be hard reset when attempting a
    large transfer from a network share to the local computer, so this
    probably is a separate issue. If there is something more I can do to
    capture this behavior please let me know.

    Thanks for confirming. Yes, now we need to get fresh logs. As I
    understand it, you cannot access the system (not even remotely via
    SSH) to gather the kernel logs after the issue appears?

    In this case we might be successful if you attach a netconsole and
    have another system available which can capture the logs.
    Documentation can be found here: https://www.kernel.org/doc/html/latest/networking/netconsole.html#netconsole
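
    As a rough illustration of the capture side, here is a minimal sketch,
    assuming the second system simply listens for the UDP packets that
    netconsole sends and appends them to a file. The addresses, the
    interface name eth0, the file name netconsole.log and all function
    names are hypothetical placeholders; ports 6665/6666 are the defaults
    named in the netconsole documentation, and the parameter string
    follows the documented
    [src-port]@[src-ip]/[dev],[tgt-port]@<tgt-ip>/[tgt-macaddr] layout.

```python
import socket


def netconsole_param(src_port, src_ip, dev, tgt_port, tgt_ip, tgt_mac=""):
    """Build the value for the netconsole= module/boot parameter.

    Layout: [src-port]@[src-ip]/[dev],[tgt-port]@<tgt-ip>/[tgt-macaddr]
    An empty tgt_mac leaves the target MAC unset (kernel default).
    """
    return f"{src_port}@{src_ip}/{dev},{tgt_port}@{tgt_ip}/{tgt_mac}"


def open_listener(bind_ip="0.0.0.0", port=6666):
    """Open a UDP socket on the log-collecting machine."""
    sock = socket.socket(socket.AF_INET, socket.SOCK_DGRAM)
    sock.bind((bind_ip, port))
    return sock


def capture(sock, out, max_packets=None):
    """Write every received datagram to the binary file object `out`.

    Flushes after each packet so the last lines before a hang are kept.
    Runs forever unless max_packets is given. Returns the packet count.
    """
    count = 0
    while max_packets is None or count < max_packets:
        data, _sender = sock.recvfrom(65535)
        out.write(data)
        out.flush()
        count += 1
    return count


if __name__ == "__main__":
    # Hypothetical addresses: 192.0.2.10 is the machine that hangs,
    # 192.0.2.20 is the machine running this script.
    param = netconsole_param(6665, "192.0.2.10", "eth0", 6666, "192.0.2.20")
    print("On the affected machine: modprobe netconsole netconsole=" + param)
    with open_listener() as sock, open("netconsole.log", "ab") as out:
        capture(sock, out)
```

    Run the script on the collecting machine first, then load netconsole
    on the affected machine with the printed parameter and start the
    transfer. Any existing UDP listener should work equally well; the
    script only shows what needs to be received.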

    Would you be able to try that and provide fresh logs for the recent
    6.12.y kernels?

    Regards,
    Salvatore

    --- SoupGate-Win32 v1.05
    * Origin: fsxNet Usenet Gateway (21:1/5)
  • From Salvatore Bonaccorso@21:1/5 to Paul DeKraker on Sat Mar 1 15:20:02 2025
    XPost: linux.debian.bugs.dist

    Hi Paul,

    On Sat, Mar 01, 2025 at 08:21:58AM -0500, Paul DeKraker wrote:
    Here is a log collected via netconsole.

    Thanks,
    Paul

    On Thu, Feb 27, 2025 at 11:39 AM Salvatore Bonaccorso <carnil@debian.org> wrote:

    Hi Paul,

    On Mon, Feb 24, 2025 at 09:12:43PM -0500, Paul DeKraker wrote:
    I just tried 6.12.16 and as with 6.12.15 the system locks up / becomes
    completely unresponsive and needs to be hard reset when attempting a
    large transfer from a network share to the local computer, so this
    probably is a separate issue. If there is something more I can do to
    capture this behavior please let me know.

    Thanks for confirming. Yes, now we need to get fresh logs. As I
    understand it, you cannot access the system (not even remotely via
    SSH) to gather the kernel logs after the issue appears?

    In this case we might be successful if you attach a netconsole and
    have another system available which can capture the logs.
    Documentation can be found here:

    https://www.kernel.org/doc/html/latest/networking/netconsole.html#netconsole

    Would you be able to try that and provide fresh logs for the recent
    6.12.y kernels?

    Thanks a lot for the log, this was very helpful. At first glance it
    looks like this issue: https://lore.kernel.org/netfs/CAKPOu+_4mUwYgQtRTbXCmi+-k3PGvLysnPadkmHOyB7Gz0iSMA@mail.gmail.com/

    A patch has been submitted, but it does not look like it has been
    applied yet; I am checking.

    Might you be able to test the patch from https://lore.kernel.org/netfs/20250210191118.3444416-1-max.kellermann@ionos.com/
    to see if it fixes the issue?

    You can follow the procedure in https://kernel-team.pages.debian.net/kernel-handbook/ch-common-tasks.html#id-1.6.6.4
    to do the "simple patching and building".

    Regards,
    Salvatore
