On 27 Feb 2024, at 23:47, Gareth Evans <donotspam@fastmail.fm> wrote:some time.
On Tue 27/02/2024 at 22:52, David Christensen <dpchrist@holgerdanske.com> wrote:
...
These appear to be the ZFS packages for the available Debian releases:
https://packages.debian.org/buster/zfs-dkms
buster zfs-dkms (0.7.12-2+deb10u2)
buster-backports zfs-dkms (2.0.3-9~bpo10+1)
bullseye zfs-dkms (2.0.3-9+deb11u1)
bullseye-backports zfs-dkms (2.1.11-1~bpo11+1)
bookworm zfs-dkms (2.1.11-1)
bookworm-backports zfs-dkms (2.2.2-4~bpo12+1)
trixie zfs-dkms (2.2.2-4)
The question is, how far back to go? Is OpenZFS 2.1.x buggy? OpenZFS
2.0.x? What is 0.7.12 -- OpenZFS, ZFS-on-Linux, or something else --
and is it buggy?
This seems to be very "involved"! The discussion in #15526 suggests a coreutils upgrade (particularly re. "cp") in combination with the addition of the zpool block cloning feature seems to have triggered the issue, which may have gone undetected for
-- https://github.com/openzfs/zfs/issues/15526#issuecomment-1810472547After downgrading coreutils from 9.3 to 8.32, I am no longer able to reproduce this corruption.This seems to solve the corruption issue on my end too.
See also https://www.reddit.com/r/zfs/comments/1826lgs/psa_its_not_block_cloning_its_a_data_corruption/
Debian users can't follow the gentoo/emerge-based reproduction/trigger steps for build of golang in
https://github.com/openzfs/zfs/issues/15526 (for zfs 2.2.0)
and
https://github.com/openzfs/zfs/issues/15933 (for 2.2.3)
If anyone can recommend steps to debianise these (15933 seem most likely to be useful, and slightly different), I would be happy to test openzfs 2.2.2-4 from bookworm-backports on deb 12.5
Given that the original gentoo reporter, who seems to have tested extensively, considered the issue closed after upgrade to openzfs 2.2.2
https://bugs.gentoo.org/917224#c26
I wonder if the 2.2.3 issue is similar/related, or perhaps there are multiple triggers.
Watching with interest.
Best wishes,
Gareth
</blockquote><span>-- https://github.com/openzfs/zfs/issues/15526#issuecomment-1810472547</span><br><span></span><br><span>See also</span><br><span>https://www.reddit.com/r/zfs/comments/1826lgs/psa_its_not_block_cloning_its_a_data_corruption/</span><com/openzfs/zfs/issues/15933 (for 2.2.3)</span><br><span></span><br><span>If anyone can recommend steps to debianise these (15933 seem most likely to be useful, and slightly different), I would be happy to test openzfs 2.2.2-4 from bookworm-backports on
<span></span><br><span>Debian users can't follow the gentoo/emerge-based reproduction/trigger steps for build of golang in </span><br><span>https://github.com/openzfs/zfs/issues/15526 (for zfs 2.2.0)</span><br><span>and</span><br><span>https://github.
<span></span><br><span>I wonder if the 2.2.3 issue is similar/related, or perhaps there are multiple triggers.</span><br><span></span><br><span>Watching with interest.</span><br><span></span><br><span>Best wishes,</span><br><span>Gareth</span><br><span></span><br></div></blockquote><br><div>As anyone interested can see from the ref to #15933 in the below, there seems to have been considerable effort in getting to grips with this bug (actually multiple bugs), and it looks like a fix may be
As anyone interested can see from the ref to #15933 in the below, there seems to have been considerable effort in getting to grips with this bug (actually multiple bugs), and it looks like a fix may be forthcoming, though not sure at the time ofwriting if there may be some further polishing first
https://github.com/openzfs/zfs/pull/16019
On Fri 22/03/2024 at 21:01, Gareth Evans <donotspam@fastmail.fm> wrote:writing if there may be some further polishing first
As anyone interested can see from the ref to #15933 in the below, there seems to have been considerable effort in getting to grips with this bug (actually multiple bugs), and it looks like a fix may be forthcoming, though not sure at the time of
https://github.com/openzfs/zfs/pull/16019
https://github.com/openzfs/zfs/issues/15933
is now closed as completed with fix
https://github.com/openzfs/zfs/commit/102b468b5e190973fbaee6fe682727eb33079811
which for the moment necessarily adds synchronous writes.
FYI.
Gareth
On 3/25/24 15:05, Gareth Evans wrote:writing if there may be some further polishing first
On Fri 22/03/2024 at 21:01, Gareth Evans <donotspam@fastmail.fm> wrote:
As anyone interested can see from the ref to #15933 in the below, there seems to have been considerable effort in getting to grips with this bug (actually multiple bugs), and it looks like a fix may be forthcoming, though not sure at the time of
https://github.com/openzfs/zfs/pull/16019
https://github.com/openzfs/zfs/issues/15933
is now closed as completed with fix
https://github.com/openzfs/zfs/commit/102b468b5e190973fbaee6fe682727eb33079811
which for the moment necessarily adds synchronous writes.
FYI.
Gareth
Thank you for keeping an eye on this.
Looking at the github commit, the C code makes me worry -- it does not
appear to use traditional C/C++ thread-safe programming techniques such
as I learned in CS and used when I did systems programming (e.g. guard functions, critical sections, locks, semaphores, etc.).
Do I need to
look at more enclosing code to see such, are those techniques missing,
are there some newer techniques I do not understand, or something else?
Sysop: | Keyop |
---|---|
Location: | Huddersfield, West Yorkshire, UK |
Users: | 546 |
Nodes: | 16 (0 / 16) |
Uptime: | 156:57:49 |
Calls: | 10,384 |
Calls today: | 1 |
Files: | 14,056 |
Messages: | 6,416,474 |