• Re: Bug#1014908: ITP: gender-guesser -- Guess the gender from first nam

    From Steve McIntyre@21:1/5 to edward@4angle.com on Thu Jul 14 13:00:01 2022
    edward@4angle.com wrote:

    Package: wnpp
    Severity: wishlist
    Owner: Edward Betts <edward@4angle.com>
    X-Debbugs-Cc: debian-devel@lists.debian.org, debian-python@lists.debian.org

    * Package name : gender-guesser
    Version : 0.4.0
    Upstream Author : Israel Saeta Prez <israel@lead-ratings.com>
    * URL : https://github.com/lead-ratings/gender-guesser
    * License : GPL-3 & GFDL-1.2+
    Programming Lang: Python
    Description : Guess the gender from first name

    Oh, not *another* package that tries to guess things from names.

    Do you have a real use for this package? There are a *lot* of issues
    in this area, and mis-gendering people is not something to risk
    lightly...

    --
    Steve McIntyre, Cambridge, UK. steve@einval.com "We're the technical experts. We were hired so that management could
    ignore our recommendations and tell us how to do our jobs." -- Mike Andrews

    --- SoupGate-Win32 v1.05
    * Origin: fsxNet Usenet Gateway (21:1/5)
  • From Steve McIntyre@21:1/5 to Steve McIntyre on Thu Jul 14 14:00:01 2022
    Steve McIntyre wrote:
    edward@4angle.com wrote:

    Package: wnpp
    Severity: wishlist
    Owner: Edward Betts <edward@4angle.com>
    X-Debbugs-Cc: debian-devel@lists.debian.org, debian-python@lists.debian.org >>
    * Package name : gender-guesser
    Version : 0.4.0
    Upstream Author : Israel Saeta Prez <israel@lead-ratings.com>
    * URL : https://github.com/lead-ratings/gender-guesser
    * License : GPL-3 & GFDL-1.2+
    Programming Lang: Python
    Description : Guess the gender from first name

    Oh, not *another* package that tries to guess things from names.

    Do you have a real use for this package? There are a *lot* of issues
    in this area, and mis-gendering people is not something to risk
    lightly...

    And I see you uploaded ~immediately - why even bother with an ITP?

    --
    Steve McIntyre, Cambridge, UK. steve@einval.com "We're the technical experts. We were hired so that management could
    ignore our recommendations and tell us how to do our jobs." -- Mike Andrews

    --- SoupGate-Win32 v1.05
    * Origin: fsxNet Usenet Gateway (21:1/5)
  • From Marc Haber@21:1/5 to All on Thu Jul 14 14:20:01 2022
    On Thu, 14 Jul 2022 12:54:52 +0100, Steve McIntyre <steve@einval.com>
    wrote:
    And I see you uploaded ~immediately - why even bother with an ITP?

    Is it proper procedure to upload without an ITP?

    --
    -------------------------------------- !! No courtesy copies, please !! ----- Marc Haber | " Questions are the | Mailadresse im Header Mannheim, Germany | Beginning of Wisdom " |
    Nordisch by Nature | Lt. Worf, TNG "Rightful Heir" | Fon: *49 621 72739834

    --- SoupGate-Win32 v1.05
    * Origin: fsxNet Usenet Gateway (21:1/5)
  • From julien.puydt@gmail.com@21:1/5 to All on Thu Jul 14 14:40:01 2022
    Hi,

    Le jeudi 14 juillet 2022 à 14:16 +0200, Marc Haber a écrit :
    On Thu, 14 Jul 2022 12:54:52 +0100, Steve McIntyre <steve@einval.com>
    wrote:
    And I see you uploaded ~immediately - why even bother with an ITP?

    Is it proper procedure to upload without an ITP?


    No ; I have to admit a large percentage of the new packages I upload
    get their ITP minutes before the package is ready.

    Basically: I wait for the bug number before pushing to salsa & NEW.

    Cheers,

    J.Puydt

    --- SoupGate-Win32 v1.05
    * Origin: fsxNet Usenet Gateway (21:1/5)
  • From Johannes Schauer Marin Rodrigues@21:1/5 to All on Thu Jul 14 14:30:01 2022
    Quoting Steve McIntyre (2022-07-14 13:54:52)
    And I see you uploaded ~immediately - why even bother with an ITP?

    I did that quite a few times in the past as well. Is there a rule of how long I have to wait with my upload to NEW after filing the ITP? --==============Y79290324531790116=MIME-Version: 1.0
    Content-Transfer-Encoding: 7bit
    Content-Description: signature
    Content-Type: application/pgp-signature; name="signature.asc"; charset="us-ascii"

    -----BEGIN PGP SIGNATURE-----

    iQIzBAABCgAdFiEElFhU6KL81LF4wVq58sulx4+9g+EFAmLQCtcACgkQ8sulx4+9 g+Fi6Q//fW76s2jchRDaKbgZYr2gRyvt8rGQ/bQoCEzpTDM/wD+Qyu0eLGBXEssH PXREn6P2N8QwVcW4WBmIUcfiw5ZVYLetLE08+u2bCnNJlDRcGY8jSg3NEhTfQleg eGznmVSa52dpYjrXzsz+T/zxkkobjT2+wMJtR9lmhHtB4h10s5s115POOHfQHzuM 7gZ1cirXMiuq69f1tdBCrkgHdwbIZy1Ru3+Jx7tZPb7tcr/56Xm8q3HfIGrL8EDc PI63r7IGi14l16EF9YZY+OwPRbGq/SwRYFpFxUjpAT9ou+J5UX5UHJjj45XBJXkN MjrmR4lPp5Z63OK6cnQKP+R8XnIDnO/rJbTOE+KQpFLx+Zm9OU+JIOJ9bvnkYHQa hH9p/t9faXYcG4UVE5ybTzziTbfVEKNUD3R1MpgLKANWSLrDw+j+hlx10WiEwEKH iJB0Aiq2+tYEoyA+ZHBaSmc4uwvpLw9YVzfh3F8CQtBqpc+2CpLVZswNTy2rdiBX c0XMn2mFd9N6Gp3yWjRqWpX57AZtch4uEB6quRnPlWhrt+G/VaHruwEeh5AsWsOO /H+VU5zazQAKTd1qe9l1sEHZIdHseVB4FDolIPmPcUjPs5Peqng+KTH4wVFGn1wn ITZjGSkTFgXI2654oB1uzSco5cbGBtZWIZ7sbxBWciR2rLbomxw=
    =1gD8
    -----END PGP SIGNATURE-----

    --- SoupGate-Win32 v1.05
    * Origin: fsxNet Usenet Gateway (21:1/5)
  • From Roberto =?iso-8859-1?Q?C=2E_S=E1nch@21:1/5 to Steve McIntyre on Thu Jul 14 14:50:02 2022
    On Thu, Jul 14, 2022 at 11:14:43AM +0100, Steve McIntyre wrote:
    edward@4angle.com wrote:

    Package: wnpp
    Severity: wishlist
    Owner: Edward Betts <edward@4angle.com>
    X-Debbugs-Cc: debian-devel@lists.debian.org, debian-python@lists.debian.org

    * Package name : gender-guesser
    Version : 0.4.0
    Upstream Author : Israel Saeta Prez <israel@lead-ratings.com>
    * URL : https://github.com/lead-ratings/gender-guesser
    * License : GPL-3 & GFDL-1.2+
    Programming Lang: Python
    Description : Guess the gender from first name

    Oh, not *another* package that tries to guess things from names.

    Do you have a real use for this package?

    Why in the world is that even a relevant question? There are plenty of packages in the archive which are useful to essentially nobody apart
    from the maintainer and there are even packages which are maintained
    without being useful to the maintainer at all (but rather useful to
    others).

    There are a *lot* of issues
    in this area, and mis-gendering people is not something to risk
    lightly...


    "There are a *lot* of issues in this area" seems rather nebulous. In
    which area? Given the fact that we have clear and rather unambiguous guidelines for what constitutes software which is appropriate for
    inclusion in the archive, and given that on its face this software does
    not seem to be in conflict with any of those guidelines, what then is
    the problem? BTW, I'm not interested in any sort of "well I don't like
    ..." or "such and such could offend so and so ..." sort of arguments.

    Please provide an objective and technically-based reason for why this particular package should not be in the archive rather than hand-wavy
    arguments without any actual substance. Otherwise, it will appear as
    though you are simply attempting to conform everyone else to your own
    personal view on things. I think we can all agree that "there are a
    *lot* of issues" with such an approach.

    Regards,

    -Roberto

    --
    Roberto C. Snchez

    --- SoupGate-Win32 v1.05
    * Origin: fsxNet Usenet Gateway (21:1/5)
  • From Andrey Rahmatullin@21:1/5 to All on Thu Jul 14 14:50:02 2022
    On Thu, Jul 14, 2022 at 08:45:24AM -0400, Roberto C. Sánchez wrote:
    And I see you uploaded ~immediately - why even bother with an ITP?

    Is it proper procedure to upload without an ITP?


    No ; I have to admit a large percentage of the new packages I upload
    get their ITP minutes before the package is ready.

    Basically: I wait for the bug number before pushing to salsa & NEW.

    It's been a while since I've packaged something entirely new, but this
    is also how I have approached it. Especially during periods when it
    might take months to make it through NEW, waiting an additional week or
    two for discussion around an ITP (especially when the vast majority of
    ITPs actually never elecit any sort of response from anyone), seems
    rather pointless.

    Filing the ITP then immediately uploading seems really sensible,
    More sensible than not filing it?
    This defeats both purposes of an ITP: getting it discussed and working as
    a mutex for people who are thinking about packaging the same software. Are there other purposes?

    --
    WBR, wRAR

    -----BEGIN PGP SIGNATURE-----

    iQJhBAABCgBLFiEEolIP6gqGcKZh3YxVM2L3AxpJkuEFAmLQELMtFIAAAAAAFQAP cGthLWFkZHJlc3NAZ251cGcub3Jnd3JhckBkZWJpYW4ub3JnAAoJEDNi9wMaSZLh VY4QAJW4XebkhKBL3m9YyovTdBUiMyu0U71mUQHx0Yoj1Na0kTKmIXWUG3+pvWQL Cs2OUFJtWhDog57EofTr9AruUNPoGZBh6JGFm6IH0S3SAsS1eBP9tT3PbyDUA3o5 w+2o4OAUHRhvZ8ocrMWE4k5eD8O8PzJQjuW+tiGcAr9DbB2CVZRwLUuIGxi+MmqZ R7qlM5e8CL/Ebv9u3mCN9uRkB3nPhnb6uWFonh3xyN/tCrG5FkFm/6SPqYh8X/qg XSXzWsnNPMMFc5YqLgGgmbSa9Lr82H2tLr3VqYb4Reu6ZWTd2jWuHfXER04jndKB BCpaD9iQpflV3zbtWPWONQVy3WJdc4iQGxJ3DwVUKKMEgQKBLl4suyq6zzVFxFU7 b4gDyqN+osaX1U74jlafo6C8VHu2FVmVzq5ZUm/tKegtr1LF8itNrBSyX58a8gOd kmenH+8NqP1AVMGi7DRu4/K3Ua/wcZybxZZdZCUSsj3vu+aelm/I0vN+gI/2vkpk 6Z6xjcFVcmyL0zIkXP7audd4XZ7pLL7FdBCsPGls5wA3UVgv5BLpXuaSIJkpOFrj aMXMI8sKA/2M4mEkmSbMYMo97+q78Xa6LP0EM4xvVijgjDm551ZbTqgqZMun99cG hjFpMWe2ZP4bubCOm/+xkG9Xc0c/DjHkxzZv84eie+OSPHPI
    =FxyD
    -----END PGP SIGNATURE-----

    --- SoupGate-Win32 v1.05
    * Origin: fsxNet Usenet Gateway (21:1/5)
  • From Roberto =?iso-8859-1?Q?C=2E_S=E1nch@21:1/5 to julien.puydt@gmail.com on Thu Jul 14 14:50:02 2022
    On Thu, Jul 14, 2022 at 02:31:22PM +0200, julien.puydt@gmail.com wrote:
    Hi,

    Le jeudi 14 juillet 2022 14:16 +0200, Marc Haber a crit:
    On Thu, 14 Jul 2022 12:54:52 +0100, Steve McIntyre <steve@einval.com> wrote:
    And I see you uploaded ~immediately - why even bother with an ITP?

    Is it proper procedure to upload without an ITP?


    No ; I have to admit a large percentage of the new packages I upload
    get their ITP minutes before the package is ready.

    Basically: I wait for the bug number before pushing to salsa & NEW.

    It's been a while since I've packaged something entirely new, but this
    is also how I have approached it. Especially during periods when it
    might take months to make it through NEW, waiting an additional week or
    two for discussion around an ITP (especially when the vast majority of
    ITPs actually never elecit any sort of response from anyone), seems
    rather pointless.

    Filing the ITP then immediately uploading seems really sensible,
    especially since in the event of a mistake it is trivial to email
    ftp-master requesting a REJECT, which IME is usually something they do
    right away.

    Regards,

    -Roberto

    --
    Roberto C. Snchez

    --- SoupGate-Win32 v1.05
    * Origin: fsxNet Usenet Gateway (21:1/5)
  • From Steve McIntyre@21:1/5 to Marc Haber on Thu Jul 14 15:00:01 2022
    Marc Haber wrote:
    On Thu, 14 Jul 2022 12:54:52 +0100, Steve McIntyre <steve@einval.com>
    wrote:
    And I see you uploaded ~immediately - why even bother with an ITP?

    Is it proper procedure to upload without an ITP?

    IMHO there are 2 points to an ITP:

    * to save effort in case two people might be working on the same
    package
    * to invite discussion on debian-devel / elsewhere

    If people post an ITP and upload iummediately, then I don't think that
    helps on either count.

    If the only reason for the ITP is to make lintian quiet then I think
    that's a total waste of time - it's following a guideline blindly
    without understanding the reason for it.

    How do others feel?

    --
    Steve McIntyre, Cambridge, UK. steve@einval.com "We're the technical experts. We were hired so that management could
    ignore our recommendations and tell us how to do our jobs." -- Mike Andrews

    --- SoupGate-Win32 v1.05
    * Origin: fsxNet Usenet Gateway (21:1/5)
  • From Marco d'Itri@21:1/5 to Marc Haber on Thu Jul 14 15:10:01 2022
    On Jul 14, Marc Haber <mh+debian-devel@zugschlus.de> wrote:

    And I see you uploaded ~immediately - why even bother with an ITP?
    Is it proper procedure to upload without an ITP?
    Is there any point in an ITP if you are already ready to upload the
    package? No.

    --
    ciao,
    Marco

    -----BEGIN PGP SIGNATURE-----

    iHUEABYIAB0WIQQnKUXNg20437dCfobLPsM64d7XgQUCYtANaAAKCRDLPsM64d7X gaGVAP90U02JvKhYqyh2cr4FLKTe9u7s6+csiys/yWTUvbOLpwEAqI/g5e0YjsUs e6VLJcmjezC4knhQ19/FH+BA9HRe6AQ=
    =OlWA
    -----END PGP SIGNATURE-----

    --- SoupGate-Win32 v1.05
    * Origin: fsxNet Usenet Gateway (21:1/5)
  • From Roberto =?iso-8859-1?Q?C=2E_S=E1nch@21:1/5 to Andrey Rahmatullin on Thu Jul 14 15:20:01 2022
    On Thu, Jul 14, 2022 at 05:48:56PM +0500, Andrey Rahmatullin wrote:
    On Thu, Jul 14, 2022 at 08:45:24AM -0400, Roberto C. Snchez wrote:

    Filing the ITP then immediately uploading seems really sensible,
    More sensible than not filing it?
    This defeats both purposes of an ITP: getting it discussed and working as
    a mutex for people who are thinking about packaging the same software. Are there other purposes?

    Filing the ITP and then uploading immediately seems like it still fully
    allows for both things you describe.

    The discussion can take place as the package waits in NEW (which can be
    highly variable, from days to weeks, even to months). Revisions can be uploaded (if called for based on the discussion) without losing the
    place in NEW.

    As far as the mutex aspect, suppose I have some software that I want to package. I experiment and create a package before filing an ITP, for
    reasons, and then decide, "yes, I do want to upload this". First I
    search existing ITPs and see if someone has expressed an interest in
    working on this. If so, I communicate and coordinate with that person.
    If not, I file a new ITP. At that point, I am faced with a question,
    "how long to wait before uploading?" We can make the argument that
    whatever delay is chosen is likely to be insufficient for any of a
    number of reasons. So, then what's the difference with just uploading
    as soon as the ITP is filed? If someone comes along during the period
    where the package is in NEW and has an interest, then a simple "hey I'm
    also interested in packaging this, can we join forces?" seems like the
    thing to do.

    Perhaps then it might be that ITP should not be mandatory. If we
    substitue "search NEW and search open ITPs" for "search open ITPs" then
    the main reason to have ITPs would be for the instance where someone has
    the intention of packaging something but not until some time in the
    future. This might be because the person lacks sufficient time in the
    present, because upstream has not yet made a first release suitable for
    upload, or any of a number of other reasons. In any event, this seems
    like something that each maintainer can reasonably judge based on the circumstances.

    Regards,

    -Roberto

    --
    Roberto C. Snchez

    --- SoupGate-Win32 v1.05
    * Origin: fsxNet Usenet Gateway (21:1/5)
  • From Jeremy Bicha@21:1/5 to roberto@debian.org on Thu Jul 14 15:40:01 2022
    On Thu, Jul 14, 2022 at 3:12 PM Roberto C. Sánchez <roberto@debian.org> wrote:

    On Thu, Jul 14, 2022 at 05:48:56PM +0500, Andrey Rahmatullin wrote:
    On Thu, Jul 14, 2022 at 08:45:24AM -0400, Roberto C. Sánchez wrote:

    Filing the ITP then immediately uploading seems really sensible,
    More sensible than not filing it?
    This defeats both purposes of an ITP: getting it discussed and working as
    a mutex for people who are thinking about packaging the same software. Are there other purposes?

    Filing the ITP and then uploading immediately seems like it still fully allows for both things you describe.

    I also file ITP bugs and try to CC debian-devel as an announcement
    that the package is coming soon to Debian.

    I generally wait to file the ITP until my packaging is nearly ready in
    Salsa and provide the Salsa link in my ITP bug which makes it easy for
    someone to review it if they want.

    Thank you,
    Jeremy Bicha

    --- SoupGate-Win32 v1.05
    * Origin: fsxNet Usenet Gateway (21:1/5)
  • From Jeremy Bicha@21:1/5 to roberto@debian.org on Thu Jul 14 16:10:01 2022
    On Thu, Jul 14, 2022 at 2:41 PM Roberto C. Sánchez <roberto@debian.org> wrote:

    On Thu, Jul 14, 2022 at 11:14:43AM +0100, Steve McIntyre wrote:
    edward@4angle.com wrote:

    Package: wnpp
    Severity: wishlist
    Owner: Edward Betts <edward@4angle.com>
    X-Debbugs-Cc: debian-devel@lists.debian.org, debian-python@lists.debian.org

    * Package name : gender-guesser
    Version : 0.4.0
    Upstream Author : Israel Saeta Pérez <israel@lead-ratings.com>
    * URL : https://github.com/lead-ratings/gender-guesser
    * License : GPL-3 & GFDL-1.2+
    Programming Lang: Python
    Description : Guess the gender from first name

    Oh, not *another* package that tries to guess things from names.

    Do you have a real use for this package?

    Why in the world is that even a relevant question? There are plenty of packages in the archive which are useful to essentially nobody apart
    from the maintainer and there are even packages which are maintained
    without being useful to the maintainer at all (but rather useful to
    others).

    There are a *lot* of issues
    in this area, and mis-gendering people is not something to risk
    lightly...


    "There are a *lot* of issues in this area" seems rather nebulous. In
    which area? Given the fact that we have clear and rather unambiguous guidelines for what constitutes software which is appropriate for
    inclusion in the archive, and given that on its face this software does
    not seem to be in conflict with any of those guidelines, what then is
    the problem? BTW, I'm not interested in any sort of "well I don't like
    ..." or "such and such could offend so and so ..." sort of arguments.

    Debian has a Diversity Statement [1] which says that Debian welcomes
    people regardless of how they identify themselves. Trans people and
    non-binary people face a lot of discrimination, harrassment and
    bullying around the world. That bad treatment of these people is
    against Debian's core values. Therefore, the Debian Project wouldn't
    want to distribute software that appears to facilitate that kind of
    harassment, regardless of the software license it is released under.
    We might not want to distribute such software even if it also has
    non-harmful uses. We don't have to distribute *everything* ourselves.

    [1] https://www.debian.org/intro/diversity

    Thank you,
    Jeremy Bicha

    --- SoupGate-Win32 v1.05
    * Origin: fsxNet Usenet Gateway (21:1/5)
  • From Russ Allbery@21:1/5 to Steve McIntyre on Thu Jul 14 17:00:01 2022
    Steve McIntyre <steve@einval.com> writes:

    IMHO there are 2 points to an ITP:

    * to save effort in case two people might be working on the same
    package
    * to invite discussion on debian-devel / elsewhere

    If people post an ITP and upload iummediately, then I don't think that
    helps on either count.

    If the only reason for the ITP is to make lintian quiet then I think
    that's a total waste of time - it's following a guideline blindly
    without understanding the reason for it.

    How do others feel?

    I feel the same way, although I have for a long time been dubious of the benefit of debian-devel review for ITPs (and, to be honest, the benefit of
    WNPP in general apart from orphaning, although sometimes ITP and RFP bugs
    are a convenient central place to document all the reasons why packaging
    some specific piece of software is really hard), so the whole system feels
    kind of creaky to me.

    ITPs do occasionally catch things that really shouldn't be packaged, and
    we don't have another good mechanism for doing it. But the whole process
    as we currently follow it feels oddly dated and manual and sometimes like
    a box-ticking exercise. (It also adds a lot of noise to debian-devel from
    the perspective of, I suspect, most participants. But we've talked about
    that aspect of it before, and there was some moderate desire to see the
    new packages flow by.)

    Given that new packages as uploaded (a) include nearly all of the
    information in an ITP in a more structured form, and (b) have to flow
    through NEW anyway, I do sort of wonder if it would make sense to notify
    some mailing list of every new source package, extracting similar fields
    and the top entry of the changelog (which hopefully has some explanation
    for why the package is being packaged for Debian, and we could encourage
    people to do that), and then use the time the package sits waiting for NEW review as the window for people to raise concerns.

    That doesn't address the locking purpose of ITP (avoiding duplicate work).
    I'm not sure how frequently ITPs are effective at doing that. It feels
    like the percentage of the total software ecosystem that Debian is
    packaging is smaller than it used to be (we've grown but free software has grown way faster) and most of the places where I'd expect contention to
    happen are handled by language packaging teams that probably have their
    own processes.

    --
    Russ Allbery (rra@debian.org) <https://www.eyrie.org/~eagle/>

    --- SoupGate-Win32 v1.05
    * Origin: fsxNet Usenet Gateway (21:1/5)
  • From Sean Whitton@21:1/5 to Johannes Schauer Marin Rodrigues on Thu Jul 14 17:00:01 2022
    Hello,

    On Thu 14 Jul 2022 at 02:23PM +02, Johannes Schauer Marin Rodrigues wrote:

    Quoting Steve McIntyre (2022-07-14 13:54:52)
    And I see you uploaded ~immediately - why even bother with an ITP?

    I did that quite a few times in the past as well. Is there a rule of how long I
    have to wait with my upload to NEW after filing the ITP?

    No, in fact they're not even required. The Haskell team doesn't post
    them, for example.

    --
    Sean Whitton

    --- SoupGate-Win32 v1.05
    * Origin: fsxNet Usenet Gateway (21:1/5)
  • From Nilesh Patra@21:1/5 to julien.puydt@gmail.com on Thu Jul 14 16:50:01 2022
    On Thu, Jul 14, 2022 at 02:31:22PM +0200, julien.puydt@gmail.com wrote:
    Hi,

    Le jeudi 14 juillet 2022 14:16 +0200, Marc Haber a crit:
    On Thu, 14 Jul 2022 12:54:52 +0100, Steve McIntyre <steve@einval.com> wrote:
    And I see you uploaded ~immediately - why even bother with an ITP?

    Is it proper procedure to upload without an ITP?


    No ; I have to admit a large percentage of the new packages I upload
    get their ITP minutes before the package is ready.

    Basically: I wait for the bug number before pushing to salsa & NEW.

    I do exactly this and have never had a problem. I maintain a number of
    packages (like Julien does too) and push a bunch of stuff to new from time to time.

    Filing an ITP, waiting for it, having a discussion and then uploading is just beyond
    the time I have these days -- sorry.
    And needless to say there is always a possibility of rejecting a package if deemed in-appropriate.

    --
    Best,
    Nilesh

    -----BEGIN PGP SIGNATURE-----

    iQIzBAABCgAdFiEEPpmlJvXcwMu/HO6mALrnSzQzafEFAmLQLM8ACgkQALrnSzQz afHLYw//UZoMQDSj4tBmCW2Y693ywI4roZt1tIvFMpcDyfND4DvasKZsWS9Y7/K5 XXycYzAOpcZrrFTp5jU0CmVyqxrA8ONXOObSShIq0Lq5lER5k2Qg0cRqfhrXXFIg pmmtH1ID+Kv0Qk/eGu4Re6hP9obq9m93zkbtMnjufI3xe2Bh2qOye8wXpM5SC/oY x7wTvtB2H8nWC4hy3weQ1U4Q4IlpqFCcmYq+NltgCHeYr+Vc1jB3D0ORZxa2N6J4 iM9kf23DDeEZc9w79YQaCqId06glSVOP0gBPTotVlEyUB8ZpR6K3FHxmYuDghQbG /D/wyvCZ1W/gJunWlKOBlwesx9JV5qUlRVHAqIQch/BMmBO86WlK4nv0HzqONAOh 4AH+RKIVmGi51WWEn9grjWsd9wUVU+OVEkZ2rtBLRAliwSHb4sstAHWCuF4HADz0 oxdroy1T/VmFJ5m/lJcmvXWQGEnaZOdB5je/C6hhhIXIydaXoFCj3oUoUoG7j7Yy Be9CmTU984iD601QMu7lYUyaEbc76yvOoohH2nHPGrTiDCdOrUiQACarzIQ3HCFR J94IFaFSr954fZTlmuvpGfi1l2VCll6f2F3ilPfg3j9V1YEOn8jPF8ZQTBV1axdu VMcoztwQvE+NhKUEmaML4WV6sTcBCmYyYgSAx/V8kSNtf+kJfjA=
    =juJZ
    -----END PGP SIGNATURE-----

    --- SoupGate-Win32 v1.05
    * Origin: fsxNet Usenet Gateway (21:1/5)
  • From Sean Whitton@21:1/5 to Steve McIntyre on Thu Jul 14 17:10:01 2022
    Hello,

    On Thu 14 Jul 2022 at 01:52PM +01, Steve McIntyre wrote:

    IMHO there are 2 points to an ITP:

    * to save effort in case two people might be working on the same
    package
    * to invite discussion on debian-devel / elsewhere

    If people post an ITP and upload iummediately, then I don't think that
    helps on either count.

    Regarding the first point, in previous discussions others have said that
    they don't look at NEW but do look at ITPs, so they still want it posted
    for that reason.

    If the only reason for the ITP is to make lintian quiet then I think
    that's a total waste of time - it's following a guideline blindly
    without understanding the reason for it.

    Definitely.

    --
    Sean Whitton

    --- SoupGate-Win32 v1.05
    * Origin: fsxNet Usenet Gateway (21:1/5)
  • From Russ Allbery@21:1/5 to Edward Betts on Thu Jul 14 17:20:01 2022
    Edward Betts <edward@4angle.com> writes:

    I've been writing some code to work out the gender balance of speakers
    at a conference. It parses the pentabarf XML of the schedule and feeds
    the speaker names to this module.

    Here's the results for Debconf 22.

    72 speakers

    male 48 66.7%
    unknown 16 22.2%
    female 4 5.6%
    mostly_male 2 2.8%
    andy 1 1.4%
    mostly_female 1 1.4%

    I fear this may be an example of statistics that look meaningful but
    probably aren't because the error bar is much higher than the typical
    consumer of the statistic intuitively thinks it is. Although maybe that's
    not a worry in this case since the program itself says that it totally
    failed to make a guess about a quarter of the time.

    I don't really have any objections to the package being in the archive;
    this is certainly something that a lot of people seem to want to do and
    thus seem to find some utility in doing. But unless one has a
    higher-quality source of data than just names (preferred pronouns, direct self-identification, etc.), I personally would be worried about attaching
    the appearance of scientific accuracy (three significant figures!) to data that, depending on the nationalities involved and the strength of naming conventions and other factors, may be only rough guesswork.

    I know someone who keeps similar statistics as an aid to balancing the
    range of authors of books he chooses to review, and I see why someone
    would want to do that. But he tries to use higher-quality data sources
    than guessing based on names, and that feels like a best practice for that
    kind of thing to me.

    (Also, due to the limitations and history of naming conventions, the
    software is inherently trying to map into a gender binary, which if one is attempting to capture self-identification is likely to be unhelpful for
    many populations, such as ones with lots of people under 30, due to not
    having a way to represent nonbinary people.)

    Anyway, that's just all my personal opinion and I don't think any of that
    says that the package shouldn't be in the archive. We package all sorts
    of not-very-useful software and that's totally fine. But I've worked in identity management fields for long enough to have a professional
    knee-jerk reaction to anyone doing computer analysis of names or trying to record gender in any way other than simply asking people. :)

    --
    Russ Allbery (rra@debian.org) <https://www.eyrie.org/~eagle/>

    --- SoupGate-Win32 v1.05
    * Origin: fsxNet Usenet Gateway (21:1/5)
  • From Vincent Bernat@21:1/5 to Russ Allbery on Thu Jul 14 17:50:01 2022
    On 2022-07-14 17:14, Russ Allbery wrote:
    (Also, due to the limitations and history of naming conventions, the
    software is inherently trying to map into a gender binary, which if one is attempting to capture self-identification is likely to be unhelpful for
    many populations, such as ones with lots of people under 30, due to not having a way to represent nonbinary people.)

    This one does not. It maps a first name to male, female, androgynous,
    mostly make, mostly female, or unknown.

    --- SoupGate-Win32 v1.05
    * Origin: fsxNet Usenet Gateway (21:1/5)
  • From Jonathan Carter (highvoltage)@21:1/5 to Steve McIntyre on Thu Jul 14 18:00:01 2022
    On 2022/07/14 14:52, Steve McIntyre wrote:
    IMHO there are 2 points to an ITP:

    * to save effort in case two people might be working on the same
    package
    * to invite discussion on debian-devel / elsewhere

    If people post an ITP and upload iummediately, then I don't think that
    helps on either count.

    I believe an ITP is even helpful in that case. It's happened on many
    packages before that the package had an issue and not accepted by FTP
    team, and then eventually the ITP got renamed to an RFP. Also, if I want
    to package something, I (and tools like reportbug) check whether there
    are ITPs/RFPs filed. I /don't/ check whether the package is in NEW.

    So, in short, I think that filing ITPs is still a good practice and the
    times where it should be left out are really some edge / special cases.

    -Jonathan

    --- SoupGate-Win32 v1.05
    * Origin: fsxNet Usenet Gateway (21:1/5)
  • From Adam Borowski@21:1/5 to Jeremy Bicha on Thu Jul 14 18:50:01 2022
    On Thu, Jul 14, 2022 at 04:05:35PM +0200, Jeremy Bicha wrote:
    * Package name : gender-guesser

    Debian has a Diversity Statement [1] which says that Debian welcomes
    people regardless of how they identify themselves. Trans people and non-binary people face a lot of discrimination, harrassment and
    bullying around the world. That bad treatment of these people is
    against Debian's core values.

    Unless they're Jewish, believe that a woman should be allowed to abort a
    Down syndrome fetus, believe that there's more than just a name to the
    gender, or have a kind of transsexualism that matches their life's
    experiences and is detectable by brain imaging but the loud group says
    doesn't exist.

    The inconsistency here is astounding.

    Therefore, the Debian Project wouldn't
    want to distribute software that appears to facilitate that kind of harassment, regardless of the software license it is released under.
    We might not want to distribute such software even if it also has
    non-harmful uses.

    While not 100% accurate and thus shouldn't be used to determine the gender
    of an _individual_, it's a very useful tools for analyzing larger datasets.

    And where it comes to diversity, we so much need data rather than
    assumptions.


    Meow!
    --
    ⢀⣴⠾⠻⢶⣦⠀
    ⣾⠁⢠⠒⠀⣿⡁ I was born a dumb, ugly and work-loving kid, then I got swapped on
    ⢿⡄⠘⠷⠚⠋⠀ the maternity ward.
    ⠈⠳⣄⠀⠀⠀⠀

    --- SoupGate-Win32 v1.05
    * Origin: fsxNet Usenet Gateway (21:1/5)
  • From Adam Borowski@21:1/5 to Russ Allbery on Thu Jul 14 20:00:01 2022
    On Thu, Jul 14, 2022 at 08:14:13AM -0700, Russ Allbery wrote:
    Edward Betts <edward@4angle.com> writes:

    I've been writing some code to work out the gender balance of speakers
    at a conference. It parses the pentabarf XML of the schedule and feeds
    the speaker names to this module.

    Here's the results for Debconf 22.

    72 speakers

    male 48 66.7%
    unknown 16 22.2%
    female 4 5.6%
    mostly_male 2 2.8%
    andy 1 1.4%
    mostly_female 1 1.4%

    I fear this may be an example of statistics that look meaningful but
    probably aren't because the error bar is much higher than the typical consumer of the statistic intuitively thinks it is. Although maybe that's not a worry in this case since the program itself says that it totally
    failed to make a guess about a quarter of the time.

    So instead of making a knowingly-bad guess it says it doesn't know?
    That's an upside in my book.

    I don't really have any objections to the package being in the archive;
    this is certainly something that a lot of people seem to want to do and
    thus seem to find some utility in doing. But unless one has a
    higher-quality source of data than just names (preferred pronouns, direct self-identification, etc.)

    Real people who want to switch their visible gender (ie, how others view
    them) do pick a name that matches the gender they want to present to the
    world.


    As of actually using first names for statistics:
    Several years ago, I did stats on who does uploads in Debian.
    My methodology was:
    1. limit packages to "key packages" (RT meaning, ie popcon/d-i/{b-,}deps)
    2. take the last changed-by of every package (this avoids maintainers
    who haven't been seen in 20 years, etc)
    3. for every unique name, manually:
    a. do I recognize that person? If so, use gender I know.
    b. is the first name gender-specific? (I know western and slavic names)
    c. ~60 seconds of web search using DDG (I seemed to extend suspected
    females to >15 minutes somehow...)
    d. if none of the above gave an answer, say '?'
    4. weight every name by the # of packages from 2. (ie, give count of
    packages)

    Obviously every step introduces inaccuracies; eg. I used first-[mid...]-last name combinations, merging distinct spellings only when I spotted them by
    hand. I seem to recall there are two DDs with the same name (I don't
    remember who though), they'd be unified by this methodology. Of course there'll be no error if they're of the same gender but that's not the case
    for other uses of the input data.

    Thus, my stats are _not perfect_. But, as long as I divulge my methodology,
    it is sound science.

    A famous example is one of first phone surveys, that worked by randomly selecting phone numbers. The results turned out to be totally wrong -- with individual-owned phones being still a quite new thing, phone owners tended
    to be affluent and tech-friendly people, and their responses were not representative of the population at large.

    Thus, to be valid science, any use of statistics should disclose the methodology used. But, that doesn't make the results any less valid,
    it merely attaches a caveat. Barring some other error (eg. bogus random generator, ignoring people who hang up, etc), that survey still provided accurate info on the population of phone owners.


    Meow!
    --
    ⢀⣴⠾⠻⢶⣦⠀ Ash nazg durbatulûk,
    ⣾⠁⢠⠒⠀⣿⡁ ash nazg gimbatul,
    ⢿⡄⠘⠷⠚⠋⠀ ash nazg thrakatulûk
    ⠈⠳⣄⠀⠀⠀⠀ agh burzum-ishi krimpatul.

    --- SoupGate-Win32 v1.05
    * Origin: fsxNet Usenet Gateway (21:1/5)
  • From Russ Allbery@21:1/5 to Vincent Bernat on Thu Jul 14 19:30:01 2022
    Vincent Bernat <bernat@luffy.cx> writes:
    On 2022-07-14 17:14, Russ Allbery wrote:

    (Also, due to the limitations and history of naming conventions, the
    software is inherently trying to map into a gender binary, which if one
    is attempting to capture self-identification is likely to be unhelpful
    for many populations, such as ones with lots of people under 30, due to
    not having a way to represent nonbinary people.)

    This one does not. It maps a first name to male, female, androgynous,
    mostly make, mostly female, or unknown.

    Oh, is that what "andy" in the output meant? I thought that was some
    other quirk of the software, but in retrospect I should have figured that
    out.

    Thanks for the correction. Androgynous and nonbinary are not really the
    same thing, but at least the software is trying to incorporate that, to
    the extent that people's names reflect their gender at all.

    --
    Russ Allbery (rra@debian.org) <https://www.eyrie.org/~eagle/>

    --- SoupGate-Win32 v1.05
    * Origin: fsxNet Usenet Gateway (21:1/5)
  • From Wookey@21:1/5 to Steve McIntyre on Fri Jul 15 00:20:01 2022
    On 2022-07-14 13:52 +0100, Steve McIntyre wrote:
    If the only reason for the ITP is to make lintian quiet then I think
    that's a total waste of time - it's following a guideline blindly
    without understanding the reason for it.

    How do others feel?

    I mostly file ITPs because Lintian moans at me if I don't. I agree
    that ITPs just before uploading, mostly to keep lintian quiet, are
    largely makework.

    I do check them for other people working on stuff. I do file them for
    things that I expect to take a long time. Essentially I find ITPs
    useful for non-trivial packages, but not not very useful for trivial ones.

    Wookey
    --
    Principal hats: Debian, Wookware, ARM
    http://wookware.org/

    -----BEGIN PGP SIGNATURE-----

    iQIzBAABCgAdFiEER4nvI8Pe/wVWh5yq+4YyUahvnkcFAmLQlZQACgkQ+4YyUahv nkdCchAArOJeB7RJ2aWalVpEXuEd/WDZv9QChmPbtvZKrR902bNQ5Ui4gw2rA7T/ 5cKW7wIIhRaJ2A/jn4dtWlAz3/kcrZrUBdkxNHcLVQVySYnk1oLqQU6MJdmkkOio zHbyIPOVVfIHDcIefEkbGOwy22ChbPp1QHX2jNzX6P0QUY5shHJmJUYncP+HGi/8 TYCWNPOF/6dDGp1ceoam+zt0EFQx+wYwzLN595IKBeDppRK9dduaMZsUGMC9bFKf 84MUoqsJtQzvrh+JCnGcFRXDRHN1Nxk+SPtqf68hD2m2tqUlKurn0TUDieqrvxdO lg0Kt3QAjRKnF8nsKJaMY9XxrtLilYvDGPWda0sq5lopnZgqZJpeDT0JoGgbtJLL pr1kirMOiwAWekHlT2IzDS+wHoXdWgVM6NW1gmrTPOTN8UOR0BEpfmD+WXK4Bi6D Hk8DkUBb3ptwiaj9E1VY6L97WwWyil6NgQfdkp6Ko2WZoV1UybUjXzunrX4aLqrr ERAivCcWG69w4IASLV3yLhz3Mc4Ays26PJ2E7i+QADI6WGfVi3Ccpt0Ik+zDlohi zdAdK/oj73QIQz80yuMrVMn5dbmI1n8OdNLFi8LpXyTLNv6MGuS2m84vNaWuCkN7 SihagmvrpQ+ZExCDF4guhjpfNCDbUDBxZjKuX3egvU9eBgxLY0A=
    =tD6y
    -----END PGP SIGNATURE-----

    --- SoupGate-Win32 v1.05
    * Origin: fsxNet Usenet Gateway (21:1/5)
  • From Adrian Bunk@21:1/5 to Jeremy Bicha on Fri Jul 15 18:40:01 2022
    On Thu, Jul 14, 2022 at 04:05:35PM +0200, Jeremy Bicha wrote:
    ...
    Debian has a Diversity Statement [1] which says that Debian welcomes
    people regardless of how they identify themselves. Trans people and non-binary people face a lot of discrimination, harrassment and
    bullying around the world.

    Our Diversity Statement says that Debian "welcomes and encourages
    participation by everyone".

    People who express how they identify themselves by having a swastika
    tattoo on their forehead also face a lot of discrimination, harrassment
    and bullying around the world. Our Diversity Statement makes it clear
    that we are welcoming and encouraging their participation and are not
    ourselves discriminating against them.

    That bad treatment of these people is
    against Debian's core values.
    ...

    Our Diversity Statement says that we "welcome contributions from
    everyone as long as they interact constructively with our community".

    Debian does not have core values regarding how people are treated
    outside Debian.

    Debian is not a project that fights for trans people or fights for denazification or fights for whatever other non-technical topics
    individual contributors might consider worth fighting for elsewhere.

    Diversity means that in any kinds of conflicts people on all sides
    are encouraged to contribute to Debian as long as they interact
    constructively with our community.

    Therefore, the Debian Project wouldn't
    want to distribute software that appears to facilitate that kind of harassment, regardless of the software license it is released under.
    We might not want to distribute such software even if it also has
    non-harmful uses.
    ...

    The exact opposite of diversity is to call everything one dislikes or
    disagrees with "harassment" or *phobic.

    Thank you,
    Jeremy Bicha

    cu
    Adrian

    --- SoupGate-Win32 v1.05
    * Origin: fsxNet Usenet Gateway (21:1/5)
  • From Andrew M.A. Cater@21:1/5 to Adrian Bunk on Fri Jul 15 21:20:01 2022
    On Fri, Jul 15, 2022 at 07:05:09PM +0300, Adrian Bunk wrote:
    On Thu, Jul 14, 2022 at 04:05:35PM +0200, Jeremy Bicha wrote:
    ...
    Debian has a Diversity Statement [1] which says that Debian welcomes
    people regardless of how they identify themselves. Trans people and non-binary people face a lot of discrimination, harrassment and
    bullying around the world.

    Our Diversity Statement says that Debian "welcomes and encourages participation by everyone".


    Correct. That implies useful, constructive participation in accordance with
    our community values and not being divisive for the sake of it.

    gender-guesser may have a negative effect on some of our community of contributors and users. It probably doesn't help people who are gender-fluid, non-binary or trans who feel as if they are being categorised by their names.

    The categorisation isn't aware of diversity in language - as it stands, it appears biased to Western languages and only a few of them. The world has
    more cultural groups and lignuistic categories to take account of.

    Consider -

    Debconf is on in Kosovo right now. If I had to work out Albanian
    gender mappings from names, I'd have no clue.

    Then S. Indian - Malayalam character sets?? and names from a number of
    Indian languages then Israel and Hebrew/Arabic
    Taiwan had Chinese character sets and names

    At what point is this useful for a very small subset of the world's population?

    People who express how they identify themselves by having a swastika
    tattoo on their forehead also face a lot of discrimination, harrassment
    and bullying around the world. Our Diversity Statement makes it clear
    that we are welcoming and encouraging their participation and are not ourselves discriminating against them.


    A swastika tattoo on the forehead would be mostly invisible unless meeting someone in person. I couldn't guarantee how people would react on first
    meeting such a person - I'm assuming this is a straw man for argument
    purposes.

    That bad treatment of these people is
    against Debian's core values.
    ...

    Our Diversity Statement says that we "welcome contributions from
    everyone as long as they interact constructively with our community".

    Debian does not have core values regarding how people are treated
    outside Debian.


    No, but it has very clear core values regarding how they are treated by
    and within Debian. The adverse effects on some of our people and our
    users probably outweigh the usefulness of such a script, even if it
    is accurate and useful to all.

    Debian is not a project that fights for trans people or fights for denazification or fights for whatever other non-technical topics
    individual contributors might consider worth fighting for elsewhere.


    It does fight for under-represented / disadvantaged groups within Debian in a Debian context.

    Diversity means that in any kinds of conflicts people on all sides
    are encouraged to contribute to Debian as long as they interact constructively with our community.


    I would ask you (Adrian) to consider whether your mailing list
    message is a fully
    constructive interaction with our community and its values here or is just seeking to stir up opinions and arguments for the sake of it.
    The Code of Conduct calls for a constructive and considered approach to debate on our lists.

    Therefore, the Debian Project wouldn't
    want to distribute software that appears to facilitate that kind of harassment, regardless of the software license it is released under.
    We might not want to distribute such software even if it also has non-harmful uses.
    ...

    The exact opposite of diversity is to call everything one dislikes or disagrees with "harassment" or *phobic.


    I wonder how it would be if you wanted to use a similar script to test familiarity with English in our developers / a test for neurodiversity
    and high functioning autism / a test for colour vision or dexterity to
    single out anybody who's visually impaired or blind or a guess for
    background religion/beliefs/no belief - I don't think any of these (hypothetical, straw man) scripts would be useful or constructive or
    contribute well to our Debian community.

    Thank you,
    Jeremy Bicha

    cu
    Adrian


    With every good wish, as ever,

    Andy Cater

    --- SoupGate-Win32 v1.05
    * Origin: fsxNet Usenet Gateway (21:1/5)
  • From Tzafrir Cohen@21:1/5 to Andrew M.A. Cater on Fri Jul 15 22:10:02 2022
    On Fri, Jul 15, 2022 at 07:10:55PM +0000, Andrew M.A. Cater wrote:

    Debconf is on in Kosovo right now. If I had to work out Albanian
    gender mappings from names, I'd have no clue.

    I decided to take a random name: the President of Kosovo. In Wikipedia I
    see it is Vjosa Osmani, a name completely unfamiliar to me.

    print(d.get_gender(u"Vjosa"))
    female

    This time the guess happened to be correct.


    Then S. Indian - Malayalam character sets?? and names from a number of
    Indian languages then Israel and Hebrew/Arabic
    Taiwan had Chinese character sets and names

    I tried various Hebrew names: names in Hebrew letters are always
    unknown. My name (a rare one) is unknown even in Latin letters. However
    some common Hebrew names in Latin letters are detected correctly.

    I have not tried to do any proper sampling or experiment. Just a random
    data point.

    --
    mail / xmpp / matrix: tzafrir@cohens.org.il

    --- SoupGate-Win32 v1.05
    * Origin: fsxNet Usenet Gateway (21:1/5)
  • From Marvin Renich@21:1/5 to All on Sat Jul 16 01:10:01 2022
    * Jeremy Bicha <jeremy.bicha@canonical.com> [220714 10:06]:
    On Thu, Jul 14, 2022 at 2:41 PM Roberto C. Snchez <roberto@debian.org> wrote:

    On Thu, Jul 14, 2022 at 11:14:43AM +0100, Steve McIntyre wrote:
    edward@4angle.com wrote:

    Package: wnpp
    Severity: wishlist
    Owner: Edward Betts <edward@4angle.com>
    X-Debbugs-Cc: debian-devel@lists.debian.org, debian-python@lists.debian.org

    * Package name : gender-guesser
    Version : 0.4.0
    Upstream Author : Israel Saeta Prez <israel@lead-ratings.com>
    * URL : https://github.com/lead-ratings/gender-guesser
    * License : GPL-3 & GFDL-1.2+
    Programming Lang: Python
    Description : Guess the gender from first name

    Oh, not *another* package that tries to guess things from names.

    Do you have a real use for this package?

    Why in the world is that even a relevant question? There are plenty of packages in the archive which are useful to essentially nobody apart
    from the maintainer and there are even packages which are maintained without being useful to the maintainer at all (but rather useful to others).

    There are a *lot* of issues
    in this area, and mis-gendering people is not something to risk lightly...


    "There are a *lot* of issues in this area" seems rather nebulous. In
    which area? Given the fact that we have clear and rather unambiguous guidelines for what constitutes software which is appropriate for
    inclusion in the archive, and given that on its face this software does
    not seem to be in conflict with any of those guidelines, what then is
    the problem? BTW, I'm not interested in any sort of "well I don't like ..." or "such and such could offend so and so ..." sort of arguments.

    Debian has a Diversity Statement [1] which says that Debian welcomes
    people regardless of how they identify themselves. Trans people and non-binary people face a lot of discrimination, harrassment and
    bullying around the world. That bad treatment of these people is
    against Debian's core values. Therefore, the Debian Project wouldn't
    want to distribute software that appears to facilitate that kind of harassment, regardless of the software license it is released under.
    We might not want to distribute such software even if it also has
    non-harmful uses. We don't have to distribute *everything* ourselves.

    People within the Debian community have a right to expect that others in
    the community will not bully, harass, or denigrate them. They do _not_
    have any right to expect that others will not offend them by discussing
    or making contributions that espouse values that are different and
    incompatible with their own. Such an expectation assumes that one set
    of values is correct and the other is wrong. In order for such an
    expectation to be met, only one of the two sets of values could exist
    within Debian.

    Saying that gender-guesser should not be packaged within Debian (using
    the excuse given early in this thread) is excluding a contribution based
    on the values to which that package adheres and possibly the contributor
    and the users who would like to use it. This is contrary to being
    inclusive.

    Being offended by someone else's civil expression of their values
    (including the packaging of a particular piece of software) is not the
    same as being bullied or denigrated. Please stop trying to use the
    excuse "it might offend someone" to block participation or inclusion of software. Instead, be inclusive and acknowledge that others' values may
    be different from and incompatible with yours, and accept that Debian is
    a collection of software from diverse sources and some of it may not
    adhere to your values.

    This is the difference between true inclusiveness and the false
    "political correctness" that seems to be permeating our society today.

    When we can all say, "I disagree with your values, but I accept you as a
    Debian contributor," then we will be truly inclusive.

    ...Marvin

    --- SoupGate-Win32 v1.05
    * Origin: fsxNet Usenet Gateway (21:1/5)
  • From Ansgar@21:1/5 to Steve McIntyre on Sat Jul 16 10:10:01 2022
    On Sat, 2022-07-16 at 08:51 +0100, Steve McIntyre wrote:
    Are you actually somehow claiming that Debian's core values include
    bad treatment of Jews and those other groups? Seriously, WTF?

    We had project members supporting antisemitic groups on project lists
    and DebConf events, for example in discussions about a DebConf taking
    place in a certain location. Not much happened as a result.

    (FWIW, it was said project members even went so far to try to get
    support for having sponsors not sponsor that DebConf, i.e., directly
    working against the project. Also seems to be fine.)

    Ansgar

    --- SoupGate-Win32 v1.05
    * Origin: fsxNet Usenet Gateway (21:1/5)
  • From Steve McIntyre@21:1/5 to Adam Borowski on Sat Jul 16 10:00:01 2022
    Adam Borowski wrote:
    On Thu, Jul 14, 2022 at 04:05:35PM +0200, Jeremy Bicha wrote:
    * Package name : gender-guesser

    Debian has a Diversity Statement [1] which says that Debian welcomes
    people regardless of how they identify themselves. Trans people and
    non-binary people face a lot of discrimination, harrassment and
    bullying around the world. That bad treatment of these people is
    against Debian's core values.

    Unless they're Jewish, believe that a woman should be allowed to abort a
    Down syndrome fetus, believe that there's more than just a name to the >gender, or have a kind of transsexualism that matches their life's >experiences and is detectable by brain imaging but the loud group says >doesn't exist.

    The inconsistency here is astounding.

    I genuinely have no clue what you're trying to say here.

    Are you actually somehow claiming that Debian's core values include
    bad treatment of Jews and those other groups? Seriously, WTF?

    --
    Steve McIntyre, Cambridge, UK. steve@einval.com "We're the technical experts. We were hired so that management could
    ignore our recommendations and tell us how to do our jobs." -- Mike Andrews

    --- SoupGate-Win32 v1.05
    * Origin: fsxNet Usenet Gateway (21:1/5)
  • From Holger Levsen@21:1/5 to Stefano Rivera on Sat Jul 16 11:50:01 2022
    On Sat, Jul 16, 2022 at 09:12:16AM +0000, Stefano Rivera wrote:
    If you're asking about DebConf 22, we have that information:
    [...]
    I guess we should expose this in our conference statistics. We care
    about it.

    but why? how is gender relevant for participating in DebConf as
    a whole? (i can see how it could be relevant for some events, but
    not for the whole conference.)

    society should be *less* about gender, sex, race, etc, not more.

    that's at least for me the reason why I usually select "decline
    to state". my gender is none of your business for running DebConf.


    --
    cheers,
    Holger

    ⢀⣴⠾⠻⢶⣦⠀
    ⣾⠁⢠⠒⠀⣿⡁ holger@(debian|reproducible-builds|layer-acht).org
    ⢿⡄⠘⠷⠚⠋⠀ OpenPGP: B8BF54137B09D35CF026FE9D 091AB856069AAA1C
    ⠈⠳⣄

    There are no jobs on a dead planet. (Also many other things but people mostly seem to care about jobs.)

    -----BEGIN PGP SIGNATURE-----

    iQIzBAABCgAdFiEEuL9UE3sJ01zwJv6dCRq4VgaaqhwFAmLSiBsACgkQCRq4Vgaa qhxcBg//WkiT1pwRfqCgTs7DerpFibS4YCG6WK3rR1iJyKY6k26QlGdtyVevsFun nrg/RyqMXz/hrUKcb6LinGUfZy0EbrL6XAgSvWQw0L19TUqrSXvi+EbXoRc2eiUE Xst0BL2EUmY+DBfuYMFozfDSf1Px/Vnz94Od1Fu6uy5BDhN8rmbtSHP5NWwXUQjO jsge3INQk8D7B0eX95Wntj9+Ys+KyEOwONlmy9L9a0f1Vzu8dFlV6w2qLd0LY63G AzvwSaRZ1DYR8C7ba2k5oAvEdIqO0Wk8xbJq0i7OsYbOMZOY2d4C4Uwt+Ho4+Tkc IzJw9cDKm0yqix7pKbDTAtFRtMcGh9w+D2PhUV9oNCL1iPH7ivhQyGPma8gtjoe5 1vpHhYuYq7IpcS59ppvR3i3GsM1AxmrNYbeRx0TtJiRwJVTYbHVMIGUxjdKk8UUT NPzRNJrWK55grRKtBGTDEq18KFy7uZPDiuYNPg1GFUl9INL7dhBfe/6Ecr+VZDRo sUQUpxEYkbOtRvcb4ylV8Y5VOEKoCNxk/YvSnRxTH4acL1aujxeaRFKvgwLaLlHN
    TAZK7T/j
  • From Christoph Biedl@21:1/5 to All on Sat Jul 16 11:50:01 2022
    Steve McIntyre wrote...

    IMHO there are 2 points to an ITP:

    * to save effort in case two people might be working on the same
    package

    And having such a lock is a good thing.

    * to invite discussion on debian-devel / elsewhere

    Which might include reactions like:

    * There is already a package that does the same thing. Do we really need
    a duplicate?
    * It's already packaged, possibly under an obscure name or within
    another package.
    * There are issues with the license, the package description and
    similar things.

    If people post an ITP and upload iummediately, then I don't think that
    helps on either count.

    Indeed. And if it's about "Getting through NEW takes so much time", then
    giving it a few days more instead of increasing the risk of a REJECT is
    even the better way around.

    How do others feel?

    To me, uploading immediately after ITP signalizes "I am 100% sure this
    package and my packages will certainly not create objections of any
    kind". Which I find somewhere between highly optimistic and plain
    arrogant.

    So, in my opintion, as a rule of thumb, have a time of three days
    between these two actions.

    Christoph

    -----BEGIN PGP SIGNATURE-----

    iQIzBAABCgAdFiEEWXMI+726A12MfJXdxCxY61kUkv0FAmLSh9cACgkQxCxY61kU kv0J5w/+Js510OpBG3UPxym9ztsWehnxmu2rpWg/H6NvvoCt5t5hVM4T+P3wyD9V /1alUOrQEsMuFGgPr2xMddFjsOQFwb1qEqMNvivjq+mJfblgbebm4maicgvwQQuL R5jiOKI3pZ0t1X7n19W2RK3tHVcKlHr4UTsrDqSASqn1xeu+5nevRcy43edTBGDO /InGnbSkLnaQuPc1YrEF7nbpTOePrbvzJ6weE6NE8ZVhraSt+ZSdZtIZdwablZRG tgqwZ7DMDrBvysM7NYCDS061vKJY/tLph6VS0avsA+cUrIw3hujE4DX6yV71+CvA y815oKRVDHNsASckHztvL7MSMF3JTvkO0djZczSDpP92hr2hRDqqi3T4KCLmGGNI JoyWBmqCLYW7nLqHixgyTE031WBD1tMQmN16OjZxsuFyyNwjh1xfrf8ToJHU3kgy oGKEtxUbCcMMfZ+kKWMDAmNuWKnTEufluChTCtR5TmcWMXx10AU7VB2SMBtmdFcl 7OpIWtYkWleBZC0OoxGh0AYTIrxaPeQZNMjHhlkzEK00vSSpGWxImfWiXcrKlCBU 5mE1fHgEZoQKMwXmpAeqzYn34R5djj+eqsJHizlF2dIg4MoQk3VMelKd1de/noWE 9Su1oQuHmZv8eVqKebCoFowWPhYgQLcwSAYnhEHsHoqmKGFxZpA=
    =6zXn
    -----END PGP SIGNATURE-----

    --- SoupGate-Win32 v1.05
    * Origin: fsxNet Usenet Gateway (21:1/5)
  • From Steve McIntyre@21:1/5 to Adrian Bunk on Sat Jul 16 12:00:02 2022
    Adrian Bunk wrote:
    On Thu, Jul 14, 2022 at 04:05:35PM +0200, Jeremy Bicha wrote:
    ...
    Debian has a Diversity Statement [1] which says that Debian welcomes
    people regardless of how they identify themselves. Trans people and
    non-binary people face a lot of discrimination, harrassment and
    bullying around the world.

    Our Diversity Statement says that Debian "welcomes and encourages >participation by everyone".

    People who express how they identify themselves by having a swastika
    tattoo on their forehead also face a lot of discrimination, harrassment
    and bullying around the world. Our Diversity Statement makes it clear
    that we are welcoming and encouraging their participation and are not >ourselves discriminating against them.

    And, to add the bit that a lot of people conveniently ignore when
    trying to make strawman arguments:

    "We welcome contributions from everyone as long as they interact
    constructively with our community."

    --
    Steve McIntyre, Cambridge, UK. steve@einval.com "We're the technical experts. We were hired so that management could
    ignore our recommendations and tell us how to do our jobs." -- Mike Andrews

    --- SoupGate-Win32 v1.05
    * Origin: fsxNet Usenet Gateway (21:1/5)
  • From Stefano Rivera@21:1/5 to All on Sat Jul 16 12:10:01 2022
    Hi Holger (2022.07.16_09:42:56_+0000)
    but why? how is gender relevant for participating in DebConf as
    a whole? (i can see how it could be relevant for some events, but
    not for the whole conference.)

    It's not. It's for statistics, as we say when we collect it.

    I think it's our business, as a community, and as conference organisers,
    to try to increase the diversity at our events. To me, that means
    increasing speaker diversity, primarily. Attendee diversity won't change
    unless the speaker diversity changes.

    SR

    --
    Stefano Rivera
    http://tumbleweed.org.za/
    +1 415 683 3272

    --- SoupGate-Win32 v1.05
    * Origin: fsxNet Usenet Gateway (21:1/5)
  • From Steve McIntyre@21:1/5 to All on Sat Jul 16 12:30:01 2022
    Roberto C. Snchez wrote:
    On Thu, Jul 14, 2022 at 11:14:43AM +0100, Steve McIntyre wrote:

    Do you have a real use for this package?

    Why in the world is that even a relevant question? There are plenty of >packages in the archive which are useful to essentially nobody apart
    from the maintainer and there are even packages which are maintained
    without being useful to the maintainer at all (but rather useful to
    others).

    I think it's a valid question to ask.

    There are a *lot* of issues
    in this area, and mis-gendering people is not something to risk
    lightly...

    "There are a *lot* of issues in this area" seems rather nebulous. In
    which area? Given the fact that we have clear and rather unambiguous >guidelines for what constitutes software which is appropriate for
    inclusion in the archive, and given that on its face this software does
    not seem to be in conflict with any of those guidelines, what then is
    the problem?

    I'll link to

    https://www.kalzumeus.com/2010/06/17/falsehoods-programmers-believe-about-names/

    again, as a start. Assuming *anything* about names is iffy at best,
    and trying to derive other information (whether that's gender, age, nationality, whatever) from names is unreliable as all hell. When
    there is the potential for *also* causing offense from that unreliable information then I'd hope that people would know better.

    BTW, I'm not interested in any sort of "well I don't like ..." or
    "such and such could offend so and so ..." sort of arguments.

    Thanks, I'm well aware that you don't care. Maybe you could try?

    --
    Steve McIntyre, Cambridge, UK. steve@einval.com "We're the technical experts. We were hired so that management could
    ignore our recommendations and tell us how to do our jobs." -- Mike Andrews

    --- SoupGate-Win32 v1.05
    * Origin: fsxNet Usenet Gateway (21:1/5)
  • From Devops PK Carlisle LLC@21:1/5 to Marvin Renich on Sat Jul 16 12:30:01 2022
    First, by all means, if you want to develop, package and maintain this
    app in keeping with the quality standards a Linux distro like Debian
    demands, go to it.

    Having said that, it sounds like one of the goofiest things I have heard
    of recently, and I can not see ever having a use for it. But that does
    not remotely mean that it should be disallowed. I have done my share of
    coding little functions which I adore, but which most people have not
    heard of, and would have little interest in. It doesn't ruin my day.
    I'll share, but I am Customer Number One.

    With gender-guesser, if I ever saw it on a list of available modules,
    I'd say "Huh, sounds like trouble waiting to happen. No, thanks." and
    that would be all. It may be well-intentioned, but most people aren't
    triggered if a computer gets their gender wrong, and those that are
    triggered by that sort of thing tend to go full Karen if the guess is
    wrong. Therefore you may be setting yourself up for primarily intense
    and negative feedback.

    On 7/15/22 19:01, Marvin Renich wrote:
    * Jeremy Bicha <jeremy.bicha@canonical.com> [220714 10:06]:
    On Thu, Jul 14, 2022 at 2:41 PM Roberto C. Sánchez <roberto@debian.org> wrote:

    On Thu, Jul 14, 2022 at 11:14:43AM +0100, Steve McIntyre wrote:
    edward@4angle.com wrote:

    Package: wnpp
    Severity: wishlist
    Owner: Edward Betts <edward@4angle.com>
    X-Debbugs-Cc: debian-devel@lists.debian.org, debian-python@lists.debian.org

    * Package name : gender-guesser
    Version : 0.4.0
    Upstream Author : Israel Saeta Pérez <israel@lead-ratings.com>
    * URL : https://github.com/lead-ratings/gender-guesser
    * License : GPL-3 & GFDL-1.2+
    Programming Lang: Python
    Description : Guess the gender from first name

    Oh, not *another* package that tries to guess things from names.

    Do you have a real use for this package?

    Why in the world is that even a relevant question? There are plenty of
    packages in the archive which are useful to essentially nobody apart
    from the maintainer and there are even packages which are maintained
    without being useful to the maintainer at all (but rather useful to
    others).

    There are a *lot* of issues
    in this area, and mis-gendering people is not something to risk
    lightly...


    "There are a *lot* of issues in this area" seems rather nebulous. In
    which area? Given the fact that we have clear and rather unambiguous
    guidelines for what constitutes software which is appropriate for
    inclusion in the archive, and given that on its face this software does
    not seem to be in conflict with any of those guidelines, what then is
    the problem? BTW, I'm not interested in any sort of "well I don't like
    ..." or "such and such could offend so and so ..." sort of arguments.

    Debian has a Diversity Statement [1] which says that Debian welcomes
    people regardless of how they identify themselves. Trans people and
    non-binary people face a lot of discrimination, harrassment and
    bullying around the world. That bad treatment of these people is
    against Debian's core values. Therefore, the Debian Project wouldn't
    want to distribute software that appears to facilitate that kind of
    harassment, regardless of the software license it is released under.
    We might not want to distribute such software even if it also has
    non-harmful uses. We don't have to distribute *everything* ourselves.

    People within the Debian community have a right to expect that others in
    the community will not bully, harass, or denigrate them. They do _not_
    have any right to expect that others will not offend them by discussing
    or making contributions that espouse values that are different and incompatible with their own. Such an expectation assumes that one set
    of values is correct and the other is wrong. In order for such an expectation to be met, only one of the two sets of values could exist
    within Debian.

    Saying that gender-guesser should not be packaged within Debian (using
    the excuse given early in this thread) is excluding a contribution based
    on the values to which that package adheres and possibly the contributor
    and the users who would like to use it. This is contrary to being
    inclusive.

    Being offended by someone else's civil expression of their values
    (including the packaging of a particular piece of software) is not the
    same as being bullied or denigrated. Please stop trying to use the
    excuse "it might offend someone" to block participation or inclusion of software. Instead, be inclusive and acknowledge that others' values may
    be different from and incompatible with yours, and accept that Debian is
    a collection of software from diverse sources and some of it may not
    adhere to your values.

    This is the difference between true inclusiveness and the false
    "political correctness" that seems to be permeating our society today.

    When we can all say, "I disagree with your values, but I accept you as a Debian contributor," then we will be truly inclusive.

    ...Marvin


    --- SoupGate-Win32 v1.05
    * Origin: fsxNet Usenet Gateway (21:1/5)
  • From Holger Levsen@21:1/5 to Stefano Rivera on Sat Jul 16 18:20:02 2022
    On Sat, Jul 16, 2022 at 10:05:59AM +0000, Stefano Rivera wrote:
    I think it's our business, as a community, and as conference organisers,
    to try to increase the diversity at our events. To me, that means
    increasing speaker diversity, primarily. Attendee diversity won't change unless the speaker diversity changes.

    I agree, I just don't see how collecting statistics can be useful here. OTOH
    I know about several cases where harmless and unneeded data collection has become harmful later.


    --
    cheers,
    Holger

    ⢀⣴⠾⠻⢶⣦⠀
    ⣾⠁⢠⠒⠀⣿⡁ holger@(debian|reproducible-builds|layer-acht).org
    ⢿⡄⠘⠷⠚⠋⠀ OpenPGP: B8BF54137B09D35CF026FE9D 091AB856069AAA1C
    ⠈⠳⣄

    Imagine god created trillions of galaxies but freaks out because some dude kisses another.

    -----BEGIN PGP SIGNATURE-----

    iQIzBAABCgAdFiEEuL9UE3sJ01zwJv6dCRq4VgaaqhwFAmLS5Q0ACgkQCRq4Vgaa qhzi9g/+PPYj15gbIumasLvGjUhmA2/5erFjPlQi73iWaz8rspocxR8nSdtAvSwF jeFIlIWpGIJnxEPTMhKqdGPa4lSE3RdlRoCYk53NrX/KfvqQKfmr2PW9GKO1rPhr 5dnK6u9Q3Lkpnhjr8p22e9NsxIx2MAwSNxoDqK9KgMq7/5ELn/dZd+lp0UUKjsyk YYd3v4b1Qu6NmfsrHehtihgTOl5nuJl/MBnmMqsCgviqjqhayHzEKEdZo8Y0PH9m ghytz5qoK6bug7BnhWK9pw8MnewJz1WbszVTsVGmaqkACVUUW0Nlthgoep451kzM ygvl5oAqnrCv+aZXROKkPisKBbzxC+4KnfUnaINE6oMGbFeR0e7/7cQcYL+NH6Ps FrTnd2AfenX2zGeQLzb+/gOPUrs6NKX6VF03jCriXL0L0s2cBNf528vZ2izQK+B/ yQIFzlmzbT6mnUpG8Bhkdo/fH4hLzQT1o9LLgqlLRkGg3n42dWgJEqZCxUbsxAhZ Y39DhIB6bEsP8+Y22c0DDXfE1tdgefpcy8cQDT70a4zMDZcEocAnBrp+gpsHX21x 4x0B1g/7LtijNf1ClWevi
  • From Adrian Bunk@21:1/5 to Andrew M.A. Cater on Sat Jul 16 20:10:01 2022
    On Fri, Jul 15, 2022 at 07:10:55PM +0000, Andrew M.A. Cater wrote:
    On Fri, Jul 15, 2022 at 07:05:09PM +0300, Adrian Bunk wrote:
    ...
    Debian is not a project that fights for trans people or fights for denazification or fights for whatever other non-technical topics
    individual contributors might consider worth fighting for elsewhere.

    It does fight for under-represented / disadvantaged groups within Debian in a Debian context.

    What data do you have to prove or disprove whether a group is actually under-represented or disadvantaged within Debian?

    What tools did you use to generate this data?

    The irony is that your "fight" requires exactly the tools you want to
    condemn, and data Debian should better not collect at all.

    ...
    The exact opposite of diversity is to call everything one dislikes or disagrees with "harassment" or *phobic.

    I wonder how it would be if you wanted to use a similar script to test familiarity with English in our developers / a test for neurodiversity
    and high functioning autism / a test for colour vision or dexterity to
    single out anybody who's visually impaired or blind or a guess for
    background religion/beliefs/no belief - I don't think any of these (hypothetical, straw man) scripts would be useful or constructive or contribute well to our Debian community.
    ...

    Most software can be used for many purposes good or bad, looking at
    the vast amount of packages maintained by the Debian Med team I am
    quite astonished that you consider it not constructive contributions to
    Debian when people are packaging software that can be used to diagnose diseases.

    I would rather wonder for how many of your "hypothetical" examples we
    already ship software.

    I wouldn't be surprised if we already ship software that can tell the familiarity with English of a person based on a few emails.

    Steve highlighted the problems of trying to guess gender based on names, determining the biological gender based on voice can be far more
    reliable than using the name. Debian does publish videos with audio that
    can be used for the mentioned usecase of determining the gender of
    Debconf speakers. I would expect that speech recognition tools for deaf
    people either already or in the future will be able to output gender and
    accent of the speaker in an audio recording.

    In the Debian Med or Deep Learning teams we might some day have software
    that can test for high functioning autism of the speakers in the videos
    of Debconf talks.

    Trying to restrict tools is not a new idea.
    An EU directive from 2013 make it mandatory that production or
    distibution of tools primarily for the purpose of committing hacking
    offences must have a maximum sentence of at least 2 years in prison
    in all EU countries.
    Debian ships many such tools, in practice prosecution faces the
    technical reality that the same tools are used for testing the
    security of systems against attacks.

    Prosecuting people caught using these tools for offences works.

    What can realistically work for your examples is not restricting tools,
    but restricting what can be done with data.

    One thing we can and should do to protect members of our Debian
    community is a robust legal response of prosecutions under civil and
    criminal law if people are guilty of privacy abuse through policies or practices when handling personal data that are not compliant with
    applicable legislations like the GDPR.

    Even in cases where such prosecution is not happening, it should be
    clear that privacy abusers are not welcome in our Debian community.

    What is the defined maximum retention time for sensitive personal data
    like sexual orientation, race, ethnicity, religion or political believes
    in the Debian Community Team?
    If there is none or if it is too long, how to fix this swiftly?
    If it is not fixed swiftly, how should Debian act against the abusers?

    ...
    Andy Cater

    cu
    Adrian

    --- SoupGate-Win32 v1.05
    * Origin: fsxNet Usenet Gateway (21:1/5)
  • From Didier 'OdyX' Raboud@21:1/5 to All on Sat Jul 16 22:54:10 2022
    Le samedi, 16 juillet 2022, 19.36:17 h CEST Adrian Bunk a écrit :
    What tools did you use to generate this data?

    The irony is that your "fight" requires exactly the tools you want to condemn, and data Debian should better not collect at all.

    It does not. The whole argument is "gender-guessing if prone to errors, if you want to know what gender a person identify themselves, ask them".

    -----BEGIN PGP SIGNATURE-----

    iQIzBAABCgAdFiEEJ3k7rA0YCplkx4gZqcb6xg1jAWkFAmLTJXIACgkQqcb6xg1j AWnGNxAAiWG2r6KYyDJAKovuigCy8T+AYYePn97pVAas77OSHHtm+1io6bA9051L 2Po+1loPiTZ9ZBFtTcpJkmSRo07Z2GwmSK0QjV0ScpNAMFt0xOQrB+cyIEViAf5v UBypSUbW4gbSs/l+LWT3sDUHAt79GOoMef3tNJqa05ijheYbosuTwY2LIjdHlzRn e4AIXpYK/SA8kLjDXr7L2OMF5mgDCMCXvd4pTjvyaZhiQgP3UqEfleunNZ9mPFV7 XQ7DgyKMxywykqdBeNRHlpeCN1XXJP3Qk3SHs/FsXtCO0A0iS07FesAqxKD/NbK9 nE+iTu8hjB7rXTkM/RvsLs2Z3+M96jCsitc5LmOJ6aU6Kvc4gBFA4187oR/prYgM AvE9zghhFfB8Z07762n3kAkhUSKLxBnESu4ZzlcnGjWc7bMQkJz3DcLcok7IQYAf S/6q6D4x1h05BG8/arGfynvR28TAT2xdr0nH9lQTOtS08Eho6lTQRSdX2LrGrECX HmOqCwMjw3Q4TR7fEo3FpkYW6Fk5w+dDAMPFEaSoXuhJBOOVg9deOAYc5bkILDkf NtEqL1bYX03GpZ3gczPTJ+cGNgLqT9Fsb0xu2vqQM16bNGXfSNeBJtp8LVOa6r+0 VmY7cCQBLaFFKs49KSrXfFLs2bTI4IFaDA5uBLSOg5e7+hUqlDM=
    =0njQ
    -----END PGP SIGNATURE-----

    --- SoupGate-Win32 v1.05
    * Origin: fsxNet Usenet Gateway (21:1/5)
  • From Andrew M.A. Cater@21:1/5 to Didier 'OdyX' Raboud on Sat Jul 16 23:30:01 2022
    On Sat, Jul 16, 2022 at 10:54:10PM +0200, Didier 'OdyX' Raboud wrote:
    Le samedi, 16 juillet 2022, 19.36:17 h CEST Adrian Bunk a crit :
    What tools did you use to generate this data?

    The irony is that your "fight" requires exactly the tools you want to condemn, and data Debian should better not collect at all.

    It does not. The whole argument is "gender-guessing if prone to errors, if you
    want to know what gender a person identify themselves, ask them".

    Further to this, Edward has already suggested that he would be prepared
    to withdraw the ITP.

    Can we bring this thread to a close? It seems sensible to ask people
    how they wish to be identified - in name, pronouns, gender - not to
    stress ourselves if someone replies "Would prefer not to say", and to leave
    any automated attempt to guess to some level of obscurity and academic interest. That approach would resolve many of the difficulties already outlined.

    Steve McIntyre has already raised the "Things people believe about names"
    post which has other companions. Could I respectfully ask that the ITP
    be withdrawn at this point?

    With every good wish, as ever,

    Andrew Cater

    --- SoupGate-Win32 v1.05
    * Origin: fsxNet Usenet Gateway (21:1/5)
  • From Andrej Shadura@21:1/5 to Andrew M.A. Cater on Tue Jul 19 13:50:01 2022
    Hi,

    On Fri, 15 Jul 2022, at 21:10, Andrew M.A. Cater wrote:
    Consider -

    Debconf is on in Kosovo right now. If I had to work out Albanian
    gender mappings from names, I'd have no clue.

    Then S. Indian - Malayalam character sets?? and names from a number of
    Indian languages then Israel and Hebrew/Arabic
    Taiwan had Chinese character sets and names

    No need to go that far.
    Andrea in Germany is traditionally a woman’s name, Andrea in Italy is a masculine name. How can we tell if a certain specific Andrea is named according to the German (Czech, Slovak etc) tradition (and hence likely a woman) or the Italian (and hence
    probably a man)? There’s no way to generally tell this based on the name alone.

    Take Misha/Miša/Миша or Petya/Peťa/Петя. In Russian tradition, these are very likely masculine names, from Mikhail and Petr. In Czech and Slovak, they’re likely feminine names (Michaela and Petra), but not e.g. if the person is an immigrant,
    which is impossible to tell based on the name alone.

    Žeňa/Zhenya/Женя can be either (Eugene or Eugenia). Impossible to tell apart without further clues. (And there are so many "unisex" names!)

    And we haven’t yet touched the topic of people who were given non-traditional names.

    --
    Cheers,
    Andrej

    --- SoupGate-Win32 v1.05
    * Origin: fsxNet Usenet Gateway (21:1/5)
  • From Adam Borowski@21:1/5 to Andrej Shadura on Tue Jul 19 17:00:01 2022
    On Tue, Jul 19, 2022 at 01:48:17PM +0200, Andrej Shadura wrote:
    No need to go that far.

    Andrea in Germany is traditionally a woman’s name, Andrea in Italy is a masculine name. How can we tell if a certain specific Andrea is named according to the German (Czech, Slovak etc) tradition (and hence likely a woman) or the Italian (and hence probably a man)? There’s no way to generally tell this based on the name alone.

    Take Misha/Miša/Миша or Petya/Peťa/Петя. In Russian tradition, these are
    very likely masculine names, from Mikhail and Petr.

    If only this piece of software had a distinction between "almost always
    male", "leaning male", "neutral", "leaning female", "almost always
    female"... Oh wait, it does!
    Precisely for the reason you mention.

    And we haven’t yet touched the topic of people who were given non-traditional names.

    In which case it says "unknown".


    Meow!
    --
    ⢀⣴⠾⠻⢶⣦⠀ We domesticated dogs 36000 years ago; together we chased ⣾⠁⢠⠒⠀⣿⡁ animals, hung out and licked or scratched our private parts.
    ⢿⡄⠘⠷⠚⠋⠀ Cats domesticated us 9500 years ago, and immediately we got
    ⠈⠳⣄⠀⠀⠀⠀ agriculture, towns then cities. -- whitroth on /.

    --- SoupGate-Win32 v1.05
    * Origin: fsxNet Usenet Gateway (21:1/5)
  • From Andrej Shadura@21:1/5 to Adam Borowski on Tue Jul 19 18:10:01 2022
    Hi,

    On Tue, 19 Jul 2022, at 16:57, Adam Borowski wrote:
    On Tue, Jul 19, 2022 at 01:48:17PM +0200, Andrej Shadura wrote:
    Take Misha/Miša/Миша or Petya/Peťa/Петя. In Russian tradition, these are
    very likely masculine names, from Mikhail and Petr.

    If only this piece of software had a distinction between "almost always male", "leaning male", "neutral", "leaning female", "almost always
    female"... Oh wait, it does!
    Precisely for the reason you mention.

    No, it does not and cannot, since some names are almost always male in one culture but almost always female in another one.

    And we haven’t yet touched the topic of people who were given non-traditional names.

    In which case it says "unknown".

    No, it cannot know about cases when a person is given a name traditionally given to another gender in another culture. Pretty common in the US, for example. Sure, there probably aren’t many cases of women named Michael, but there are many other names
    where you wouldn’t be easily able to tell.

    https://en.wikipedia.org/wiki/Category:English_unisex_given_names

    --
    Cheers,
    Andrej

    --- SoupGate-Win32 v1.05
    * Origin: fsxNet Usenet Gateway (21:1/5)
  • From Adam Borowski@21:1/5 to Andrej Shadura on Tue Jul 19 18:40:01 2022
    On Tue, Jul 19, 2022 at 06:08:16PM +0200, Andrej Shadura wrote:
    On Tue, 19 Jul 2022, at 16:57, Adam Borowski wrote:
    On Tue, Jul 19, 2022 at 01:48:17PM +0200, Andrej Shadura wrote:
    Take Misha/Miša/Миша or Petya/Peťa/Петя. In Russian tradition, these are
    very likely masculine names, from Mikhail and Petr.

    If only this piece of software had a distinction between "almost always male", "leaning male", "neutral", "leaning female", "almost always female"... Oh wait, it does!
    Precisely for the reason you mention.

    No, it does not and cannot, since some names are almost always male in one culture but almost always female in another one.

    In other words, if you don't pick a culture, the _global_ dataset (ie, the default one) must not assume either. It takes a lot of work to prepare
    such a database, that's why this package is good to have.


    Meow!
    --
    ⢀⣴⠾⠻⢶⣦⠀
    ⣾⠁⢠⠒⠀⣿⡁
    ⢿⡄⠘⠷⠚⠋⠀ At least spammers get it right: "Hello beautiful!". ⠈⠳⣄⠀⠀⠀⠀

    --- SoupGate-Win32 v1.05
    * Origin: fsxNet Usenet Gateway (21:1/5)
  • From Philip Hands@21:1/5 to Adam Borowski on Tue Jul 19 23:00:01 2022
    Adam Borowski <kilobyte@angband.pl> writes:

    In other words, if you don't pick a culture, the _global_ dataset (ie, the default one) must not assume either. It takes a lot of work to prepare
    such a database, that's why this package is good to have.

    What exactly is it supposed to be good for?

    From what I can tell, at best it produces harmless nonsense, and at
    worst it will cause people to be misidentified in ways that will vary
    between mildly humourous to significantly hurtful.

    It seems to me about as useful as a hammer with a loose head, which would
    allow you to drive nails just well enough that you'll keep on using it
    until it breaks your toe.

    Cheers, Phil.
    --
    |)| Philip Hands [+44 (0)20 8530 9560] HANDS.COM Ltd.
    |-| http://www.hands.com/ http://ftp.uk.debian.org/
    |(| Hugo-Klemm-Strasse 34, 21075 Hamburg, GERMANY

    --=-=-Content-Type: application/pgp-signature; name="signature.asc"

    -----BEGIN PGP SIGNATURE-----

    iQIzBAEBCgAdFiEE3/FBWs4yJ/zyBwfW0EujoAEl1cAFAmLXGVgACgkQ0EujoAEl 1cCdkw//bSAUEGtcEd58AuvRxhsCSzD+bqGAoL53r52Bg57yWeJGO6tbkbp0LabP H86TJcaTjGLZsJ4byrEyEcmNpnJuhvYkcg4PCPbQfamKWnSvwMXHSzClBxj3UkTr j0W02ybVIOwrE6Z5afq2fYvxrTnUgkbXiXjC4Ip8fXZkAr6zt818yuTSs5bZJ5sU /d3kys4J6LBO6F+1ye3VeVf1IDUc75gRMSVH3cQi/4jVvUPhJXKr1azPWdIL2PrF wPP2JLS5HKCrrp02ooab9RUiIGgwc0M3CboFSlOkagAmbC/bxUX5pC7cru67r30Z Mo+GLPyKBLIA6nhAv6vVDnDEzw7fxrErkDdoYUFvfVnRuRfaKaRTViifKJtHlUsA LYb9X8NEQR1zYJOsNQzu4TDYrWxjDyYKoukSE22izQmZcI8c0n1wu5Y8pORM+g9t S+2byk3VDMgr9FMOUCZ1Ijjgl4Z+xLrELw+Y/gO0QD3KTPTmU6+kMU8mQkx9019x TNMjbtekrTVIK7PM/3xeLw78ohryfe1Zq/tqD0WlnCeI7aDhL0YG6xvZ8MjvUha/ 0fAq64EegK2egb+YmIrYTkDW6Jr2+XXgVOkePrdw6TYrRMaOLQUfVOkjci3yaSXI rNX0AjoWFVxnS9H
  • From =?utf-8?Q?"Yao_Wei_=28=E9=AD=8F=E9=@21:1/5 to All on Wed Jul 20 03:00:02 2022
    --Apple-Mail-867E8EC9-EB84-4F59-8D92-0F70EFE04E1B
    Content-Type: text/plain;
    charset=utf-8
    Content-Transfer-Encoding: quoted-printable


    * Package name : gender-guesser
    Version : 0.4.0
    Upstream Author : Israel Saeta Pérez <israel@lead-ratings.com>
    * URL : https://github.com/lead-ratings/gender-guesser
    * License : GPL-3 & GFDL-1.2+
    Programming Lang: Python
    Description : Guess the gender from first name

    Hi,

    I'd like to ask a practical question, do we have anything either in WNPP or in the archive that depends or uses this package?

    Although I guess this library might violate DFSG 5 by itself, I would like to see where it's actually used and why we need the library.

    Yao Wei

    (This email is sent from a phone; sorry for HTML email if it happens.) --Apple-Mail-867E8EC9-EB84-4F59-8D92-0F70EFE04E1B
    Content-Type: text/html;
    charset=utf-8
    Content-Transfer-Encoding: quoted-printable

    <html><head><meta http-equiv="content-type" content="text/html; charset=utf-8"></head><body dir="auto"><div><br></div><blockquote type="cite"><div dir="ltr"><blockquote type="cite"><span></span></blockquote><blockquote type="cite"><span>* Package name &
    nbsp;&nbsp;&nbsp;: gender-guesser</span><br></blockquote><blockquote type="cite"><span> Version &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;: 0.4.0</span><br></blockquote><blockquote type="cite"><span> Upstream Author : Israel Saeta Pérez &lt;israel@
    lead-ratings.com&gt;</span><br></blockquote><blockquote type="cite"><span>* URL &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;: https://github.com/lead-ratings/gender-guesser</span><br></blockquote><blockquote type="cite"><span>*
    License &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;: GPL-3 &amp; GFDL-1.2+</span><br></blockquote><blockquote type="cite"><span> Programming Lang: Python</span><br></blockquote><blockquote type="cite"><span> Description &nbsp;&nbsp;&nbsp;&nbsp;:
    Guess the gender from first name</span></blockquote></div></blockquote><div><br></div><div style="caret-color: rgb(0, 0, 0); color: rgb(0, 0, 0);">Hi,</div><div style="caret-color: rgb(0, 0, 0); color: rgb(0, 0, 0);"><br></div><span style="caret-color:
    rgb(0, 0, 0); color: rgb(0, 0, 0);">I'd like to ask a practical question, do we have anything either in WNPP or in the archive that depends or uses this package?</span><div><br></div><div>Although I guess this library might violate DFSG 5 by itself, I
    would like to see where it's actually used and why we need the library.<br style="caret-color: rgb(0, 0, 0); color: rgb(0, 0, 0);"><br style="caret-color: rgb(0, 0, 0); color: rgb(0, 0, 0);"><div dir="ltr" style="caret-color: rgb(0, 0, 0); color: rgb(0,
    0, 0);">Yao Wei<div><br></div><div>(This email is sent from a phone; s<span style="background-color: rgba(255, 255, 255, 0);">orry for HTML email if it happens.</span><span style="font-size: 13pt;">)</span></div></div></div></body></html>
    --Apple-Mail-867E8EC9-EB84-4F59-8D92-0F70EFE04E1B--

    --- SoupGate-Win32 v1.05
    * Origin: fsxNet Usenet Gateway (21:1/5)
  • From Tzafrir Cohen@21:1/5 to Andrej Shadura on Wed Jul 20 08:10:01 2022
    Hi,

    On Tue, Jul 19, 2022 at 06:08:16PM +0200, Andrej Shadura wrote:
    Hi,

    On Tue, 19 Jul 2022, at 16:57, Adam Borowski wrote:
    On Tue, Jul 19, 2022 at 01:48:17PM +0200, Andrej Shadura wrote:
    Take Misha/Miša/Миша or Petya/Peťa/Петя. In Russian tradition, these are
    very likely masculine names, from Mikhail and Petr.

    If only this piece of software had a distinction between "almost always male", "leaning male", "neutral", "leaning female", "almost always female"... Oh wait, it does!
    Precisely for the reason you mention.

    No, it does not and cannot, since some names are almost always male in one culture but almost always female in another one.

    And we haven’t yet touched the topic of people who were given non-traditional names.

    In which case it says "unknown".

    No, it cannot know about cases when a person is given a name traditionally given to another gender in another culture. Pretty common in the US, for example. Sure, there probably aren’t many cases of women named Michael, but there are many other names
    where you wouldn’t be easily able to tell.

    https://en.wikipedia.org/wiki/Category:English_unisex_given_names

    Both of you make arguments based on wrong data.
    If you would just tried the software (clone it from its source. No need
    to install anything) you would notice that.

    import gender_guesser.detector as gender
    d = gender.Detector()
    print(d.get_gender(u"Andrea"))
    female
    print(d.get_gender(u"Misha"))
    male
    print(d.get_gender(u"Miša"))
    andy
    print(d.get_gender(u"Миша"))
    unknown
    print(d.get_gender(u"Petya"))
    male
    print(d.get_gender(u"Peťa"))
    unknown
    print(d.get_gender(u"Петя"))
    unknown

    So: the software can give an output "andy", that is androgynous, it also
    has the output "mostly_male" and "mostly_female".

    Furthermore, reading the README, I noticed I can give it more context:

    print(d.get_gender(u"Andrea", "italy"))
    male
    print(d.get_gender(u"Misha", "slovakia"))
    andy
    print(d.get_gender(u"Petya", "slovakia"))
    andy

    However, "unknown" means "Not in my database". It does not mean "Neither
    male nor female". So it seems Adam also did not check the results before posting.

    Maybe gender-guesser belongs in Debian and maybe it doesn't. But please
    try to at least look at the software before judging it.

    --
    mail / xmpp / matrix: tzafrir@cohens.org.il

    --- SoupGate-Win32 v1.05
    * Origin: fsxNet Usenet Gateway (21:1/5)
  • From Edward Betts@21:1/5 to All on Wed Jul 20 09:00:01 2022
    Sorry about trigging another big thread on debian-devel. I was naive when
    I posted this ITP, I didn't realise it would be controversial.

    Having read the discussion I agree there is a risk of this package being misused. There is no requirement for this package to be in Debian.

    I'm cancelling my ITP.

    Anybody who still feels like they might find this Python module useful can download it from PyPI.

    I've asked in #debian-ftp for the package to be removed from the NEW queue.
    --
    Edward

    --- SoupGate-Win32 v1.05
    * Origin: fsxNet Usenet Gateway (21:1/5)
  • From Stefano Rivera@21:1/5 to All on Wed Jul 20 10:20:01 2022
    Hi Philip (2022.07.19_20:51:36_+0000)
    In other words, if you don't pick a culture, the _global_ dataset (ie, the default one) must not assume either. It takes a lot of work to prepare such a database, that's why this package is good to have.

    What exactly is it supposed to be good for?

    Seems perfect for statistics from large data sets. The data in those
    kind of things is always messy as hell. Full of mistakes and
    misalignment to whatever lines you are trying to draw through it.
    But, on aggregate, it does give you results, if you can control for the problems.

    A lot of research is done like this, "data science" they call it.

    From what I can tell, at best it produces harmless nonsense, and at
    worst it will cause people to be misidentified in ways that will vary
    between mildly humourous to significantly hurtful.

    On an individual level, but on a large scale, you'd expect things to
    average out. Unless it has a systemic bias towards any particular
    output.

    SR

    --
    Stefano Rivera
    http://tumbleweed.org.za/
    +1 415 683 3272

    --- SoupGate-Win32 v1.05
    * Origin: fsxNet Usenet Gateway (21:1/5)
  • From Sam Hartman@21:1/5 to All on Wed Jul 20 18:20:01 2022
    "Andrey" == Andrey Rahmatullin <wrar@debian.org> writes:
    Andrey> More sensible than not filing it? This defeats both
    Andrey> purposes of an ITP: getting it discussed and working as a
    Andrey> mutex for people who are thinking about packaging the same
    Andrey> software. Are there other purposes?

    Yes, it's one of the ways people learn about software that is being
    packaged and they might like to become involved in.
    I find reading ITPs

    1) increases my interest in Debian because I see cool stuff people are
    doing

    2) Is one of the ways I learn about software I might find useful

    3) Potentially could point me in the direction of software to contribute
    to.

    All those are valuable to me even when an ITP is filed immediately
    before uploading.

    I agree that we could find other ways of getting that information to
    people who are interested.
    But today, in my work flow, ITPs are useful to me as an ITP consumer
    even if immediately before upload.

    --- SoupGate-Win32 v1.05
    * Origin: fsxNet Usenet Gateway (21:1/5)
  • From =?UTF-8?Q?Hakan_Bay=c4=b1nd=c4=b1r?@21:1/5 to Sam Hartman on Thu Jul 21 12:20:01 2022
    This is exactly my point of view of ITPs as well, while I'm not as
    involved in Debian in most of the people here, it's a nice and proper
    gateway to see what's happening and what people are working on.

    Also, I have taken note of at least of couple pieces of software which I
    could use developing mine.

    Cheers,

    H.

    On 20.07.2022 19:16, Sam Hartman wrote:
    "Andrey" == Andrey Rahmatullin <wrar@debian.org> writes:
    Andrey> More sensible than not filing it? This defeats both
    Andrey> purposes of an ITP: getting it discussed and working as a
    Andrey> mutex for people who are thinking about packaging the same
    Andrey> software. Are there other purposes?

    Yes, it's one of the ways people learn about software that is being
    packaged and they might like to become involved in.
    I find reading ITPs

    1) increases my interest in Debian because I see cool stuff people are
    doing

    2) Is one of the ways I learn about software I might find useful

    3) Potentially could point me in the direction of software to contribute
    to.

    All those are valuable to me even when an ITP is filed immediately
    before uploading.

    I agree that we could find other ways of getting that information to
    people who are interested.
    But today, in my work flow, ITPs are useful to me as an ITP consumer
    even if immediately before upload.

    --- SoupGate-Win32 v1.05
    * Origin: fsxNet Usenet Gateway (21:1/5)