• Collation discrepencies between locales [was: Re: Bits from the rel

    From Peter Pentchev@21:1/5 to All on Fri Apr 20 10:10:01 2018
    On Thu, Apr 19, 2018 at 11:31:18PM +0200, Clément Hermann wrote:
    On 19/04/2018 22:45, Holger Levsen wrote:
    I now wondered if it's not only en_GB.utf8 which is "different", but also the NZ and US variants sort like that (and so differently than C)... not sure if en_FR.utf8 exist, but using it, it sorts differently / like C ;)

    (probably because it doesnt exist, thus the default, C, is used.)

    Indeed, it doesn't exist. At least , for fr_* locale, it seems to be consistent both in the different charsets available (e.g. fr_FR and fr_FR.UTF-8) and country (fr_BE, fr_CA, fr_CH, fr_FR and fr_LU).

    Actually I thought the localization had been made consistently with the apparition of unicode locales...

    Oh, I do so love the (possibly unintended) phrase "the apparition of
    unicode locales"!

    G'luck,
    Peter

    --
    Peter Pentchev roam@ringlet.net roam@FreeBSD.org pp@storpool.com
    PGP key: http://people.FreeBSD.org/~roam/roam.key.asc
    Key fingerprint 2EE7 A7A5 17FC 124C F115 C354 651E EFB0 2527 DF13

    -----BEGIN PGP SIGNATURE-----

    iQIzBAABCAAdFiEELuenpRf8EkzxFcNUZR7vsCUn3xMFAlrZnbEACgkQZR7vsCUn 3xM3sBAAz+TTtMxkD37YMZiC7K8Efct71bmToCLuQSsPgyyUCSDBJSmCqL08Ui+A yS9ujeq0s/8W3Ch5w/y1Z04TZr8UjdenZJCpAQ/mAPHydOQgpvTeECQM+hzaFz2y fgOswVCDppMBXvVFgk/AP9k8D9/2tMvKnlLg2noWTfB2RD0Q5+2EU6delg0IPsLs L//Z9LJ6Z7k3fxEns4VfBFed6mF+hOXbRmOWZSYPjRgNNDoaMZ5bTW1PWLZ6uGx3 ADUZDIkesJh1DDmL8R/jxqAu87boZpGwu6AHZlcjTO8vkDbzp0TISewePNqsdyAh /ln30UiSZAcvLtSSyJcoqCnb45Vrub4elLDTvwX6HHRgXRU6v1vQV950FoBYosMZ 5yAWk6nzFChRqVQwVA6rmVgLVguF8fMUIyfqhytrRbXEXehcGoTa5mJwdu0N1lk5 sADGc9PvkT2EByTwkzjFqUjz3iohhjJ+O72FThI4ddW4EJjnXm2QsY0FFKVoahHw 77u5FQ65xWBA5wBEKP2KnoZKKCsQ1HMqQjflq6eOgrxjSBfMm+mpAGrAJNMsP6zj vMk/GxLfUHy/Cy+pUMOzqDRG4eGeylzJ/2y8mB+BarqMwT7c4uG2PEX+VJtRfypj ekoXCTs+7JaSK4OXwLatKGIwBl96fWlCAnY7zvM3yjWVRhS/XvM=
    =vzZ5
  • From Thomas Harding@21:1/5 to All on Fri Apr 20 14:00:02 2018
    Le 20 avril 2018 09:58:46 GMT+02:00, Peter Pentchev <roam@ringlet.net> a écrit :
    On Thu, Apr 19, 2018 at 11:31:18PM +0200, Clément Hermann wrote:
    On 19/04/2018 22:45, Holger Levsen wrote:
    I now wondered if it's not only en_GB.utf8 which is "different",
    but also
    the NZ and US variants sort like that (and so differently than
    C)... not
    sure if en_FR.utf8 exist, but using it, it sorts differently / like
    C ;)

    (probably because it doesnt exist, thus the default, C, is used.)

    Indeed, it doesn't exist. At least , for fr_* locale, it seems to be
    consistent both in the different charsets available (e.g. fr_FR and
    fr_FR.UTF-8) and country (fr_BE, fr_CA, fr_CH, fr_FR and fr_LU).

    Actually I thought the localization had been made consistently with
    the
    apparition of unicode locales...

    Oh, I do so love the (possibly unintended) phrase "the apparition of
    unicode locales"!

    So, after a so long service to the human being crew, that's time to introduce c_C and c_C.utf8 locale which would both compile^Wcomply with C(omputer's languages, habits and traditions).

    Then, maybe, English spokers would understand they all speak a local English, and that their computer has it's own

    (even iso639-2 and later 3 languages letters codification doesen't express any language variant --- or as exceptions. I was surprised as I formatted a csv version of iso639-2 around ?2000,2003? / trying to setup a Koha database for personal use /
    moreover the person at Library of Congress who was in charge never heard anything about computer parsing).


    --
    Je suis née pour partager, non la haine, mais l'amour.
    Sophocle, /Antigone, 442 av. JC

    --- SoupGate-Win32 v1.05
    * Origin: fsxNet Usenet Gateway (21:1/5)