• How to configure cleanfeed.local to de-peer google-groups?

    From Don@21:1/5 to All on Sat Nov 25 17:52:34 2023
    suck feeds my private, in-house, newserver.

    cleanfeed's installed using software downloaded from:

    <https://github.com/crooks/cleanfeed>

    spamassassin is not deployed at present.

    Path: is unusable as a cleanfeed filter because in my case it always
    contains:

    Path: meow.home.net!not-for mail

    On the other hand, Injection-Info indeed identifies google-groups:

    Injection-Info: google-groups.googlegroups.com; posting-host= ...

    cleanfeed.local was copied from cleanfeed.local.sample and this snippet
    was added:

    sub local_filter_last {
    if ($hdr{Injection-Info} =~ /google-groups\.google\.com/) {
    return reject("Reject google-groups posting to $hdr{Newsgroups}");
    };

    Can anyone offer any ideas as to why it won't work? Given how the very
    next "recipe" in cleanfeed.local does indeed work:

    if (not $hdr{References} and
    $hdr{__LINES__} > 25 and
    $hdr{__LINES__} < 150 and
    scalar @groups > 4 and
    scalar @followups > 4) {
    if ($hdr{__BODY__} =~ /MI5/ or
    $hdr{Subject} =~ /M.{1,3}I.{1,3}5/) {
    saveart('mi5');
    return reject('MI5 Filter');
    };
    };

    it indicates a "healthy" cleanfeed install.

    Thank you in advance for any advice.

    Danke,

    --
    Don, KB7RPU, https://www.qsl.net/kb7rpu
    There was a young lady named Bright Whose speed was far faster than light;
    She set out one day In a relative way And returned on the previous night.

    --- SoupGate-Win32 v1.05
    * Origin: fsxNet Usenet Gateway (21:1/5)
  • From Don@21:1/5 to All on Sun Nov 26 04:34:50 2023
    The following filter de-peers google for me:

    sub local_filter_first {
    if ($hdr{'Injection-Info'} =~ /google-groups\.googlegroups\.com/) {
    return reject('google-groups');
    };

    It turns out google\.com was erroneously used initially instead of googlegroups\.com .

    Danke,

    --
    Don, KB7RPU, https://www.qsl.net/kb7rpu
    There was a young lady named Bright Whose speed was far faster than light;
    She set out one day In a relative way And returned on the previous night.

    --- SoupGate-Win32 v1.05
    * Origin: fsxNet Usenet Gateway (21:1/5)
  • From yamo'@21:1/5 to All on Sun Nov 26 09:59:00 2023
    Hi,

    Don a tapoté le 26/11/2023 05:34:
    The following filter de-peers google for me:

    sub local_filter_first {
    if ($hdr{'Injection-Info'} =~ /google-groups\.googlegroups\.com/) {
    return reject('google-groups');
    };

    It turns out google\.com was erroneously used initially instead of googlegroups\.com .

    Danke,



    I have done it by cleanfeed but here it is not filtered for all groups.

    For all groups, the best way is in newsfeeds file :


    $BADPATHS=google-groups.googlegroups.com
    ME/$BADPATHS:::






    --
    Stéphane

    --- SoupGate-Win32 v1.05
    * Origin: fsxNet Usenet Gateway (21:1/5)
  • From llp@21:1/5 to All on Sun Nov 26 19:44:01 2023
    yamo' a prsent l'nonc suivant :
    Hi,

    Don a tapot le 26/11/2023 05:34:
    The following filter de-peers google for me:

    sub local_filter_first {
    if ($hdr{'Injection-Info'} =~ /google-groups\.googlegroups\.com/) {
    return reject('google-groups');
    };

    It turns out google\.com was erroneously used initially instead of
    googlegroups\.com .

    Danke,



    I have done it by cleanfeed but here it is not filtered for all groups.

    For all groups, the best way is in newsfeeds file :


    $BADPATHS=google-groups.googlegroups.com
    ME/$BADPATHS:::

    Some people are posting real messages from Google.
    It's a shame to filter everything from us.

    --- SoupGate-Win32 v1.05
    * Origin: fsxNet Usenet Gateway (21:1/5)
  • From Eric M@21:1/5 to All on Sun Nov 26 18:49:12 2023
    Le 26/11/2023 à 19:44, llp a écrit :

    Some people are posting real messages from Google.
    It's a shame to filter everything from us.

    You can't destroy usenet for a few posters.

    --- SoupGate-Win32 v1.05
    * Origin: fsxNet Usenet Gateway (21:1/5)
  • From Adam H. Kerman@21:1/5 to llp on Sun Nov 26 18:59:23 2023
    llp <llp@news.usenet.ovh> wrote:

    . . .

    Some people are posting real messages from Google.
    It's a shame to filter everything from us.

    Why is that? This large-scale injection of spam through Google Groups is
    an enormous hint to those posting the tiny number of legitimate articles
    that it's long past time to become a subscriber on a genuine News
    server. They made the choice to continue to associate with a News site
    that has created massive problems for all other News administrators.

    If other News administrators implement a passive Usenet Death Penalty as
    a spam countermeasure, then their articles won't propogate widely. They
    cannot be unaware of the consequence.

    --- SoupGate-Win32 v1.05
    * Origin: fsxNet Usenet Gateway (21:1/5)
  • From llp@21:1/5 to All on Sun Nov 26 20:30:38 2023
    Adam H. Kerman avait soumis l'ide :
    llp <llp@news.usenet.ovh> wrote:

    . . .

    Some people are posting real messages from Google.
    It's a shame to filter everything from us.

    Why is that? This large-scale injection of spam through Google Groups is
    an enormous hint to those posting the tiny number of legitimate articles
    that it's long past time to become a subscriber on a genuine News
    server. They made the choice to continue to associate with a News site
    that has created massive problems for all other News administrators.

    If other News administrators implement a passive Usenet Death Penalty as
    a spam countermeasure, then their articles won't propogate widely. They cannot be unaware of the consequence.

    In theory, you're right.
    In practice, it will split usenet in two. Those who fully filter
    Googlegroups and those who don't. I find it hard to believe that large commercial servers blacklist google. Without this, an udp of the other
    servers will be useless. For the time being, I prefer the nocems
    solution for filtering this spam.
    I've seen that eternal-september, i2pn or usenet.ovh have followed this
    path, rejecting spam as soon as it arrives on the server and setting up
    nocems for the other servers.

    --- SoupGate-Win32 v1.05
    * Origin: fsxNet Usenet Gateway (21:1/5)
  • From Adam H. Kerman@21:1/5 to llp on Sun Nov 26 20:08:34 2023
    llp <contact@usenet.ovh> wrote:
    Adam H. Kerman avait soumis l'ide :
    llp <llp@news.usenet.ovh> wrote:

    . . .

    Some people are posting real messages from Google.
    It's a shame to filter everything from us.

    Why is that? This large-scale injection of spam through Google Groups is
    an enormous hint to those posting the tiny number of legitimate articles >>that it's long past time to become a subscriber on a genuine News
    server. They made the choice to continue to associate with a News site
    that has created massive problems for all other News administrators.

    If other News administrators implement a passive Usenet Death Penalty as
    a spam countermeasure, then their articles won't propogate widely. They >>cannot be unaware of the consequence.

    In theory, you're right.
    In practice, it will split usenet in two. Those who fully filter
    Googlegroups and those who don't.

    This isn't Usenet's problem. The consequences are well known at the
    point to the poster who has chosen not to become a user on a genuine
    News server that isn't so widely rejected.

    I find it hard to believe that large
    commercial servers blacklist google. Without this, an udp of the other >servers will be useless. For the time being, I prefer the nocems
    solution for filtering this spam.

    I've seen that eternal-september, i2pn or usenet.ovh have followed this
    path, rejecting spam as soon as it arrives on the server and setting up >nocems for the other servers.

    You aren't the one writing them. The ones who are have been putting an incredible amount of work into it and at some point will conclude that
    it's just not worth it.

    --- SoupGate-Win32 v1.05
    * Origin: fsxNet Usenet Gateway (21:1/5)
  • From Kaz Kylheku@21:1/5 to llp on Sun Nov 26 20:26:54 2023
    On 2023-11-26, llp <llp@news.usenet.ovh> wrote:
    Some people are posting real messages from Google.

    Unfortunately, these genuine, well-meaning postings often try try to
    help someone from 1992, who probably figured out their problem in the
    three decades since. :)

    --
    TXR Programming Language: http://nongnu.org/txr
    Cygnal: Cygwin Native Application Library: http://kylheku.com/cygnal
    Mastodon: @Kazinator@mstdn.ca
    NOTE: If you use Google Groups, I don't see you, unless you're whitelisted.

    --- SoupGate-Win32 v1.05
    * Origin: fsxNet Usenet Gateway (21:1/5)
  • From Richard Kettlewell@21:1/5 to llp on Sun Nov 26 20:51:04 2023
    llp <llp@news.usenet.ovh> writes:
    yamo' a présenté l'énoncé suivant :
    Don a tapoté le 26/11/2023 05:34:
    The following filter de-peers google for me:
    sub local_filter_first {
    if ($hdr{'Injection-Info'} =~ /google-groups\.googlegroups\.com/) {
    return reject('google-groups');
    };
    It turns out google\.com was erroneously used initially instead of
    googlegroups\.com .
    Danke,



    I have done it by cleanfeed but here it is not filtered for all groups.

    For all groups, the best way is in newsfeeds file :


    $BADPATHS=google-groups.googlegroups.com
    ME/$BADPATHS:::

    Some people are posting real messages from Google.
    It's a shame to filter everything from us.

    It is a shame, but other, less spammy, news servers do exist, and you
    can use them instead if you wish.

    (I’m still not blocking Google, not that it would affect anyone but me
    if I did, but the persistent spam is making me increasingly sympathetic
    to the server operators who do block Google).

    --
    https://www.greenend.org.uk/rjk/

    --- SoupGate-Win32 v1.05
    * Origin: fsxNet Usenet Gateway (21:1/5)
  • From llp@21:1/5 to All on Sun Nov 26 22:12:11 2023
    Le 26/11/2023, Adam H. Kerman a suppos :
    llp <contact@usenet.ovh> wrote:
    Adam H. Kerman avait soumis l'ide :
    llp <llp@news.usenet.ovh> wrote:

    . . .

    Some people are posting real messages from Google.
    It's a shame to filter everything from us.

    Why is that? This large-scale injection of spam through Google Groups is >>> an enormous hint to those posting the tiny number of legitimate articles >>> that it's long past time to become a subscriber on a genuine News
    server. They made the choice to continue to associate with a News site
    that has created massive problems for all other News administrators.

    If other News administrators implement a passive Usenet Death Penalty as >>> a spam countermeasure, then their articles won't propogate widely. They
    cannot be unaware of the consequence.

    In theory, you're right.
    In practice, it will split usenet in two. Those who fully filter
    Googlegroups and those who don't.

    This isn't Usenet's problem. The consequences are well known at the
    point to the poster who has chosen not to become a user on a genuine
    News server that isn't so widely rejected.

    I find it hard to believe that large
    commercial servers blacklist google. Without this, an udp of the other
    servers will be useless. For the time being, I prefer the nocems
    solution for filtering this spam.

    I've seen that eternal-september, i2pn or usenet.ovh have followed this
    path, rejecting spam as soon as it arrives on the server and setting up
    nocems for the other servers.

    You aren't the one writing them. The ones who are have been putting an incredible amount of work into it and at some point will conclude that
    it's just not worth it.

    Sorry, you're wrong ;-)
    I'm the administrator of the usenet.ovh server and I've written
    nocembot for produce Nocem (as well as a filter for cleanfeed to
    reject spam at source on my server).
    But, yes, it takes time.

    --- SoupGate-Win32 v1.05
    * Origin: fsxNet Usenet Gateway (21:1/5)
  • From Adam H. Kerman@21:1/5 to contact@usenet.ovh on Sun Nov 26 22:13:10 2023
    llp <contact@usenet.ovh> wrote:
    Le 26/11/2023, Adam H. Kerman a suppos :
    llp <contact@usenet.ovh> wrote:
    Adam H. Kerman avait soumis l'ide :
    llp <llp@news.usenet.ovh> wrote:

    . . .

    Some people are posting real messages from Google.
    It's a shame to filter everything from us.

    Why is that? This large-scale injection of spam through Google Groups is >>>> an enormous hint to those posting the tiny number of legitimate articles >>>> that it's long past time to become a subscriber on a genuine News
    server. They made the choice to continue to associate with a News site >>>> that has created massive problems for all other News administrators.

    If other News administrators implement a passive Usenet Death Penalty as >>>> a spam countermeasure, then their articles won't propogate widely. They >>>> cannot be unaware of the consequence.

    In theory, you're right.
    In practice, it will split usenet in two. Those who fully filter
    Googlegroups and those who don't.

    This isn't Usenet's problem. The consequences are well known at the
    point to the poster who has chosen not to become a user on a genuine
    News server that isn't so widely rejected.

    I find it hard to believe that large
    commercial servers blacklist google. Without this, an udp of the other
    servers will be useless. For the time being, I prefer the nocems
    solution for filtering this spam.

    I've seen that eternal-september, i2pn or usenet.ovh have followed this
    path, rejecting spam as soon as it arrives on the server and setting up
    nocems for the other servers.

    You aren't the one writing them. The ones who are have been putting an
    incredible amount of work into it and at some point will conclude that
    it's just not worth it.

    Sorry, you're wrong ;-)
    I'm the administrator of the usenet.ovh server and I've written
    nocembot for produce Nocem (as well as a filter for cleanfeed to
    reject spam at source on my server).
    But, yes, it takes time.

    I apologize.

    --- SoupGate-Win32 v1.05
    * Origin: fsxNet Usenet Gateway (21:1/5)
  • From Grant Taylor@21:1/5 to Don on Mon Nov 27 18:14:59 2023
    On 11/25/23 11:52, Don wrote:
    suck feeds my private, in-house, newserver.

    Okay.

    I can't help with cleanfeed itself. That being said, I do have the
    following thoughts.

    Path: is unusable as a cleanfeed filter because in my case it always contains:

    Path: meow.home.net!not-for mail

    That seems like something I'd try to correct.

    Specifically I'd try to reconfigure things to append your news server
    (new on left / old on right) to the existing Path: header value.

    So you could filter in the Google Groups Path: header part.

    Or possibly have whatever you're using to pull down articles simply not
    save articles from Google Groups or simply not re-inject them into your
    server.

    The Path; header, including Google parts, should be in the articles that
    you are downloading.

    I'd suggest you try using that instead of re-inventing the wheel.

    On the other hand, Injection-Info indeed identifies google-groups:

    Injection-Info: google-groups.googlegroups.com; posting-host= ...

    Yes.

    Sorry, I can't help with cleanfeed itself.

    Thank you in advance for any advice.

    Good luck.



    --
    Grant. . . .

    --- SoupGate-Win32 v1.05
    * Origin: fsxNet Usenet Gateway (21:1/5)
  • From Grant Taylor@21:1/5 to llp on Mon Nov 27 18:16:31 2023
    On 11/26/23 12:44, llp wrote:
    Some people are posting real messages from Google.

    Yep.

    It's a shame to filter everything from us.

    Agreed.

    But each news administrator needs to make a decision of how much time
    they are willing to spend retroactively cleaning up Google Groups spam
    vs proactively blocking all Google Groups messages. It's a personal
    choice for them alone to make.



    Grant. . . .

    --- SoupGate-Win32 v1.05
    * Origin: fsxNet Usenet Gateway (21:1/5)
  • From Don@21:1/5 to Grant on Tue Nov 28 14:50:17 2023
    Grant wrote:
    Don wrote:
    suck feeds my private, in-house, newserver.

    Okay.

    I can't help with cleanfeed itself. That being said, I do have the
    following thoughts.

    Path: is unusable as a cleanfeed filter because in my case it always
    contains:

    Path: meow.home.net!not-for mail

    That seems like something I'd try to correct.

    Specifically I'd try to reconfigure things to append your news server
    (new on left / old on right) to the existing Path: header value.

    So you could filter in the Google Groups Path: header part.

    Or possibly have whatever you're using to pull down articles simply not
    save articles from Google Groups or simply not re-inject them into your server.

    The Path; header, including Google parts, should be in the articles that
    you are downloading.

    I'd suggest you try using that instead of re-inventing the wheel.

    Yes. Something is obviously unintentionally broken and needs to be
    fixed. yamo' shows a superior solution earlier in the thread.

    Any ideas from the group on how to make suck properly handle headers
    is appreciated in advance.

    Danke,

    --
    Don, KB7RPU, https://www.qsl.net/kb7rpu
    There was a young lady named Bright Whose speed was far faster than light;
    She set out one day In a relative way And returned on the previous night.

    --- SoupGate-Win32 v1.05
    * Origin: fsxNet Usenet Gateway (21:1/5)
  • From Sn!pe@21:1/5 to Don on Tue Nov 28 15:12:46 2023
    Don <g@crcomp.net> wrote:

    Grant wrote:
    Don wrote:
    suck feeds my private, in-house, newserver.

    Okay.

    I can't help with cleanfeed itself. That being said, I do have the following thoughts.

    Path: is unusable as a cleanfeed filter because in my case it always
    contains:

    Path: meow.home.net!not-for mail

    That seems like something I'd try to correct.

    Specifically I'd try to reconfigure things to append your news server
    (new on left / old on right) to the existing Path: header value.

    So you could filter in the Google Groups Path: header part.

    Or possibly have whatever you're using to pull down articles simply not save articles from Google Groups or simply not re-inject them into your server.

    The Path; header, including Google parts, should be in the articles that you are downloading.

    I'd suggest you try using that instead of re-inventing the wheel.


    Yes. Something is obviously unintentionally broken and needs to be
    fixed. yamo' shows a superior solution earlier in the thread.

    Any ideas from the group on how to make suck properly handle headers
    is appreciated in advance.

    Danke,


    I don't know if this might be useful, it works very well for me
    as a Newsreader filter. I killfile on User-Agent: G2/1.0

    --
    ^^. Sn!pe, PA, FIBS - Professional Crastinator.
    My pet rock Gordon just is.

    Google Groups articles not seen here unless poster is whitelisted.

    --- SoupGate-Win32 v1.05
    * Origin: fsxNet Usenet Gateway (21:1/5)
  • From yamo'@21:1/5 to All on Sun Dec 3 11:43:37 2023
    Hi,

    Don a tapot :

    Yes. Something is obviously unintentionally broken and needs to be
    fixed. yamo' shows a superior solution earlier in the thread.

    My cleanfeed.local has those lines (inside other many lines) :

    sub local_flag_spamsource2 {
    return 1 if $hdr{'Injection-Info'} =~ /google-groups.googlegroups.com|posting.uzoreto.com/;
    return 1 if $hdr{'X-Complaints-To'} =~ /abuse\@usenetserver.com/;
    return 0;
    };

    sub local_filter_last {

    if(local_flag_spamsource2){
    if (not $hdr{Newsgroups} =~ /^news\.|^fr\./i) {
    return reject( 'spam google?');
    };
    };
    };


    Beware my copy past may be broken!
    You can check with :

    perl -wc /etc/news/filter/cleanfeed/etc/cleanfeed.local

    You can adapt them for your config...


    Any ideas from the group on how to make suck properly handle headers
    is appreciated in advance.

    I only use innfeed. I never used suck.

    --
    Stphane
    Sorry for my bad English...

    --- SoupGate-Win32 v1.05
    * Origin: fsxNet Usenet Gateway (21:1/5)
  • From Ray Banana@21:1/5 to All on Wed Dec 6 07:14:25 2023
    Thus spake "Don" <g@crcomp.net>

    Don wrote:

    In the end, my innd install perfectly handles the Path: header. After
    too much dectective work it turns out easynews uses Path: not-for-mail
    on every single article it hosts - easynews personnel perverted the
    Path: protocol.

    ,-------------------------------------------------------------------
    | "not-for-mail" is a common <tail-entry>.
    `-------------------------------------------------------------------

    https://www.rfc-editor.org/rfc/rfc5537.html#section-3.2.2

    --
    Пу́тін — хуйло́
    http://www.eternal-september.org

    --- SoupGate-Win32 v1.05
    * Origin: fsxNet Usenet Gateway (21:1/5)
  • From Don@21:1/5 to Don on Wed Dec 6 05:32:47 2023
    Don wrote:
    Grant wrote:
    Don wrote:
    suck feeds my private, in-house, newserver.

    Okay.

    I can't help with cleanfeed itself. That being said, I do have the
    following thoughts.

    Path: is unusable as a cleanfeed filter because in my case it always
    contains:

    Path: meow.home.net!not-for mail

    That seems like something I'd try to correct.

    In the end, my innd install perfectly handles the Path: header. After
    too much dectective work it turns out easynews uses Path: not-for-mail
    on every single article it hosts - easynews personnel perverted the
    Path: protocol.

    Danke,

    --
    Don, KB7RPU, https://www.qsl.net/kb7rpu
    There was a young lady named Bright Whose speed was far faster than light;
    She set out one day In a relative way And returned on the previous night.

    --- SoupGate-Win32 v1.05
    * Origin: fsxNet Usenet Gateway (21:1/5)
  • From Tom Furie@21:1/5 to Ray Banana on Wed Dec 6 07:10:20 2023
    Ray Banana <rayban@raybanana.net> writes:

    Thus spake "Don" <g@crcomp.net>

    Don wrote:

    In the end, my innd install perfectly handles the Path: header. After
    too much dectective work it turns out easynews uses Path: not-for-mail
    on every single article it hosts - easynews personnel perverted the
    Path: protocol.

    ,-------------------------------------------------------------------
    | "not-for-mail" is a common <tail-entry>.
    `-------------------------------------------------------------------

    https://www.rfc-editor.org/rfc/rfc5537.html#section-3.2.2

    It's not usually the *whole* path though. xsusenet do that. Perhaps only
    on the reader side, to "protect their users' anonymity" or
    whatever. Their IP address looks to belong to Abavia.

    --- SoupGate-Win32 v1.05
    * Origin: fsxNet Usenet Gateway (21:1/5)