Hi all,
I've noticed yEnc-encoded articles in some newsgroups of the Big-Eight
(have a look at soc.culture.french for instance). Examples:
<17ba4ef578674e9c$60891$141478$64d91c8e@news.vipernews.com>
<1O6IN.329500$7uxe.279980@fx09.ams1>
Wouldn't it be worthwhile having NoCeM notices of type "binary" or the
like to help clean these unwanted articles out of non-binary newsgroups?
Naturally, other kinds of "binary" content could also be covered by these
notices, and not only yEnc.
Just asking, in case a current NoCeM issuer would be interested in
adding such filters. (I'm not going to send NoCeM notices.)
> Just asking, in case a current NoCeM issuer would be interested in
> adding such filters. (I'm not going to send NoCeM notices.)

Why don't you send NoCeM messages?
Do you filter yEnc out?
> Just asking, in case a current NoCeM issuer would be interested in
> adding such filters. (I'm not going to send NoCeM notices.)

That looks pretty easy to filter out, but I'm not seeing these on my
servers due to another "feature" of the articles. I'm happy to add
filtering for yEnc as I don't serve binary groups on my servers, so
this would only check text newsgroups.

I'll get on that in a few days, but I'll check here first in case
someone has reasons that I should not do so.
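
As a rough illustration of why this is easy to filter: a yEnc-encoded part begins with a header line of the form `=ybegin line=... size=... name=...`. The sketch below only looks for that header line in an article body. The function name `looks_like_yenc` and the regular expression are assumptions made for illustration; this is not how Cleanfeed or any existing NoCeM issuer does it.

```python
import re

# A yEnc part starts with a header line such as
#   =ybegin line=128 size=123456 name=picture.jpg
# (multi-part posts add "part=" and "total=" fields before "line=").
# Requiring both "size=" and "name=" avoids flagging text that merely
# mentions the keyword at the start of a line.
YBEGIN_RE = re.compile(r"^=ybegin\b.*\bsize=\d+.*\bname=", re.MULTILINE)


def looks_like_yenc(body: str) -> bool:
    """Return True if the article body contains a yEnc header line."""
    return YBEGIN_RE.search(body) is not None
```

A check like this would only be applied to text newsgroups, as described above; it says nothing about other encodings such as uuencode or Base64.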
Julien <iulius@nom-de-mon-site.com.invalid> wrote:
> Wouldn't it be worthwhile having NoCeM notices of type "binary" or the
> like to help clean these unwanted articles out of non-binary newsgroups?
I don't understand. Isn't misplaced binary content addressed at the
Cleanfeed filter?
Julien ÉLIE stated the following:
> I've noticed yEnc-encoded articles in some newsgroups of the Big-Eight
> (have a look at soc.culture.french for instance).
I don't have these articles on my server.
Same here. After looking deeper, these seem mostly in groups that my
servers do not carry, and if they are carried, the articles are filtered
by cleanfeed (before spamassassin in my setup).

Seeing that Ray seems to carry these groups, and looks like he's doing a
great job identifying the articles, I'm going to delay diving into this
issue. Maybe take some time to work with Perl without tearing my hair out
first :)
Thus spake Retro Guy <retroguy@novabbs.org>
> Same here. After looking deeper, these seem mostly in groups that my
> servers do not carry, and if they are carried, the articles are filtered
> by cleanfeed (before spamassassin in my setup).
That is also the case here. I just added a check for binary articles to
filter_first (before all tests) to add the articles to the NoCeM queue
and then continue with the normal cleanfeed processing. I have, however,
added a filter to eliminate the most obvious bogus group names like
"a.b.something".
> Seeing that Ray seems to carry these groups, and looks like he's doing a
> great job identifying the articles, I'm going to delay diving into this
> issue. Maybe take some time to work with Perl without tearing my hair out
> first :)
;-)
PS: You seem to have an apprentice spam boy on i2pn2: <uu8uit$3h91i$1@i2pn2.org>
> I just added a check for binary articles to
> filter_first (before all tests) to add the articles to the NoCeM queue
> and then continue with the normal cleanfeed processing. I have, however,
> added a filter to eliminate the most obvious bogus group names like
> "a.b.something".
Incidentally, in your NoCeM notices, wouldn't it be useful to list all
the newsgroups the articles were posted to? Only the first one is
currently written, whereas the others could for instance be written on
subsequent lines starting with whitespace, or on the same line. (I agree
it would lead to longer messages or lines.)
I think some newsgroups should be marked as allowing binaries or HTML.
<CAOLa=ZSo7ngBUxkfR+EEojhr4a-mM+3=f-P1H36hnhJukEqGVA@mail.gmail.com>
in linux.kernel.git was caught in the Bot-misplaced_binary filter but
looks like a valid article.
As for <XMJON.158253$t8cc.153345@fx06.iad> in alt.binaries.clip-art,
which was only posted to that newsgroup, maybe it should be considered
valid as posted in a newsgroup with a "binaries" component.
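
One way to picture the suggestion: treat a binary article as misplaced only if at least one of its newsgroups does not allow binaries. The sketch below assumes a group allows binaries when its name has a "binaries" component or when it appears in a locally maintained allowlist. Both function names and the allowlist are hypothetical and do not come from Cleanfeed or from the Bot-misplaced_binary filter.

```python
def allows_binaries(group: str, extra_allowed: frozenset = frozenset()) -> bool:
    """A group allows binaries if one of its dot-separated components is
    "binaries", or if the admin listed it explicitly (hypothetical allowlist)."""
    return "binaries" in group.split(".") or group in extra_allowed


def is_misplaced_binary(newsgroups: list, extra_allowed: frozenset = frozenset()) -> bool:
    """True if a binary article was posted to at least one group that does
    not allow binaries."""
    return any(not allows_binaries(g, extra_allowed) for g in newsgroups)
```

With this rule, an article posted only to alt.binaries.clip-art is not flagged, while the same article crossposted to soc.culture.french is.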
I'm using News::Article::NoCeM from CPAN to generate NoCeM messages and
it puts each additional newsgroup on a separate line starting with a TAB
and ending with CRLF, which led to people (wrongly) complaining about the
structure of my messages. Currently, I'm testing a patch for
News::Article::NoCeM that will put all newsgroups on the same line as the
M-ID, with a TAB between the M-ID and the first newsgroup and a blank
between the individual group names.
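
The two layouts described above can be sketched as follows. This is only an illustration of the line structure; it is not the code of News::Article::NoCeM or of the patch. The function names are made up, line terminators are left out, and the TAB between the Message-ID and the first group in the multi-line layout is an assumption.

```python
def entry_multiline(message_id: str, newsgroups: list) -> list:
    """Older layout: first group next to the Message-ID (separator assumed
    to be a TAB), each additional group on its own line starting with a TAB."""
    first, *rest = newsgroups
    return [f"{message_id}\t{first}"] + [f"\t{group}" for group in rest]


def entry_single_line(message_id: str, newsgroups: list) -> str:
    """Patched layout: Message-ID, a TAB, then all groups separated by blanks."""
    return f"{message_id}\t{' '.join(newsgroups)}"
```

For an article crossposted to two groups, the first function returns two lines and the second returns one line of the form `<id>`, TAB, `group1 group2`.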
> I think some newsgroups should be marked as allowing binaries or HTML.
> <CAOLa=ZSo7ngBUxkfR+EEojhr4a-mM+3=f-P1H36hnhJukEqGVA@mail.gmail.com>
> in linux.kernel.git was caught in the Bot-misplaced_binary filter but
> looks like a valid article.
My filter makes use of the is_binary() function in Cleanfeed, which in
turn relies on some configuration variables. The problem in the case of
the linux.kernel.git messages is that some of them have a Content-Type
of multipart/mixed with the PGP signature included as a Base64-encoded
attachment.
Sounds great with a one-line list of newsgroups, separated with a
space, thanks.
FYI, it will be useful with the perl-nocem program shipped with the
next release of INN (2.7.2), as I have added the possibility to process
only a subset of the Message-IDs within a notice, according to specific
rules set by the news admin (a sort of local function, called like the
ones in cleanfeed.local). Having the whole list of newsgroups will make
it possible, for instance, to process only the Message-IDs of articles
posted to a newsgroup actually carried by the server. It also allows more
complex cases, like processing NoCeM notices for only a subset of
newsgroups (if someone does not want to cancel anything in some
newsgroups), or ignoring notices from "john" or of a given type, except
for a subset of newsgroups.
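
To make the idea concrete, here is a minimal sketch of such a local rule, written in Python purely for illustration. It is not perl-nocem's actual interface, and the function name, its parameters and the data layout are all hypothetical. It keeps a Message-ID only if the article was posted to at least one carried newsgroup and to none of the groups where the admin does not want anything cancelled.

```python
def select_message_ids(entries, carried, protected=frozenset()):
    """entries: iterable of (message_id, [newsgroups]) taken from one notice.
    carried: set of newsgroups the local server carries.
    protected: set of newsgroups in which nothing should be cancelled.
    Returns the Message-IDs the notice should be applied to."""
    selected = []
    for message_id, newsgroups in entries:
        if any(group in protected for group in newsgroups):
            continue  # admin opted out of NoCeM processing for this group
        if any(group in carried for group in newsgroups):
            selected.append(message_id)
    return selected
```

A rule like this only works when the notice lists every newsgroup of each article, which is why the one-line list of newsgroups matters.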
Is this an issue worth opening upstream with Cleanfeed, to fix the
is_binary() function?

> Sounds great with a one-line list of newsgroups, separated with a
> space, thanks.
Done now.
> FYI, it will be useful with the perl-nocem program shipped with the
> next release of INN (2.7.2) as I have added the possibility to only
> process a subset of Message-IDs within a notice, according to specific
> rules by the news admin (sort of a local function called like in
> cleanfeed.local).
Is that the -i option in perl-nocem (I'm using INN 2.8 snapshots)?
> Is this an issue worth opening upstream with Cleanfeed, to fix the
> is_binary() function?
Cleanfeed from GitHub does not handle Content-Type: multipart/mixed
except for HTML, so it was my own fault, obviously. A quick fix is
applied now, but is_binary() still misses lots of binary attachments
encapsulated in separate entities.
I think I will make Cleanfeed more MIME-aware (MIME::Parser) and add
local config variables for allowed/disallowed MIME types when I find the
time.
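
A MIME-aware check along those lines might look like the following sketch. It is written in Python with the standard `email` package rather than Perl's MIME::Parser, and the function name `has_disallowed_part` and the `ALLOWED_TYPES` set are assumptions standing in for the planned local config variables. It walks every entity of the article and flags it if any leaf part has a content type outside the allowed set, so a Base64-encoded PGP signature inside multipart/mixed passes while an attached image does not.

```python
from email import message_from_string

# Hypothetical local configuration: MIME types accepted in text newsgroups.
ALLOWED_TYPES = frozenset({"text/plain", "application/pgp-signature"})


def has_disallowed_part(raw_article: str, allowed=ALLOWED_TYPES) -> bool:
    """Return True if any non-multipart entity of the article has a
    content type that is not in the allowed set."""
    message = message_from_string(raw_article)
    for part in message.walk():
        if part.is_multipart():
            continue  # containers are judged by the parts they hold
        if part.get_content_type() not in allowed:
            return True
    return False
```

This only inspects declared content types, so it would complement rather than replace a body scan for encodings such as yEnc, which are typically posted as plain text.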