spamassassin-dev September 2011 archive
Main Archive Page > Month Archives  > spamassassin-dev archives
spamassassin-dev: [Bug 6659] New: Empty Content-Type causes lear

[Bug 6659] New: Empty Content-Type causes learning of binary file

From: <bugzilla-daemon_at_nospam>
Date: Fri Sep 16 2011 - 14:51:50 GMT
To: dev@spamassassin.apache.org

https://issues.apache.org/SpamAssassin/show_bug.cgi?id=6659

             Bug #: 6659
           Summary: Empty Content-Type causes learning of binary file
           Product: Spamassassin
           Version: 3.3.1
          Platform: PC
        OS/Version: Linux
            Status: NEW
          Severity: normal
          Priority: P2
         Component: Learner
        AssignedTo: dev@spamassassin.apache.org
        ReportedBy: spamassassin-bugzilla@joolee.nl
    Classification: Unclassified

I'm receiving a few hundred mails a day with small attachments that are (afaik)
correctly parsed and nothing happens with the attachments when checking the
message to be spam. When autolearning the E-mails as spam, the attachments are
being decoded and parsed by the bayes algorithm. The only strange thing I can
find in the message (appart from the text content obviously being a phishing
mail) is the header of the attachment part:

------=_NextPart_000_0006_01CC51AC.63F30F00
Content-Type: ;
        name="report_1609.pdf.zip"
Content-Transfer-Encoding: base64
Content-Disposition: attachment;
        filename="report_1609.pdf.zip"

I suspect the empty content-type causes the attachment to be decoded.

Running spamassassin in debug mode causes it to hang a the following lines:
Sep 16 15:07:12.279 [8264] dbg: bayes: Using userid: 1
Sep 16 15:08:48.746 [8264] dbg: bayes: seen
(bf76e190b8121487c91051758a402dd20b18eaa6@sa_generated) put

Manually calling sa-learn hangs for a while at the "decoding base64" part:
Sep 16 15:34:12.786 [18308] dbg: message: decoding base64
Forgot tokens from 1 message(s) (1 message(s) examined)
Sep 16 15:35:49.764 [18308] dbg: plugin:
Mail::SpamAssassin::Plugin::Bayes=HASH(0x3891ba0) implements 'learner_close',
priority 0

-- Configure bugmail: https://issues.apache.org/SpamAssassin/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are the assignee for the bug.