Top > Notes > Spam Report

This page documents the amount of spam I'm getting ... in short, far too much.

There are a number of 'addresses' feeding into this spam archive. One is a single personal address served by a mail server with a high level of spam control. Another is an old personal ISP mail account. The other two are domains which forward all addresses into the address which feeds this archive; one of which is now redundant but which lives on as a proportion of this archive. Finally there is the domain which I run myself which is relatively new.

I make no effort to hide the email addresses, and in fact my current address is published on the web. This may seem daft, but I'll be dammed if I let the spammers push me into hidding.

The script that generates the report is fairly simple and written in Python; it makes use of Ploticus to generate the graphs. It is a very simple script that is not worth publishing especially as it is very setup dependent (and depends on MH mailboxes).

I am currently 'refreshing' the script that generates this page so things may be broken from time to time. Note that any references to 'injection IP address' is very new and may well be completely wrong, but refers to the IP address where the spam was inserted into the mail system.

Statistics

The average size looks high for email, but a distribution graph (too boring to include) shows that only a very small number are large in size; the vast majority are small. However there is a slow increase over time ... a symptom of spammers using more images to avoid text analysis blocking.

Graphs

Officially the overall number of spams per day has an average of around 80; however this is consistently climbing. In fact the end of the graph I am looking at is consistently higher ... more like an average of 200 a day. Some of this increase may have something to do with an occasional message sent to SPAM-L ... spammers are known to do this. However it is probable that the amount of spam is simply growing.

Above shows the size (in Kbytes) of the spam received each day. Just shows how much disk space and bandwidth is wasted by one victim.

Above shows the average size of spam per day. Previously I claimed that it was odd that we were not seeing a gradual increase over time. However it is plain that the increase is here ... cause by spammers starting to use images to get past text blocks. Indeed the images that the spammers are using are intentionally made difficult for OCR software to scan.

This graph shows the average SpamAssassin score over time. I believe that this graph is dominated by local factors ... when SpamAssassin was upgraded, when the Bayes database was purged, etc.

Recipients Report

The following table is a list of what appears to be the address the spammer sent to, and the number of spams that recipient received. The ones show emphasised have been used by myself at times in the past.

Interestingly this shows that the spammers do not just target addresses that they have obtained by web scraping or by even more dubious means, but will send out to guessed at addresses. Interestingly it seems that the software with the 'off by one' error that resulted in mv@meredithm.fsnet.co.uk (i.e. chopped off the first character of the address) is no longer in use ... my newer addresses aren't being chopped.