Contents of /trunk/bits/list-update-200803

Revision 163
Tue Mar 4 00:03:22 2008 UTC (5 years, 2 months ago) by tviehmann
File size: 6552 byte(s)
think positive, use less -
1 From: whoever send it
2 To: debian-devel-announce@lists.debian.org
3 Subject: Bits from the listmaster team
4
5 Hi everyone,
6
7 the listmaster team is constantly trying to improve the setup of our
8 listserver. Thus, quite a few things have happened since our last update
9 in September of last year. Here are some highlights:
10
11
12 lists.debian.org moved to a new hosting location
13 ------------------------------------------------
14
15 lists.debian.org has been moved to man-da[2] to avoid several problems
16 with DNSBLs listing our list server. We'd like to thank Brainfood[1]
17 for hosting lists.debian.org and dealing with the insanity of spam
18 reporters for so long. After the move of the list service to the new
19 machine, we also decided to move the list archives to that machine
20 (which means the list archives are on the same machine as the MX, and
21 consequently suffer fewer delays).
22
23 If you haven't already, please add lists.d.o's new IP address, 82.195.75.100,
24 to your whitelists.
25
26
27 New list archive search engine
28 ------------------------------
29 With the move of lists.debian.org to the new hosting, we took the
30 opportunity to deploy a new search[3] based on Xapian Omega. The
31 index comprises some 3.5 million messages, approximately 100k of which
32 are estimated to be spam. In order to improve your search experience,
33 we have prepared the indexing software to benefit from our spam removal
34 plan (see below).
35
36 As before, searching by author and list is supported, but the new
37 search should be more language- and encoding-aware. Work is in
38 progress to provide our adaptations to upstream and implement
39 improvements based on our experience and the hints we got from the
40 friendly people at Xapian.
41
42
43 Config cleanup
44 --------------
45 The config cleanup is another big project which seems to turn into an
46 ongoing task. Since the last update we decided to unify some global
47 files for all lists, and move all list-specific config to extra files.
48 (This follows the layout the inventors of smartlist, our mailing list
49 software, had in mind.)
50
51 We also want to move some information like moderation status or maximum
52 mail size per message to a global file, which is also used by the
53 list archive and some more informational or statistical tools.
54
55 To check whether lists are configured correctly, we subscribed an address
56 to all 182 mailing lists and checked back a month later for the ham/spam
57 ratio and other anomalies. We found some wrong spam rules, which led to
58 false positives, and other 'backdoors' which bypassed some of our spam
59 rules, which led to false negatives. We also found lists which are
60 supposed to carry only informational mails from an automatic system, so
61 we could tighten the rules, and on the other hand we could drop the
62 usual spam filters for those lists, so distribution gets faster and we
63 need fewer CPU/memory resources to get one mail through.
64
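The ham/spam check described above could be tallied roughly as follows. This is only a minimal sketch: the function name, the input format (pairs of list name and a spam flag) and the sample data are assumptions for illustration, not the listmasters' actual tooling.

```python
from collections import Counter


def ham_spam_ratio(messages):
    """Tally ham and spam per list.

    `messages` is an iterable of (list_name, is_spam) pairs (a hypothetical
    input format). Returns {list_name: (ham, spam, spam_share)}.
    """
    ham = Counter()
    spam = Counter()
    for list_name, is_spam in messages:
        (spam if is_spam else ham)[list_name] += 1

    report = {}
    for name in sorted(set(ham) | set(spam)):
        total = ham[name] + spam[name]
        report[name] = (ham[name], spam[name], spam[name] / total)
    return report


# Made-up sample data for a test subscription.
sample = [
    ("debian-devel", False),
    ("debian-devel", False),
    ("debian-devel", True),
    ("debian-vote", True),
]
report = ham_spam_ratio(sample)
```

Lists with an unusually high spam share would then be candidates for a closer look at their rules.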
65 We also implemented the usual 'Precedence' and 'List-*'-headers on all
66 lists (we had some lists where those were missing) and automatic
67 responses, so we are now a little more net-friendly with our service.
68
69 While reviewing things we found that our bounce handling had some
70 issues, see the next section for information about that.
71
72 Better bounce handling
73 ----------------------
74 We checked our bounce handling because we have more than 500 bounces
75 for some lists, and in the process found that we didn't have working
76 bounce handling for other lists (other-*, deity, *-digest,
77 debian-private). There were also problems in handling and recognizing
78 mail addresses containing = or ! characters.
79
80 Bounces of debian-private subscriptions are still manually handled by the
81 listmasters, but we now address these issues and forward such addresses to
82 da-manager@debian.org.
83
84 To address the other mailing lists we rewrote some parts of our
85 bounce handler.
86
87 While analysing the bounces streaming in, we found that a lot of bounces
88 are caused by content filters which reject list mail back to us (which
89 violates the RFC). Even worse: the majority of those are false
90 positives.
91
92 To let those people know, we'll implement a notification system, which
93 will notify users about bounces, and remind forcibly removed users about
94 their unsubscription.
95
96 This is a service for people with a temporarily unavailable or
97 broken mailbox, so they see that they (or their provider) have a broken
98 mail setup and can resubscribe to all lists after their mail address is
99 functional again. These notifications will be sent out at most
100 once a week, up to a month after the last unsubscription happened.
101
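The notification schedule (at most once a week, for up to a month after the last unsubscription) can be sketched like this. The function and constant names are hypothetical, and treating "a month" as 30 days is an assumption; the real notification code may differ.

```python
from datetime import datetime, timedelta

# Assumed values: "once a week" as 7 days, "a month" as 30 days.
NOTICE_INTERVAL = timedelta(days=7)
NOTICE_WINDOW = timedelta(days=30)


def should_notify(now, last_unsubscription, last_notice=None):
    """Return True if a reminder may be sent to a removed subscriber."""
    # Stop reminding once the window after the last unsubscription is over.
    if now - last_unsubscription > NOTICE_WINDOW:
        return False
    # Never send more than one notice per interval.
    if last_notice is not None and now - last_notice < NOTICE_INTERVAL:
        return False
    return True


unsubscribed = datetime(2008, 3, 4)
first = should_notify(datetime(2008, 3, 5), unsubscribed)
too_soon = should_notify(datetime(2008, 3, 8), unsubscribed,
                         last_notice=datetime(2008, 3, 5))
week_later = should_notify(datetime(2008, 3, 12), unsubscribed,
                           last_notice=datetime(2008, 3, 5))
expired = should_notify(datetime(2008, 4, 10), unsubscribed,
                        last_notice=datetime(2008, 3, 26))
```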
102 Both notification systems are in testing now and will be activated
103 shortly after this mail.
104
105
106 List archive spam
107 -----------------
108 As avid followers of debian-project will know, we have implemented
109 support to weed spam out of the www list archives. While we want to
110 get rid of as much spam as possible, our paramount objective in this
111 effort is to preserve the integrity of the archive (e.g. keeping URLs
112 constant for past messages and avoiding removal of non-spam mail). This
113 means that the submissions we receive from users clicking on the
114 spam-report button of the list archive must be verified manually and
115 each nomination has to be checked independently by several people.
116 Some 1000 spam messages have been deleted from the archives of
117 debian-java, debian-project, debian-python, and debian-vote.
118 To help out or learn more please visit our wiki page[4].
119
120 How to help
121 -----------
122 You can help us in a few important areas:
123
124 * Spam rules -- If you notice spam getting through the spam filters
125 and have ideas for improving our filters, we accept patches to our
126 rulesets, which are publicly available via svn.[5]
127
128 * Encoding issues -- If you notice encoding problems of messages sent
129 after November 2007 in the archive, please contact
130 listmaster@lists.debian.org with a link to the problematic message
131 and an explanation of the problem.
132
133 * Avoid bouncing spam -- If you don't want your MTA to accept spam,
134 please just discard it instead of 550'ing, at least when a message
135 comes from liszt.debian.org.
136
137 * Troubleshooting -- If you notice a problem with a message that
138 you've sent to a mailing list which hasn't arrived, please provide
139 us with as much information as possible, including Date/Time (UTC),
140 From, To, Message-Id, delivering IP, and the logfile entries from
141 the delivering host.
142
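For the troubleshooting item above, the header fields we ask for can be pulled out of a saved copy of the message. The following is a small sketch using Python's standard email module; the function name and the sample message are made up for illustration. The delivering IP and the logfile entries are not part of the message itself and have to come from the delivering host.

```python
from datetime import timezone
from email import message_from_string
from email.utils import parsedate_to_datetime


def report_fields(raw_message):
    """Collect Date (converted to UTC), From, To and Message-Id."""
    msg = message_from_string(raw_message)
    sent_utc = None
    if msg["Date"]:
        sent = parsedate_to_datetime(msg["Date"])
        if sent.tzinfo is None:
            # A "-0000" zone yields a naive datetime that is already UTC.
            sent = sent.replace(tzinfo=timezone.utc)
        sent_utc = sent.astimezone(timezone.utc).strftime("%Y-%m-%d %H:%M:%S")
    return {
        "Date (UTC)": sent_utc,
        "From": msg["From"],
        "To": msg["To"],
        "Message-Id": msg["Message-Id"],
    }


# Made-up example message.
raw = (
    "Date: Tue, 04 Mar 2008 01:03:22 +0100\n"
    "From: user@example.org\n"
    "To: debian-devel@lists.debian.org\n"
    "Message-Id: <20080304000322.GA1234@example.org>\n"
    "Subject: test\n"
    "\n"
    "body\n"
)
fields = report_fields(raw)
```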
143 [1] http://www.brainfood.com
144 [2] http://www.man-da.de
145 [3] http://lists.debian.org/search.html
146 [4] http://wiki.debian.org/Teams/ListMaster/ListArchiveSpam
147 [5] svn://svn.debian.org/svn/pkg-listmaster/trunk/spamassassin_config
