[info] [IP] The Future of Internet Immune Systems
Eugen Leitl
<eugen at leitl.org> on
Thu Nov 22 07:42:00 UTC 2007
----- Forwarded message from David Farber <dfarber at cs.cmu.edu> -----
From: David Farber <dfarber at cs.cmu.edu>
Date: Wed, 21 Nov 2007 18:23:14 -0500
To: ip at v2.listbox.com
Subject: [IP] The Future of Internet Immune Systems
X-Mailer: Apple Mail (2.915)
Reply-To: dave at farber.net
Begin forwarded message:
From: dewayne at warpspeed.com (Dewayne Hendricks)
Date: November 20, 2007 6:56:29 PM EST
To: Dewayne-Net Technology List <xyzzy at warpspeed.com>
Subject: [Dewayne-Net] The Future of Internet Immune Systems
The Future of Internet Immune Systems
Written by Cory Doctorow
11/19/2007
<http://www.internetevolution.com/author.asp?section_id=479&doc_id=139358&
>
Bunhill Cemetery is just down the road from my flat in London. It’s a
handsome old boneyard, a former plague pit (“Bone hill” -- as in,
there are so many bones under there that the ground is actually kind
of humped up into a hill). There are plenty of luminaries buried there
-- John “Pilgrim’s Progress” Bunyan, William Blake, Daniel Defoe, and
assorted Cromwells. But my favorite tomb is that of Thomas Bayes, the
18th-century statistician for whom Bayesian filtering is named.
Bayesian filtering is plenty useful. Here’s a simple example of how
you might use a Bayesian filter. First, get a giant load of non-spam
emails and feed them into a Bayesian program that counts how many
times each word in their vocabulary appears, producing a statistical
breakdown of the word-frequency in good emails.
Then, point the filter at a giant load of spam (if you’re having a
hard time getting a hold of one, I have plenty to spare), and count
the words in it. Now, for each new message that arrives in your inbox,
have the filter count the relative word-frequencies and make a
statistical prediction about whether the new message is spam or not
(there are plenty of wrinkles in this formula, but this is the general
idea).
The beauty of this approach is that you needn’t dream up “The Big
Exhaustive List of Words and Phrases That Indicate a Message Is/Is Not
Spam.” The filter naively calculates a statistical fingerprint for
spam and not-spam, and checks the new messages against them.
This approach -- and similar ones -- are evolving into an immune
system for the Internet, and like all immune systems, a little bit
goes a long way, and too much makes you break out in hives.
ISPs are loading up their network centers with intrusion detection
systems and tripwires that are supposed to stop attacks before they
happen. For example, there’s the filter at the hotel I once stayed at
in Jacksonville, Fla. Five minutes after I logged in, the network
locked me out again. After an hour on the phone with tech support, it
transpired that the network had noticed that the videogame I was
playing systematically polled the other hosts on the network to check
if they were running servers that I could join and play on. The
network decided that this was a malicious port-scan and that it had
better kick me off before I did anything naughty.
It only took five minutes for the software to lock me out, but it took
well over an hour to find someone in tech support who understood what
had happened and could reset the router so that I could get back online.
[snip]
-------------------------------------------
Archives: http://v2.listbox.com/member/archive/247/=now
RSS Feed: http://v2.listbox.com/member/archive/rss/247/
Powered by Listbox: http://www.listbox.com
----- End forwarded message -----
--
Eugen* Leitl <a href="http://leitl.org">leitl</a> http://leitl.org
______________________________________________________________
ICBM: 48.07100, 11.36820 http://www.ativel.com http://postbiota.org
8B29F6BE: 099D 78BA 2FD3 B014 B08A 7779 75B0 2443 8B29 F6BE
More information about the info
mailing list