





|
YOU KNOW you're in an advanced civilisation when one human being
can change the world. But what are you in when one computer can change
the world?
Mountain View
And Moses went up from the plains of Moab unto the mountain of Nebo,
to the top of Pisgah, that is over against Jericho. And the LORD
shewed him all the land ... unto the utmost sea ... and the LORD said
unto him, This is the land which I sware unto Abraham, unto Isaac, and
unto Jacob, saying I will give it unto thy seed.
-- Deuteronomy XXXIV i-iv
IF AN alien had landed on earth thirty years ago and said, not
"Take me to your leader", but "Take me to your knowledge" (and why
are all fictional aliens politicians, not academics?), then
he would swiftly have been shown towards a copy of the
Encyclopaedia Britannica. There was no place you could point
to and call Knowledge, but Britannica was the closest mankind
had to offer.
If he landed today and made the same demand, though, we would
produce for him a computer with an Internet connection, and we'd enter
the address
http://www.altavista.digital.com. And (I'll say straightaway) he
still wouldn't get something we could point to and say "All human
knowledge is here" -- but he'd be a lot closer.
And what is Alta Vista? Well, if you've already got here you
probably know, but for completeness I'll say that Alta Vista is the
most powerful computer that Digital have ever built. It contains ten
processors and 6Gb of RAM (at current prices that's £150,000 worth,
though that's without reckoning on a quantity discount). It is used to
run Digital's database software, and in fact was conceived as a way of
publicising this software (and also Digital's ability to build
computers this powerful, should anyone else ever ever need one).
Its mission, though, is what's important. It could have been
adapted from The Hitch-Hiker's Guide To The Galaxy:
Alta Vista: |
What is this great task for which I, Alta Vista, the greatest
computer in the History of the Digital Equipment Corporation, have
been called into existence?
|
Programmer: |
O Alta Vista Computer, the task we have designed you to perform
is this. We want you to tell us ... the Answers!
|
Alta Vista: |
The Answers? The Answers to what?
|
Programmer: |
Life! The Universe! Everything!
|
Alta Vista: |
Tricky.
|
Programmer: |
But can you do it?
|
Alta Vista: |
Yes. I can do it.
|
Alta Vista's mission is to create a searchable index of every word on
every page on the whole of the World Wide Web. A bunch of less awesome
computers take upon themselves the task of fetching once each Web page
there is, and updating the database accessed by the main server.
This is an endeavour of utter grandeur. Mankind still has a large
body of what might be called "legacy knowledge", knowledge that exists
only on paper (for instance, the 1966 alien's Britannica). But already
most truly new knowledge appears on the Web as it is created. Alta
Vista's effect is wider than that of a new gizmo on a computer
somewhere. Its breadth of "reading" means it is becoming nothing less
than a instantly searchable concordance to the entire of human
knowledge. It's not "a" database. It's the database.
If you work with the Internet as much as I do, you'll need, like
me, to step back and pause awhile as the enormity of that prospect
sinks in. Never before in human history could such a thing have
existed. Like the atomic bomb, it cannot be unmade. The collective
consciousness of humanity (or, if you don't feel like being that
mystic, the individual's sense of community and importance)
must adjust to encompass it.
In the olden days, when a physicist wanted a partial differential
equation solved, a student was called in to solve it. At the end of
three years of appalling and unremitting slog, the physicist had the
answer and the student had a PhD in physics. In modern times a
computer can give the answer in a fraction of the time, freeing
physics PhD students to do more entertaining things, such as read
their friends' Web pages.
The Web, and Alta Vista, promise an overthrow of the current world
order in the textual and subjective disciplines matching that brought
in the numerical disciplines by the single computer. No longer does it
take a PhD's worth of work to discover what Shakespeare thought about
astronomy: a search on Alta Vista can provide references to each time
he mentions it. (Of course, Alta Vista automates only the
data-gathering phase of such a PhD, just as Fortran automates only the
results-gathering phase of a PhD in theoretical physics. But
historically these, and not the demanding and intellectually
interesting phases, have often been the greater parts in time and
effort of both species of PhD.)
Alta Vista also provides a "gold standard" of knowledge: a place
where you can be sure something is, if it exists at all. This in itself
is not new: Great Britain (like most countries) operates "copyright
libraries" which is an unfortunate name for a system whereby a certain
set of libraries (the British Library, the National Libraries of
Scotland and Wales, and the libraries of Cambridge and Oxford
universities and of Trinity College, Dublin) receive one copy each of
every book which is published.
As I'm a graduate of Cambridge University, I'm in theory allowed to
use the University Library. I don't reckon I ever will, though. I went
in there once, when I was an undergraduate, seeking a certain piece of
information. The UL is a vast building, which reputedly served as the
inspiration for the design of Deep Thought in the television
Hitch-Hiker's Guide To The Galaxy. It is grim and Orwellian
in architecture, and, once you get in, the staff prove to be equally
grim and Orwellian. I had a lot of time on my hands, so I eventually
found the right section, pulled out in turn each of the five or so
books on the topic, and found nothing. Searching the books took ten
minutes; gaining admittance and finding them had taken all afternoon.
Fortunately, this most reclusive piece of information was nothing
to do with my degree; I was reading Alan Garner's novel Red
Shift and wanted to know how to use the Lewis Carroll cipher to
decode a message given in the endpapers.
(At least I never felt in fear for my life in the UL, which I
certainly did in the lift [I was told it was called a paternoster lift]
in Birmingham University library. Is that still there? What is
their accident insurance like?)
Searching Alta Vista for "+carroll +cipher", on the other hand,
gives the right answer immediately. By the time Alta Vista had been
invented, though, I'd eventually noticed some odd things about letter
frequencies in the message and derived for myself the still, small
fact which the UL hadn't provided: the Lewis Carroll cipher is the
same as the Vigenère cipher.
Sadly, though, Alta Vista is not complete, and for reasons more
depressing than the unavailability of the three big commercial
information sources: Britannica, the OED and
Larousse Gastronomique. It seems that the Alta Vista team
underestimated the sheer size of the undertaking. Their help pages
still optimistically say that the database is continually being
rebuilt and is completely refreshed every "few days"; some documents
it serves, though, are six months old (one way you can tell is by
documents which contain CGI script includes giving the server's local
date and time), and www.iota.co.uk has been submitted several times
over a month with no sign that Alta Vista has ever visited here.
But this is not to detract from the magnitude of Alta Vista's
influence. Whether Alta Vista is halfway there, or only a tenth, it
provides compelling reason for completing the task. If Digital are
feeling they've proved their point and are disinclined to spend
further money on this "advertising campaign", then surely someone else
will take up the challenge; perhaps even the British Government -- has
there yet been suggested any better project for the Millenium
Commission to spend the Lottery money on? (Enhancing the provision of
Knowledge with the proceeds from the Lottery, a direct tax on
Stupidity: now that's creative government!)
Knowledge has changed. I don't think it's over-egging the pudding to
say that the availability of an instantly accessible full-text index of
all human textual output is the biggest change to come over knowledge
since the invention of the printing press. And maybe it's bigger,
because you can mass-produce information all you like if it doesn't end
up on the desk of the person who needs it, who might never have known
it existed.
In days of long eld, an academic might have made a pilgrimage of
knowledge to Rome or another great library of antiquity; now that
printing has been invented, the months that would take have been cut
to the day or half-day it takes to visit the UL. Two orders of
magnitude improvement. But a search on Alta Vista takes seconds:
three orders of magnitude further improvement.
We are faced with an age in which knowledge is no longer parochial.
It must be like this being a Catholic: the idea that there is an
ultimate recourse, a highest court of knowledge, a point where answers
become definite. Alta Vista isn't quite an omnipope, of course, as it
will sometimes say
No documents match the query.
and I suspect his Holiness the Pope isn't in the habit of
turning round to enquiring cardinals and telling them he just doesn't
know. But at least when Alta Vista gives in, we will know -- as no
enquirer of any previous generation could ever have known for certain
-- that what we ask is beyond the ultimate limits of the corpus of
human knowledge. The era in world history in which it's credible or
useful to say "I don't know" is passing. In the future the only
opposite of "I know" will be "Nobody knows".
-- Peter Hartley, 9th April 1996
Afterword: 26th April 1996 Alta Vista has now got round
to indexing Iota's site! Seven weeks after the URL was submitted,
and eight days after Scooter first fetched robots.txt,
Iota's pages and mine started showing up in searches. I'm very pleased
about this, not only for reasons you'll know if you've read the above
essay, but also because traffic to our site has doubled as a
result of people finding us in Alta Vista searches!
Afterword: 7th May 1996 Digital have themselves put together
a more prosaic, but by the same token more informative, explanation of Alta Vista.
Afterword: 3rd June 1996 The people who built Alta Vista: do
they lie awake at night thinking, "We've changed the shape of
knowledge, we've changed world civilisation"? I suppose, seeing as I
lay awake last night thinking about them thinking that, that they
must.
Afterword: 5th August 1996
Date: Sun, 04 Aug 1996 21:05:45 -0700
From: Timothy McGinnis <ntsgrocr@cwnet.com>
To: pdh@utter.chaos.org.uk
Subject: No documents match the query.
Yours is the only page that Alta Vista returns to
the query (No documents match the query.) I just
thought you might find that interesting.
All Rites Reversed -- Copy What You
Like
|