Weblogs: November 2007 Archive Page

Well, Google didn't say "glarbifulous" on its own, but I had a good reason to search the internet for a nonsense word.

In order to confirm my feeling that the Associated Press's preference for "Web log" is far less popular online than the traditional "weblog," I did a quick Google search.

12,900,000 Google hits for ["weblog"]

I expected that. For years, my own blog has been ranked anywhere from 99 to about 180 out of however many hits there are for "weblog," and I've been tracking that number every year when I submit my annual faculty report. I thought that maybe that number was a little lower than I remembered, but I realize that Google's numbers fluctuate as it re-indexes older sites.

I wasn't surprised when I found only a paltry

250,000 Google hits for ["web log"]

... since only AP writers format the term that way. But when I tried to exclude the AP articles that use "web log," I found... 

24,700,000 Google hits for ["web log" -AP]

Why do I get ten times more hits  for what should be a more restrictive search? 

The Googly weirdness does not stop there. When I include AP, why do I get 25,000 more hits than when I exclude it?

275,000 Google hits for ["web log" AP]

The nonsense word "glarbifulous" appears nowhere on the internet (though that will change once Google notices this post). I was quite surprised, then, to see that after excluding "glarbifulous" from my search, I find...

175,000,000 hits for [weblog -glarbifulous]
That's more than ten times as many sites as I get when I don't exclude the nonsense word. How can so many more pages NOT have a word that doesn't exist?

Maybe Google has paid closer attention to the quality of pages that contain the word "weblog," removing a lot of junk results that it figures are pointless. But maybe when I ask Google to do a search for something less popular, it thinks I might actually be interested in some of the sites it would otherwise ignore. Suddenly, every single page in its database that doesn't include "glarbifulous" becomes potentially relevant, since each of those pages has met a criterion that I have specified.

That seems to make sense, but it also seems, well, twisted. I just did a search for "the" by itself, and "the -glarbifulous" and got similar results.... about twice as many hits for the more restrictive search.
Categories: , , , , , ,
According to Rebecca Blood, in early 1999, Jesse James Garrett posted a list of 23 web sites that posted links and brief commentary. The Wayback machine's archive of Garrett's site returns this list from early 2000.

bradlands
bump
camworld
flutterby
genehack
gulker
hack the planet
honeyguide
jjg.net infosift
linkwatcher metalog
ltseek
macronin
nowthis
obscure store
peterme
psyberspace
rasterweb
rc3.org
researchbuzz news
robot wisdom
scripting news
windowseat
whump.com more like this
It might be interesting to see what happened to each of these sites.  When I started blogging later in 1999, I hadn't heard of a single one of these, though I was very familiar with the genre of what was then called the "list of links."  In February of 1998, while working at the University of Toronto' s Engineering Writing Centre, I urged web authors to "Annotate Your Lists of Links."   Later that same year, one of the e-school staff members e-mailed me a link to Arts & Letters Daily, which was precisely that -- an annotated list of links, carefully selected and always worth visiting. 

When I first started blogging in the spring of 1999, I closely copied the format of A&L Daily, which used multiple columns, did not date its entries, and used "[more]" as the link. (I first dated an entry on July 20, 1999, because I was writing about the 1969 moon landing, and I wanted to emphasize that the event took place exactly 30 years earlier, and I've dated every entry since then -- about 5500 separate entries.)

Arts & Letters Daily, which did not focus on technology issues, was not on the early 1999 list of 23 sites that have become accepted as the canonical list of early blogs. There must have been many, many other sites that were not part of this particular subnetwork; I seem to remember Blood's claims that

Before the weblog genre had a name (the term "weblog" was coined by Jorn Barger, 10 years ago next month... his site, "robotwisdom," is one of the canonical 23), home pages had guest books, web-based discussion boards had postings and threads, and in the pre-Google days when new content was hard to find on the web, a "What's New?" page (with a collection of short links) was an important part of large, active websites.  Many sites featured a "link of the day" or a "link of the week," though you often had to click the link to find out what was on the other end.  Dating from about 1995 was the concept of the "Web Ring," which was a standard interface that webmasters put on their home pages, with "next" and "previous" links that went offsite, to other pages in the "ring" (populated by a centrally-hosted database). 

After Googling for a bit, I just learned that the Web Ring concept was invented by Sage Weil, apparently in May 1994. In 1995, he started a company that was eventually bought out by GeoCities, which was in turn bought by Yahoo!  I remember now that the Yahoo! Webrings was a bit controversial, since Yahoo! didn't implement all the features of the original WebRing concept, though recently a Webring 2.0 concept was spun off from Yahoo!

One final note... an undergraduate student of mine, Kirsten Schubert, wrote a term paper on weblogs in 2002, which was well before there was any published scholarship on the subject. It's a good time capsule of what was available at the time -- general articles on hypertext rhetoric and digital authorship. (When teaching that class, I hadn't yet come across Mortensen and Walker's 2002 article, Blogging Thoughts -- the first academic essay focusing on blogs.)
Categories: , , , , , ,
Categories: , , , , , ,

About this Archive

This page is a archive of entries in the Weblogs category from November 2007.

Weblogs: October 2007 is the previous archive.

Weblogs: December 2007 is the next archive.

Find recent content on the main index or look in the archives to find all content.

Powered by Movable Type 4.23-en