Eric misses the point of Google indexing weblogs, and sometimes ranking them rather high for some searches:
most people would consider google to be a better service if i, and a relatively small number of other people, didn’t get in the way of the information they really want.
He’s referring to the way he tops the list for galaxie 500 window crank by first talking about trying to fix his, and then talking about finding his entry while searching for information. What he’s missing, though, is two things: that’s an okay first shot search phrase, but as soon as he saw that it didn’t work, he should have revised his search, rather than expecting Google to guess right every time, and then searching for something like galaxie 500 “window crank” (note that many of the original results are talking about the window on one car, and the crank(shaft) on another) would point out that he’s number one because Google doesn’t have a damn thing useful for that search.
However, someone else (if they are at least half-bright), searching for that phrase because they have a broken window crank on their Galaxie, could go to Eric’s weblog, track down the entry, see the magic words “i’m fortunate to have the original shop manual”, track down his email address, and ask him if he’d be willing to trade copies of a few pages for some extra parts. That’s quite a bit more useful than the other results, offering to sell one car with a missing window, a Galaxie, and yet another car with a particular crankshaft.
I feel bad about the fact that at this moment I’m Google’s top result for http error 500, not because I’m “clogging up” the results and keeping people from finding useful information, but because I’ve been shown that there isn’t any useful information, and I’m still too lazy to fix the situation. There are a very few half-decent pages listing a bit of information about every HTTP response code, a million or so pages just repeating the explanations from the spec (“The server encountered an unexpected condition which prevented it from fulfilling the request.” – gee, thanks), and none that really understand how the web works today, and provide useful information in a useful way.
In the old days of directories, you would dig through Yahoo to find a link to a single page listing HTTP error codes, and find the one you were looking for, and maybe get lucky and get a helpful explanation as well. That’s not how it goes anymore: people search for something far too specific, and then give up. What the web needs for this particular class of query are separate pages, with the error code and name in the URL and in an <h1>, with an explanation of what it means, why you are seeing it, and what to do next (“something’s screwed up in your script, or your .htaccess, or your server: rename .htaccess and see if it works, then look in your error.log or ask your host to look, or try to remember what you just did to your server”).
Google isn’t saying “sorry, this sucks for what you searched on, but I’m so confused by all this incestuous linking that I just have to give it to you anyway,” what it’s saying is “I don’t have anything useful for that query, but people seem to think this guy knows his way around, and he’s at least used the words in your search, so maybe he linked to something that will prove useful.” So far, I haven’t, but that’s my failure, not Google’s. If you actually look at the sorts of things where Google ranks you “too high”, you’re likely to find that either nobody has anything useful to say, or there’s no way for Google to tell what’s authoritative yet. If three hundred pages include the term Googlewash, but Google hasn’t done a monthly reindex yet, all it can do is show you those, and hope that some highly ranked one linked to the right place. Once it reindexes, it can see that a couple hundred used the term in a link to one page, and that’s probably an authoritative source about it, though to cut down on Googlebombs, a page that also includes the term in the page title will probably come first, even with only fifty inbound “Googlewash” links. Your weblog entry isn’t ranked high for some searches because Google’s confused, it’s ranked high because Google needs your help, and expects you to link to things it hasn’t had a chance to sort out and fully index, or to link to the most useful thing about the keywords that your entry is highlighting. So get to it; it’s damn sure not going to be “content/6/30195.html” providing searchers with useful information and links.