mac.column.ted: Searching for good searching
Ted Landau
November 2008
How is it that the Web sites of many of the world's biggest and most technology-oriented companies have such pitiful search capabilities? Searching these Web sites can be a bit like taking off in a jumbo jetliner and discovering that the aircraft's primary navigation tool is an AAA TripTik.
I first added a search capability to MacFixIt back in the 90's. It didn't cost me a penny. I got it from a site that offered the code for free. As crude as it was, it worked better than some of the search engines I have dealt with in recent months. The problems with most of these search engines boil down to one of two extremes: either they provide too many (mostly irrelevant) results or they provide no (or almost no) results at all. Either way, you don't find what you're looking for.
At the one extreme, the search facilities seem incapable of separating wheat from chaff. Typically, these search engines have only a bare-bones set of features, failing even to include Boolean searching or the ability to search for an exact phrase by putting the text in quotes. On such sites, if you searched for "Apple TV," you might wind up with dozens of results. Unfortunately, they would mostly be pages that match only the term "Apple" or only the term "TV" or, at best, both "Apple" and "TV" but in separate unrelated paragraphs. If you're lucky, buried in all of this might be one result that actually talks about the Apple TV device. Finding it, however, could be harder than locating Osama bin Laden.
At the other extreme are search engines that too often come up empty, citing "No results found." Or perhaps they'll grudgingly offer one or two hits, which would be fine except that the results bear no relation to your search. Imagine if you went to a site devoted to current television shows, searched for "CSI" and found no matches. You'd be justifiably shocked. Surely, at least one of the CSI shows must be mentioned on at least one page of the site. In fact, you may recall having been to such a page on a previous visit to the site. As it turns out, your memory is correct. The problem is that the search facility is unable to give you the existing links. This example is made up, but I have had real-life situations that were not much different.
As a troubleshooter, I spend significant time searching the support pages of various technology sites. Somewhat surprisingly, these sites are far from immune from the sorts of search failures just described. Sometimes, the problem is that these sites don't have adequate support information in the first place. However, more often, the search facility is the primary culprit.
For starters, take Microsoft's Mactopia Help and How-To site. It could serve as an example of problems at both extremes. As a semi-random test, I picked Mactopia's Entourage Support page. Things looked bad right from the start. There were no sign of any "advanced" search options. Indeed, I quickly confirmed that putting search terms in quotes had no effect.
My first search was for "mail does not open." The search engine came back with 20 matches. The top five matches were: Set up an e-mail account automatically; Create and send a message; Verify and rebuild the Entourage database; Set up a POP e-mail account manually; and Label a message as junk or not junk. Only the database rebuilding article even vaguely related to what I wanted to know. However, the text of this article did not actually include my search terms. It was hard to figure out why any of the articles turned up in the results.
I tried a second time, using a different variation: "can't open message." The top response now was Entourage and Exchange: powerful partners. Optimistic, but hardly what I was hoping for. Looking down the list, I eventually found one article that dealt with an inability to open/view attachments. Close enough to be taken seriously as a match although not a direct hit. Could it be that Microsoft truly had no articles on my sought symptom? Perhaps. Still, if that's the case, I'd rather accurately get no results than a bunch of useless ones.
For my next test, I made sure in advance that there was a relevant match to be found. As it turns out, Microsoft's main (not Mactopia) support site is not just for Windows; it includes support for Microsoft Office 2008 for Mac. Unfortunately, a Mactopia search apparently doesn't check the main site (which is a problem all by itself!). The main site, however, does have its own Microsoft Support search engine. This is what I now intended to use. Perusing the main site, but not doing any search, I eventually came across a list of Office 2008 for Mac support articles. I picked one of the articles, titled You experience some issues when you use Entourage 2008 for Mac after you use the Time Machine backup tool. The question now was, assuming I didn't know this article existed, could I find it from a search? I entered "Time Machine backup" as the search terms. Given that these words were in the title of the article, I assumed I had placed a good bet for getting a match. I was wrong. Only one article came up in the results. And it was a useless article titled Description of Office 2008 for Mac Service Pack 1 (12.1.0).
I don't mean to pick on Microsoft. Many other corporate Web sites aren't any better. For example, I have had similar headaches at Comcast's Web site. In this case, I searched for an actual symptom I was having: when I attempted to access On Demand, I got a "communications error." I tried Comcast's search engine both with the terms "on demand communications error" and "on demand error." The former term was a dead end; the top result was Comcast and Best Buy Offer The Ultimate HD Movie Experience At Home. The latter term yielded a more promising result: a top item titled Where can I get help for Comcast Games on Demand issues? But even this was ultimately unsuccessful. I eventually wound up at a page with a link to a useful-sounding article; unfortunately, the link didn't work.
Entering other search term variations fared no better. When I tried "On Demand trouble," for example, I got 85 results, none of them relevant. The item that made it to the top this list did so because it contained the following sentence: "Available July 30th On Demand, Harold and Kumar get in trouble trying to sneak a bong onboard." Sounds like fun, but not likely to resolve my communications error.
I finally stumbled across an AskComcast page. Using it, I asked "What is an On Demand communication error?" This, at last, yielded positive results, pointing me to an item that described an "unable to process request" error. Reasonably close to what I wanted to know. I counted this as a success, but it should have been much easier to get here.
Apparently in recognition of the anemic quality of their in-house searches, many Web sites have switched to using a custom version of Google for their search facility. A wise choice. Actually, if you simply go to Google's own site and search from there, you'll likely get better results than from the non-Google search engines on these other sites. For example, when I entered "On Demand' Comcast 'communications error'" at Google, all of the top hits pointed to the solution I sought -- although none of them were from the Comcast site. This can be disappointing if you really want an official statement from the vendor, but is otherwise much better than the alternative.
Do the people responsible for these inferior search engines ever try to use them? Are they aware of how pathetic they are? Do they realize how difficult it is for the typical visitor to actually find the information they seek, even if the information is on the site? Do they even care? Or is it all part of some intentional and cynical plot? I don't have the answers yet. But I do know that I won't find them by searching these vendors' Web sites.
Resources
As a webmaster I have to say it's not My fault. The decision makers only want quick flashy sites and don't give a flip about usability or content. It is a shame.
<p>As a webmaster, it should be part of your job to explain to them <em>why they should care</em>.</p>
<p>I know it's an uphill battle, but you are the person with the knowledge and skill about the topic, and it is your job to help your employer understand the parts of your job that are relevant to their decision-making.</p>
Another problem is online vendors. For example, in many cases a search for "wireless mouse" will return every computer that comes with a mouse, and every wireless router, as well as wireless mice. Obviously the search is a logical "or".
One solution that often works is to use Google, and use the "site:" operator to restrict the search. In the search in Google, type "wireless mouse site:vendor.com". This does not always work; Some sites do not allow search engines to index them.
To Wal-Mart's credit, they have very recently made improvements to their search engine, and it is MUCH better. (After years and years, thanks for finally noticing!)
Part of what you're talking about is a known fundamental problem with search that even Google has not solved in a practical way. It's a question of key words vs. actual attributes (meta data). The proposed solution is called the Semantic Web. See http://en.wikipedia.org/wiki/Semantic_Web
Go to Google.com, click "Advanced Search". At the bottom of the advanced search enter the specific domain to be searched in "Search within a site or domain". Not a bad option, however, it does not appear to allow adding additional domains to that search.
Even if a Web site has a good, well-written search function, it will never be as fast as Google, because Web sites run on servers and Google search runs on one of the world's most powerful distributed supercomputers.
If I want to find something on a Web site, I <i>never</i> use that site's built-in search. I always use Google. Just put the Google site: command into a search and cha-ching!
For example, if you want to find hammers at the Home Depot Web site, forget the Home Depot search tool. Go to Google and type
hammer site:homedepot.com
If you want to find articles about the Apple TV on CNet News, don't use the CNet search command. Instead, go to Google and type
"apple tv" site:cnet.com
Hell, I use this to search for things on my own blog, which has almost two thousand entries. Rather than sifting through my own blog's search function, if I'm looking for something I'll go to Google and type
prolactin site:tacit.livejournal.com
and there it is. That's all there is to it!
I often search on discussion forums, where the internal search for whatever bizarre reason often requires four or more characters in a word for a search to work. This is aside from the fact that they don't always understand Boolean or quoted searches.
But I'd like to see the results in chronological order based on time of a posting, but is there a way to make Google realize it's searching a forum and organize the results that way?
Not as far as I know. Google doesn't return results chronologically even in a forum.
Really??
Try this: spray paint site:homedepot.com
and see if any of the four or five returns show you anything about the hundreds of spray paint cans that they carry.
I fat fingered the search term above. It should read:
"spray paint" site:homedepot.com
"spray paint" site:homedepot.com
Results 1 - 10 of about 46,500 from homedepot.com for "spray paint". (0.17 seconds)
I'm not sure why you say you get "four or five returns". I get 46,500 returns.
Of course, "spray paint" isn't a good search term, because it will include not only cans of spray paint but also tools for spray painting, books on spray paint techniques, and so on. If you are looking for colors of spray paint, or types of spray paint, it's helpful to search smarter. "spray paint" rustoleum site:homedepot.com will, for example, help narrow the range.
It also doesn't help that a lot of Home Depot's product information is in Flash movies, which are difficult to search. Even the Home Depot bhe built-in search function for "spray paint" returns nothing but tools and machines for painting, rather than types of spray paint. The first returned hit on the built-in search is "3M Paint Spray/Pesticide Respirator Assembly Medium 4/cs," which is probably not what a person searching for "spray paint" is most likely to want.
Homedepot.com is a badly designed, poorly organized Web site with information grouped and presented in ways that make it either inaccessible to search or difficult to search. The built-in search is pretty dismal; Google is only a little better.
Ted, your third paragraph nails it - this happens to me on Google far, far too often. I thought search engines were designed to be time-savers. Far from it.
photographer--2008 - thanks for the tip.
---
Knowledge is limited. Imagination encircles the world. - Albert Einstein
You can also set up a Custom Google Search that returns links only from a set of sites you define.
A FileMaker Custom Google Search is here
http://www.searchfilemaker.com/
I set up a Custom Google Search for Photoshop Tutorials here
http://www.google.com/coop/cse?cx=001044389222327874554%3Avyrbwm-k_bi
The process is free and simple, the only difficult part is coming up with the websites to be included.
.
The current practice makes the categories under the "Show..." box a bit pointless.
Recent searching for espresso machines was an improvement. Sears and Lowe's had one of the better searching results, providing quite a few results. In Walmart's store, however, I found models that are not found in searching the Walmart site.
Last week I searched for a Nuvi 880 on Amazon's site after reading Consumer Reports. Amazon did not provide any results for this model last week because someone at Amazon had spelled 'Nuvi' as 'Nvi' or 'N vi', and I found it by searching for '880.' Somehow, today it appears in a search for 'nuvi 880' at Amazon, but it provides 20 other GPS models, including some from other manufacturers who do not use 'nuvi' in their names.
Consequently, most of my searches start with Google. For these commercial sites, it is a steep learning curve. If they don't get it, then they don't get the business or the dollars.
<p>great idea to use google instead; just add the vendor's domain as filter to your search so it will return only hits from their site.</p>
- by ahb1 December 11, 2008 1:58 AM PST
- Speaking of search engines, I was shocked (and boy do I mean shocked) to discover that Yahoo and AltaVista are much faster to index web pages than is Google. I even tested this discovery by putting some unique, four word phrases, on a web page, and checked each day to see who found it first. Yahoo usually found the phrase in about a week. AltaVista soon followed. Google took 4-8 weeks (and sometimes many months) to crawl that page and get it indexed. I had no idea that such was the case.
- Like this Reply to this comment
-
(20 Comments)I also discovered that if you want information on how to make or build something, Yahoo is far superior as it gives you all of the Flickr hits which I found was the kind of information I was looking for.