Greg R. Notess |
ON THE NET
Refining the Internet in '97
DATABASE, December 1997 |
It seems that almost all public databases are planning some Web access with private databases following suit. The huge intranet market makes access to proprietary databases via the Web an attractive option. Legacy databases move to corporate intranet access. Commercial databases are finding ways to offer Web access with log in or domain restrictions that provide access only to authorized users. Government information is flowing across the Net with national, regional, and local governments jumping quickly into the stream.
Northern Light is not just noteworthy for its ability to search its Special Collection and the Web. Its database of Web sites ranks quite favorably in terms of size with the other major contenders. In addition, it has made a significant advance over other Web search engines with its Custom Folders. These folders are an early attempt at offering an important sorting capability to Web search results. Hits are sorted into several folders that might include keywords, type, and source. This sorting feature is one of the best improvements from Web search engines for making it easier to browse for relevant documents. Northern Light is still under development, but it is a search engine to watch.
Users can also customize AltaVista search options. This new preferences feature can set the advanced or simple search as the default as well as allowing specific displays. It does not use cookies, as does HotBot, to set these options. Instead, it gives a new URL to use for a bookmark.
HotBot also has expanded its search capabilities. One of the most useful is the addition of title searching. This field search for keywords in Web page titles is available as "words in title." Title searching has been available on AltaVista and Infoseek with use of the "title:term" syntax. HotBot added this important field search capability in the drop down menu, which has made it a bit harder to notice, but it will also accept the "title:term" syntax. While HotBot should be commended for adding this capability, unfortunately it does not seem as reliable as the title search from AltaVista and Infoseek. This may have been a one time problem with its title search, but on a number of searches, HotBot failed to find pages that contained the search term in the title. And these pages were found in the HotBot database using other searches.
Another new HotBot search feature is the page type limit. This offers four radio button choices: Any, Front Page, Index Page, and Page Depth. Any is the default option. The Page Depth button uses a default value of three, but it can be changed to other numbers as well. Choosing the Front Page option limits the search to pages that are the central pages on a specific computer. For example, it will find http:// www.name.edu, but not http://www. name.edu/~jsmith/. This is a great way to limit a search and to cut out all those subsidiary pages on a site. The Index Page limit is similar, but can pick up top level pages within a directory or subdirectory, in addition to central pages. The index page limit can find central pages for individuals and departments within a larger site. For example, it would find http://www. name.edu/~jsmith/index.html, but not http://www.name.edu/~jsmith/bio. html. In addition, the Index Page limit will find pages that use either the http://www.name.edu/~jsmith/ or the http://www.name.edu/~jsmith/index. html format (since these refer to the same page). Unfortunately, HotBot may get rid of this new page depth search option before the end of the year, so do not get too accustomed to its presence.
Excite has some similar follow-up search options. It has featured its "More Like This" link for years, but it now also suggests possible related terms that can be added to the search by checking a box next to the term. Excite's big change this past year was to move toward a channel metaphor, structuring much of its content around broad topics. However, that change related to the subject categories more than the straight Web search itself. Its Power Search provides a form to help construct a query, but except for its ability to change the number of results shown, all that it offers can be requested directly in the search box by users who know the proper commands.
Yet even as the total number of Web pages increased, the databases of Web pages seemed to stagnate. Surprisingly, 1997 saw little growth in the overall size of the Web databases. True, some pages were taken down or ceased to exist for other reasons. However, the overall trend has been toward growth. Most of the Web databases reported a similar number of hits or even a few less hits than the same keywords found in 1996. HotBot still claims about 50 million pages and the others have not shown any large growth in the total number of pages.
What does this mean for the information seeker? Be sure not to expect a comprehensive search from any single Web database or even from all of them. There are plenty of unindexed Web pages and information resources. Use the Web search engines as one tool in the information quest, but recognize their limitations and lack of comprehensiveness.
While Microsoft and Netscape argue the merits of their respective approaches, the question that remains to be answered is whether or not and how much the public wants to use push technology. |
The Web browser is now just one part of the Internet software included in the two products. Both have grown so large that home users should expect to wait at least an hour, probably much more, to download the entire installation file. Netscape introduced its new Netscape Communicator software suite midyear. As in 1996, the suite included email and newsgroup capabilities, but the 1997 version made significant improvements in those capabilities. Communicator also includes features for HTML editing, collaboration, calendaring, and scheduling, and push technologies. Internet Explorer's version four counters with many of the same features, but ties all of it in more closely to the Microsoft Windows 95 operating systems and will be even more integrated with Windows 98.
For the information professional, keeping up with the latest versions of these browsers remains an important consideration, since some sites use only the newest features to deliver information content. Except for the push technologies, most of the other new features may have had little impact on the information seeker. They primarily refined navigation ability, HTML display, and layout. Netscape users can enjoy Navigator's new Personal Toolbar that permits the inclusion of personal favorites on one of the top toolbars. This ability to customize the browser desktop makes for even quicker access to your most-used Web sites.
Overall, 1997 has been a year of refinements to the Internet as an information conduit and as an information resource. |
Microsoft's version of push technology, included in its Internet Explorer, is referred to as "channels" and uses its Channel Definition Format. Netscape added its Netcaster component to the Communicator suite. This relies on standard HTML and Javascript to deliver content. While Microsoft and Netscape argue the merits of their respective approaches, the question that remains to be answered is whether or not and how much the public wants to use push technology. While it can be very useful for quickly changing information like stock quotes, is a steady stream of news on your computer more distracting or informative?
The push clients can allow the user to determine the interval at which channels are updated in addition to choosing which channels to subscribe to. The updates can run in the background while the user is working on other projects or using other software on the computer.
Overall, 1997 has been a year of refinements to the Internet as an information conduit and as an information resource. More tools, more content, and more users have become involved with the Net. As the Internet becomes more ubiquitous, it is no longer a question of when to use the Net, but how.
Communications to the author should be addressed to Greg R. Notess, Montana State University Libraries, Bozeman, MT 59717-0332; 406/994-6563; greg@notess.com ; http://www.notess.com.
Copyright © 1997, Online Inc. All rights reserved.