Comparing
Open Source IndexersInfomotions Musings; May 29, 2001 by Eric Lease
Morgan
Describes the history and features of eight open-source search engines, freeWAIS-sf
(aging code and hard to install, but good for searching email and public domain
etexts); Harvest (powerful gathering features for
frequently-changing data stores, good with structured documents); ht://Dig
(tricky to configure, no phrase searching, automatic stemming and match word
highlighting); Isearch (weak documentation and
support, easy to install, dated interface, Z39.50 support); MPS
Information Server (zippy indexing of both text and structured data, Z39.50
support, Perl API, limited documentation); SWISH-E
(simple to install engine, CGIs in Perl and PHP still beta, good for HTML
pages, recognizes new META tags, sorts results by field; WebGlimpse
(easy to install and configure, requires commercial version for customized
output); Yaz/Zebra (mainly Z39.50, no Perl API, mainl
y a toolkit to index and respond to distributed client queries). Article also
points out that chaotic information is less than helpful and encourages organization,
structure and vocabulary control.
WebGlimpse
-- Combining Browsing and SearchingUsenix presentation, January 10,
1997
Contents of a presentation at the Usenix Technical Conference in early 1997,
points out some major issues of context and location in searching that have
not really been addressed by other tools.